Scan your hardware and find compatible Ollama LLMs

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

sandy-sp

These details have not been verified by PyPI

Project description

ollama-scout

One command to find the right LLMs for your hardware.

ollama-scout scans your GPU VRAM, CPU, and RAM, then recommends compatible Ollama models grouped by use case — with an interactive guided mode for new users and full CLI flags for power users.

Demo

$ ollama-scout

  ollama-scout v0.2.0  |  LLM Hardware Advisor  (Interactive Mode)

  Welcome! Let's find the best LLMs for your hardware.
  Press Enter to scan your system, or Ctrl+C to exit.

                   System Hardware
  ┌─────────────────┬────────────────────────────────────────┐
  │ OS              │ Linux                                  │
  │ CPU             │ AMD Ryzen 9 5900X                      │
  │ Cores / Threads │ 12 cores / 24 threads                  │
  │ RAM             │ 32.0 GB                                │
  │ GPU             │ NVIDIA RTX 3080 (10.0 GB VRAM)         │
  └─────────────────┴────────────────────────────────────────┘

  What are you mainly using this for?
    1. All categories    2. Coding    3. Reasoning    4. Chat

  Coding Models
  Model             Tag    Quant    Size    Fit        Mode     Status
  deepseek-coder    6.7b   Q4_K_M   3.8GB   Excellent  GPU      Available
  codellama         7b     Q4_K_M   3.8GB   Excellent  GPU      Pulled

  Would you like to compare two models? [y/N]:
  Save results as a Markdown report? [y/N]:

Quick Start

# Recommended
pipx install ollama-scout

# Or with pip
pip install ollama-scout

# Or from source
git clone https://github.com/sandy-sp/ollama-scout.git
cd ollama-scout && pip install -e .

Then run:

ollama-scout          # interactive guided mode — no flags needed

Requires Python 3.10+ and Ollama installed.

Features

Feature	Description
Interactive guided mode	Step-by-step session with no flags needed (`ollama-scout` or `-i`)
Hardware detection	NVIDIA, AMD ROCm, Apple Silicon unified memory, multi-GPU
Live + offline models	Fetches from Ollama API with 24hr cache; `--offline` uses built-in list
Smart scoring	GPU > Multi-GPU > CPU+GPU offload > CPU-only, with time estimates
Use-case grouping	Coding, Reasoning, Chat with per-category tables
Real benchmark timing	Measures actual tokens/sec on pulled models via `ollama run`
Model comparison	`--compare model1 model2` for side-by-side analysis with verdict
Model detail view	`--model NAME` shows all variants scored against your hardware
Markdown export	`--export` saves a formatted report
Persistent config	XDG-compliant paths with `--config` and `--config-set`
Auto-pull	Pull recommended models interactively or via `--pull`

CLI Reference

Flag	Description	Example
`-i`, `--interactive`	Launch guided mode (default with no args)	`ollama-scout -i`
`--use-case`	Filter by category	`--use-case coding`
`--flat`	Flat list instead of grouped tables	`--flat`
`--top N`	Limit number of results	`--top 20`
`--offline`	Use built-in fallback model list	`--offline`
`--benchmark`	Show inference speed estimates	`--benchmark`
`--model NAME`	Detail view for a specific model	`--model deepseek-coder`
`--compare M1 M2`	Side-by-side model comparison	`--compare llama3.2 mistral`
`--export`	Auto-export Markdown report	`--export`
`--output PATH`	Export to a specific file	`--output ~/report.md`
`--pull MODEL`	Pull a model via ollama	`--pull llama3.2:3b`
`--update-models`	Force-refresh model list cache	`--update-models`
`--config`	Show current configuration	`--config`
`--config-set K=V`	Set a config value	`--config-set offline_mode=true`
`--no-pull-prompt`	Skip interactive pull prompt	`--no-pull-prompt`
`--version`	Show version	`--version`

Run ollama-scout --help for the full list. See docs/USAGE.md for detailed examples.

How Scoring Works

Fit	Mode	Meaning
Excellent	GPU	Model fits fully in single GPU VRAM
Excellent	Multi-GPU	Model distributed across multiple GPUs
Good	CPU+GPU	Partially offloaded to RAM — usable but slower
Possible	CPU	CPU-only inference with time estimate
(excluded)	—	Model too large for available memory

Apple Silicon: unified memory is treated as VRAM, so M1/M2/M3/M4 Macs get GPU-tier scoring with a 4GB system reserve.

Roadmap

Interactive guided mode
Real benchmark timing (ollama run)
Model comparison mode (--compare)
Multi-GPU support
XDG-compliant config paths
Model list caching (24hr TTL)
pip installable on PyPI
Live streaming benchmarks with progress bar
Model search and filter by keyword
Web UI version
Config profiles (work / gaming / minimal)

Contributing

Contributions welcome! See CONTRIBUTING.md for setup instructions and guidelines.

License

MIT — see LICENSE.

Generated by ollama-scout

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

sandy-sp

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.0

Feb 23, 2026

0.2.1

Feb 21, 2026

0.2.0

Feb 19, 2026

0.1.1

Feb 19, 2026

0.1.0

Feb 18, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ollama_scout-0.3.0.tar.gz (46.5 kB view details)

Uploaded Feb 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ollama_scout-0.3.0-py3-none-any.whl (35.1 kB view details)

Uploaded Feb 23, 2026 Python 3

File details

Details for the file ollama_scout-0.3.0.tar.gz.

File metadata

Download URL: ollama_scout-0.3.0.tar.gz
Upload date: Feb 23, 2026
Size: 46.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ollama_scout-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`1ceb626853022021d7341cffefe509f18fa70ca6d11c8cb991300ccf626df9da`
MD5	`a766521d4a8b19097a2ea9469e81d318`
BLAKE2b-256	`5c5f43bb6f9afac77b8c5dcbddb307a5e575e5d261d3a97442e4e22be941d948`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ollama_scout-0.3.0.tar.gz:

Publisher: release.yml on sandy-sp/ollama-scout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ollama_scout-0.3.0.tar.gz
- Subject digest: 1ceb626853022021d7341cffefe509f18fa70ca6d11c8cb991300ccf626df9da
- Sigstore transparency entry: 983469936
- Sigstore integration time: Feb 23, 2026
Source repository:
- Permalink: sandy-sp/ollama-scout@a964b92d635d3ab85ca12e68bd85d54d77e7b592
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/sandy-sp
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@a964b92d635d3ab85ca12e68bd85d54d77e7b592
- Trigger Event: push

File details

Details for the file ollama_scout-0.3.0-py3-none-any.whl.

File metadata

Download URL: ollama_scout-0.3.0-py3-none-any.whl
Upload date: Feb 23, 2026
Size: 35.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ollama_scout-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a1e4f10a9ad160d077b4395dbeea59d258a2ade8fa63eed69c5c7cd19d875373`
MD5	`713cba6addf95733927a9125b6047488`
BLAKE2b-256	`913e8341763a980bddcac51fab2b2d6cfdec93686bbf363bf6fcc7c4ea6a7c4b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ollama_scout-0.3.0-py3-none-any.whl:

Publisher: release.yml on sandy-sp/ollama-scout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ollama_scout-0.3.0-py3-none-any.whl
- Subject digest: a1e4f10a9ad160d077b4395dbeea59d258a2ade8fa63eed69c5c7cd19d875373
- Sigstore transparency entry: 983469943
- Sigstore integration time: Feb 23, 2026
Source repository:
- Permalink: sandy-sp/ollama-scout@a964b92d635d3ab85ca12e68bd85d54d77e7b592
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/sandy-sp
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@a964b92d635d3ab85ca12e68bd85d54d77e7b592
- Trigger Event: push

ollama-scout 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

ollama-scout

Demo

Quick Start

Features

CLI Reference

How Scoring Works

Roadmap

Contributing

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance