Scan your hardware and find compatible Ollama LLMs
Project description
ollama-scout
One command to find the right LLMs for your hardware.
ollama-scout scans your GPU VRAM, CPU, and RAM, then recommends compatible Ollama models grouped by use case — with an interactive guided mode for new users and full CLI flags for power users.
Demo
$ ollama-scout
ollama-scout v0.2.0 | LLM Hardware Advisor (Interactive Mode)
Welcome! Let's find the best LLMs for your hardware.
Press Enter to scan your system, or Ctrl+C to exit.
System Hardware
┌─────────────────┬────────────────────────────────────────┐
│ OS │ Linux │
│ CPU │ AMD Ryzen 9 5900X │
│ Cores / Threads │ 12 cores / 24 threads │
│ RAM │ 32.0 GB │
│ GPU │ NVIDIA RTX 3080 (10.0 GB VRAM) │
└─────────────────┴────────────────────────────────────────┘
What are you mainly using this for?
1. All categories 2. Coding 3. Reasoning 4. Chat
Coding Models
Model Tag Quant Size Fit Mode Status
deepseek-coder 6.7b Q4_K_M 3.8GB Excellent GPU Available
codellama 7b Q4_K_M 3.8GB Excellent GPU Pulled
Would you like to compare two models? [y/N]:
Save results as a Markdown report? [y/N]:
Quick Start
# Recommended
pipx install ollama-scout
# Or with pip
pip install ollama-scout
# Or from source
git clone https://github.com/sandy-sp/ollama-scout.git
cd ollama-scout && pip install -e .
Then run:
ollama-scout # interactive guided mode — no flags needed
Requires Python 3.10+ and Ollama installed.
Features
| Feature | Description |
|---|---|
| Interactive guided mode | Step-by-step session with no flags needed (ollama-scout or -i) |
| Hardware detection | NVIDIA, AMD ROCm, Apple Silicon unified memory, multi-GPU |
| Live + offline models | Fetches from Ollama API with 24hr cache; --offline uses built-in list |
| Smart scoring | GPU > Multi-GPU > CPU+GPU offload > CPU-only, with time estimates |
| Use-case grouping | Coding, Reasoning, Chat with per-category tables |
| Real benchmark timing | Measures actual tokens/sec on pulled models via ollama run |
| Model comparison | --compare model1 model2 for side-by-side analysis with verdict |
| Model detail view | --model NAME shows all variants scored against your hardware |
| Markdown export | --export saves a formatted report |
| Persistent config | XDG-compliant paths with --config and --config-set |
| Auto-pull | Pull recommended models interactively or via --pull |
CLI Reference
| Flag | Description | Example |
|---|---|---|
-i, --interactive |
Launch guided mode (default with no args) | ollama-scout -i |
--use-case |
Filter by category | --use-case coding |
--flat |
Flat list instead of grouped tables | --flat |
--top N |
Limit number of results | --top 20 |
--offline |
Use built-in fallback model list | --offline |
--benchmark |
Show inference speed estimates | --benchmark |
--model NAME |
Detail view for a specific model | --model deepseek-coder |
--compare M1 M2 |
Side-by-side model comparison | --compare llama3.2 mistral |
--export |
Auto-export Markdown report | --export |
--output PATH |
Export to a specific file | --output ~/report.md |
--pull MODEL |
Pull a model via ollama | --pull llama3.2:3b |
--update-models |
Force-refresh model list cache | --update-models |
--config |
Show current configuration | --config |
--config-set K=V |
Set a config value | --config-set offline_mode=true |
--no-pull-prompt |
Skip interactive pull prompt | --no-pull-prompt |
--version |
Show version | --version |
Run ollama-scout --help for the full list. See docs/USAGE.md for detailed examples.
How Scoring Works
| Fit | Mode | Meaning |
|---|---|---|
| Excellent | GPU | Model fits fully in single GPU VRAM |
| Excellent | Multi-GPU | Model distributed across multiple GPUs |
| Good | CPU+GPU | Partially offloaded to RAM — usable but slower |
| Possible | CPU | CPU-only inference with time estimate |
| (excluded) | — | Model too large for available memory |
Apple Silicon: unified memory is treated as VRAM, so M1/M2/M3/M4 Macs get GPU-tier scoring with a 4GB system reserve.
Roadmap
- Interactive guided mode
- Real benchmark timing (
ollama run) - Model comparison mode (
--compare) - Multi-GPU support
- XDG-compliant config paths
- Model list caching (24hr TTL)
- pip installable on PyPI
- Live streaming benchmarks with progress bar
- Model search and filter by keyword
- Web UI version
- Config profiles (work / gaming / minimal)
Contributing
Contributions welcome! See CONTRIBUTING.md for setup instructions and guidelines.
License
MIT — see LICENSE.
Generated by ollama-scout
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ollama_scout-0.3.0.tar.gz.
File metadata
- Download URL: ollama_scout-0.3.0.tar.gz
- Upload date:
- Size: 46.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1ceb626853022021d7341cffefe509f18fa70ca6d11c8cb991300ccf626df9da
|
|
| MD5 |
a766521d4a8b19097a2ea9469e81d318
|
|
| BLAKE2b-256 |
5c5f43bb6f9afac77b8c5dcbddb307a5e575e5d261d3a97442e4e22be941d948
|
Provenance
The following attestation bundles were made for ollama_scout-0.3.0.tar.gz:
Publisher:
release.yml on sandy-sp/ollama-scout
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ollama_scout-0.3.0.tar.gz -
Subject digest:
1ceb626853022021d7341cffefe509f18fa70ca6d11c8cb991300ccf626df9da - Sigstore transparency entry: 983469936
- Sigstore integration time:
-
Permalink:
sandy-sp/ollama-scout@a964b92d635d3ab85ca12e68bd85d54d77e7b592 -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/sandy-sp
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@a964b92d635d3ab85ca12e68bd85d54d77e7b592 -
Trigger Event:
push
-
Statement type:
File details
Details for the file ollama_scout-0.3.0-py3-none-any.whl.
File metadata
- Download URL: ollama_scout-0.3.0-py3-none-any.whl
- Upload date:
- Size: 35.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a1e4f10a9ad160d077b4395dbeea59d258a2ade8fa63eed69c5c7cd19d875373
|
|
| MD5 |
713cba6addf95733927a9125b6047488
|
|
| BLAKE2b-256 |
913e8341763a980bddcac51fab2b2d6cfdec93686bbf363bf6fcc7c4ea6a7c4b
|
Provenance
The following attestation bundles were made for ollama_scout-0.3.0-py3-none-any.whl:
Publisher:
release.yml on sandy-sp/ollama-scout
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ollama_scout-0.3.0-py3-none-any.whl -
Subject digest:
a1e4f10a9ad160d077b4395dbeea59d258a2ade8fa63eed69c5c7cd19d875373 - Sigstore transparency entry: 983469943
- Sigstore integration time:
-
Permalink:
sandy-sp/ollama-scout@a964b92d635d3ab85ca12e68bd85d54d77e7b592 -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/sandy-sp
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@a964b92d635d3ab85ca12e68bd85d54d77e7b592 -
Trigger Event:
push
-
Statement type: