Scan your hardware and find compatible Ollama LLMs
Project description
ollama-scout
Scan your hardware. Find the right LLMs. Pull them instantly.
ollama-scout is a cross-platform CLI tool that detects your GPU VRAM, CPU, and RAM, then recommends compatible Ollama models grouped by use case.
Demo
╭──────────────────────────────────────────────────────────╮
│ ollama-scout | LLM Hardware Advisor │
╰──────────────────────────────────────────────────────────╯
System Hardware
╭─────────────────┬────────────────────────────────────────╮
│ OS │ Linux │
│ CPU │ AMD Ryzen 9 5900X │
│ Cores / Threads │ 12 cores / 24 threads │
│ RAM │ 32.0 GB │
│ GPU │ NVIDIA RTX 3080 (10.0 GB VRAM) │
╰─────────────────┴────────────────────────────────────────╯
Coding Models
Model Tag Quant Size Fit Mode Status
deepseek-coder 6.7b Q4_K_M 3.8GB Excellent GPU Available
codellama 7b Q4_K_M 3.8GB Excellent GPU Pulled
qwen2.5-coder 7b Q4_K_M 4.4GB Excellent GPU Available
Reasoning Models
deepseek-r1 7b Q4_K_M 4.7GB Excellent GPU Available
phi4 14b Q4_K_M 8.4GB Good CPU+GPU Available
Chat Models
llama3.2 3b Q4_K_M 2.0GB Excellent GPU Pulled
mistral 7b Q4_K_M 4.1GB Excellent GPU Available
How It Works
1. Scan Detects GPU VRAM, CPU cores/threads, RAM
Supports NVIDIA, AMD (ROCm), Apple Silicon unified memory
|
2. Fetch Pulls latest models from Ollama library API
Falls back to built-in list if offline
|
3. Score Matches each model variant to your hardware
GPU fit > CPU+GPU offload > CPU-only > excluded
|
4. Recommend Groups results by use case: Coding, Reasoning, Chat
Shows fit label, run mode, and pull status
Installation
git clone https://github.com/sandy-sp/ollama-scout.git
cd ollama-scout
pip install -r requirements.txt
Requires Python 3.10+ and Ollama installed.
Usage
python main.py # Full scan, grouped by use case
python main.py --use-case coding # Filter by use case
python main.py --flat # Flat list instead of grouped
python main.py --top 20 # Show top 20 results
python main.py --offline # Use built-in model list (no network)
python main.py --benchmark # Show inference speed estimates
python main.py --model deepseek-coder # Detail view for a specific model
python main.py --export # Auto-export to Markdown report
python main.py --output ~/report.md # Export to specific path
python main.py --pull llama3.2:latest # Pull a model directly
python main.py --no-pull-prompt # Skip interactive pull prompt
python main.py --config # Show current config
python main.py --config-set offline_mode=true # Set a config value
See docs/USAGE.md for the full guide with platform-specific notes and FAQ.
Features
- Hardware detection — GPU VRAM, CPU, RAM on Windows, macOS, Linux
- Apple Silicon support — Treats unified memory as VRAM for accurate scoring
- Live + offline modes — Fetches from Ollama API or uses built-in fallback list
- Smart recommendations — Full GPU / partial CPU+GPU offload / CPU-only scoring
- Use-case grouping — Coding, Reasoning, Chat
- Benchmark estimates — Rough tokens/sec estimation per model
- Model detail view — Deep dive into a specific model's variants and compatibility
- Already-pulled detection — Highlights models you've downloaded
- Auto-pull — Pull a recommended model interactively
- Markdown export — Save results as a formatted report
- Config file — Persistent defaults via
~/.ollama-scout.json
Requirements
| Package | Purpose |
|---|---|
rich |
Terminal UI (tables, panels, spinners) |
requests |
Fetch Ollama library API |
psutil |
Cross-platform RAM detection |
Roadmap
- Config file support (~/.ollama-scout.json)
- pip installable package (
ollama-scoutCLI command) - GPU benchmark integration (real
ollama runtiming) - Model comparison mode (side-by-side two models)
- XDG config path support (~/.config/ollama-scout/)
- Web UI version
Contributing
PRs welcome! Especially for:
- Better use-case mapping for new models
- Multi-GPU scoring improvements
- Additional platform testing (Windows ARM, Linux ARM)
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ollama_scout-0.1.0.tar.gz.
File metadata
- Download URL: ollama_scout-0.1.0.tar.gz
- Upload date:
- Size: 24.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c4690add76d9e938c22282173c03108fc17155259ab219ebe925f510c850675a
|
|
| MD5 |
6e523374ab05e97232e925350c6298c5
|
|
| BLAKE2b-256 |
3becfdfc7439f825df06bc87193de920a313e12f8f9f4a3b772ae2fc7b9377d0
|
Provenance
The following attestation bundles were made for ollama_scout-0.1.0.tar.gz:
Publisher:
release.yml on sandy-sp/ollama-scout
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ollama_scout-0.1.0.tar.gz -
Subject digest:
c4690add76d9e938c22282173c03108fc17155259ab219ebe925f510c850675a - Sigstore transparency entry: 961950622
- Sigstore integration time:
-
Permalink:
sandy-sp/ollama-scout@6692b67d4ff3f2c8a7facf8ec0c8d732982f4b0b -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/sandy-sp
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@6692b67d4ff3f2c8a7facf8ec0c8d732982f4b0b -
Trigger Event:
push
-
Statement type:
File details
Details for the file ollama_scout-0.1.0-py3-none-any.whl.
File metadata
- Download URL: ollama_scout-0.1.0-py3-none-any.whl
- Upload date:
- Size: 22.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
be6afa7b98466090592ede78f8f01f77ed06ca2217a100050188aca5335c3d6f
|
|
| MD5 |
d6e5bc0f235508f732bc548ce12852d5
|
|
| BLAKE2b-256 |
6b78ffeb19a5ae005713d3d7f754f1a07a8d921e788cd72c3a127015b11d6bed
|
Provenance
The following attestation bundles were made for ollama_scout-0.1.0-py3-none-any.whl:
Publisher:
release.yml on sandy-sp/ollama-scout
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ollama_scout-0.1.0-py3-none-any.whl -
Subject digest:
be6afa7b98466090592ede78f8f01f77ed06ca2217a100050188aca5335c3d6f - Sigstore transparency entry: 961950650
- Sigstore integration time:
-
Permalink:
sandy-sp/ollama-scout@6692b67d4ff3f2c8a7facf8ec0c8d732982f4b0b -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/sandy-sp
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@6692b67d4ff3f2c8a7facf8ec0c8d732982f4b0b -
Trigger Event:
push
-
Statement type: