Skip to main content

Scan your hardware and find compatible Ollama LLMs

Project description

ollama-scout

Python 3.10+ License: MIT Platform

Scan your hardware. Find the right LLMs. Pull them instantly.

ollama-scout is a cross-platform CLI tool that detects your GPU VRAM, CPU, and RAM, then recommends compatible Ollama models grouped by use case.

Demo

╭──────────────────────────────────────────────────────────╮
│  ollama-scout  |  LLM Hardware Advisor                   │
╰──────────────────────────────────────────────────────────╯

                   System Hardware
╭─────────────────┬────────────────────────────────────────╮
│ OS              │ Linux                                  │
│ CPU             │ AMD Ryzen 9 5900X                      │
│ Cores / Threads │ 12 cores / 24 threads                  │
│ RAM             │ 32.0 GB                                │
│ GPU             │ NVIDIA RTX 3080 (10.0 GB VRAM)         │
╰─────────────────┴────────────────────────────────────────╯

  Coding Models
  Model             Tag      Quant    Size    Fit        Mode     Status
  deepseek-coder    6.7b     Q4_K_M   3.8GB   Excellent  GPU      Available
  codellama         7b       Q4_K_M   3.8GB   Excellent  GPU      Pulled
  qwen2.5-coder    7b       Q4_K_M   4.4GB   Excellent  GPU      Available

  Reasoning Models
  deepseek-r1       7b       Q4_K_M   4.7GB   Excellent  GPU      Available
  phi4              14b      Q4_K_M   8.4GB   Good       CPU+GPU  Available

  Chat Models
  llama3.2          3b       Q4_K_M   2.0GB   Excellent  GPU      Pulled
  mistral           7b       Q4_K_M   4.1GB   Excellent  GPU      Available

How It Works

1. Scan       Detects GPU VRAM, CPU cores/threads, RAM
              Supports NVIDIA, AMD (ROCm), Apple Silicon unified memory
                                    |
2. Fetch      Pulls latest models from Ollama library API
              Falls back to built-in list if offline
                                    |
3. Score      Matches each model variant to your hardware
              GPU fit > CPU+GPU offload > CPU-only > excluded
                                    |
4. Recommend  Groups results by use case: Coding, Reasoning, Chat
              Shows fit label, run mode, and pull status

Installation

git clone https://github.com/sandy-sp/ollama-scout.git
cd ollama-scout
pip install -r requirements.txt

Requires Python 3.10+ and Ollama installed.

Usage

python main.py                              # Full scan, grouped by use case
python main.py --use-case coding            # Filter by use case
python main.py --flat                       # Flat list instead of grouped
python main.py --top 20                     # Show top 20 results
python main.py --offline                    # Use built-in model list (no network)
python main.py --benchmark                  # Show inference speed estimates
python main.py --model deepseek-coder       # Detail view for a specific model
python main.py --export                     # Auto-export to Markdown report
python main.py --output ~/report.md         # Export to specific path
python main.py --pull llama3.2:latest       # Pull a model directly
python main.py --no-pull-prompt             # Skip interactive pull prompt
python main.py --config                     # Show current config
python main.py --config-set offline_mode=true  # Set a config value

See docs/USAGE.md for the full guide with platform-specific notes and FAQ.

Features

  • Hardware detection — GPU VRAM, CPU, RAM on Windows, macOS, Linux
  • Apple Silicon support — Treats unified memory as VRAM for accurate scoring
  • Live + offline modes — Fetches from Ollama API or uses built-in fallback list
  • Smart recommendations — Full GPU / partial CPU+GPU offload / CPU-only scoring
  • Use-case grouping — Coding, Reasoning, Chat
  • Benchmark estimates — Rough tokens/sec estimation per model
  • Model detail view — Deep dive into a specific model's variants and compatibility
  • Already-pulled detection — Highlights models you've downloaded
  • Auto-pull — Pull a recommended model interactively
  • Markdown export — Save results as a formatted report
  • Config file — Persistent defaults via ~/.ollama-scout.json

Requirements

Package Purpose
rich Terminal UI (tables, panels, spinners)
requests Fetch Ollama library API
psutil Cross-platform RAM detection

Roadmap

  • Config file support (~/.ollama-scout.json)
  • pip installable package (ollama-scout CLI command)
  • GPU benchmark integration (real ollama run timing)
  • Model comparison mode (side-by-side two models)
  • XDG config path support (~/.config/ollama-scout/)
  • Web UI version

Contributing

PRs welcome! Especially for:

  • Better use-case mapping for new models
  • Multi-GPU scoring improvements
  • Additional platform testing (Windows ARM, Linux ARM)

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ollama_scout-0.1.0.tar.gz (24.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ollama_scout-0.1.0-py3-none-any.whl (22.0 kB view details)

Uploaded Python 3

File details

Details for the file ollama_scout-0.1.0.tar.gz.

File metadata

  • Download URL: ollama_scout-0.1.0.tar.gz
  • Upload date:
  • Size: 24.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ollama_scout-0.1.0.tar.gz
Algorithm Hash digest
SHA256 c4690add76d9e938c22282173c03108fc17155259ab219ebe925f510c850675a
MD5 6e523374ab05e97232e925350c6298c5
BLAKE2b-256 3becfdfc7439f825df06bc87193de920a313e12f8f9f4a3b772ae2fc7b9377d0

See more details on using hashes here.

Provenance

The following attestation bundles were made for ollama_scout-0.1.0.tar.gz:

Publisher: release.yml on sandy-sp/ollama-scout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file ollama_scout-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: ollama_scout-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 22.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ollama_scout-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 be6afa7b98466090592ede78f8f01f77ed06ca2217a100050188aca5335c3d6f
MD5 d6e5bc0f235508f732bc548ce12852d5
BLAKE2b-256 6b78ffeb19a5ae005713d3d7f754f1a07a8d921e788cd72c3a127015b11d6bed

See more details on using hashes here.

Provenance

The following attestation bundles were made for ollama_scout-0.1.0-py3-none-any.whl:

Publisher: release.yml on sandy-sp/ollama-scout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page