Skip to main content

Scan your hardware and find compatible Ollama LLMs

Project description

ollama-scout

Python 3.10+ License: MIT Platform

Scan your hardware. Find the right LLMs. Pull them instantly.

ollama-scout is a cross-platform CLI tool that detects your GPU VRAM, CPU, and RAM, then recommends compatible Ollama models grouped by use case.

Demo

╭──────────────────────────────────────────────────────────╮
│  ollama-scout  |  LLM Hardware Advisor                   │
╰──────────────────────────────────────────────────────────╯

                   System Hardware
╭─────────────────┬────────────────────────────────────────╮
│ OS              │ Linux                                  │
│ CPU             │ AMD Ryzen 9 5900X                      │
│ Cores / Threads │ 12 cores / 24 threads                  │
│ RAM             │ 32.0 GB                                │
│ GPU             │ NVIDIA RTX 3080 (10.0 GB VRAM)         │
╰─────────────────┴────────────────────────────────────────╯

  Coding Models
  Model             Tag      Quant    Size    Fit        Mode     Status
  deepseek-coder    6.7b     Q4_K_M   3.8GB   Excellent  GPU      Available
  codellama         7b       Q4_K_M   3.8GB   Excellent  GPU      Pulled
  qwen2.5-coder    7b       Q4_K_M   4.4GB   Excellent  GPU      Available

  Reasoning Models
  deepseek-r1       7b       Q4_K_M   4.7GB   Excellent  GPU      Available
  phi4              14b      Q4_K_M   8.4GB   Good       CPU+GPU  Available

  Chat Models
  llama3.2          3b       Q4_K_M   2.0GB   Excellent  GPU      Pulled
  mistral           7b       Q4_K_M   4.1GB   Excellent  GPU      Available

Interactive Mode

Just run ollama-scout with no arguments for a guided experience:

ollama-scout

The interactive mode walks you through hardware scanning, use case selection, and model recommendations step by step — no flags needed. You can also launch it explicitly with ollama-scout -i.

How It Works

1. Scan       Detects GPU VRAM, CPU cores/threads, RAM
              Supports NVIDIA, AMD (ROCm), Apple Silicon unified memory
                                    |
2. Fetch      Pulls latest models from Ollama library API
              Falls back to built-in list if offline
                                    |
3. Score      Matches each model variant to your hardware
              GPU fit > CPU+GPU offload > CPU-only > excluded
                                    |
4. Recommend  Groups results by use case: Coding, Reasoning, Chat
              Shows fit label, run mode, and pull status

Installation

git clone https://github.com/sandy-sp/ollama-scout.git
cd ollama-scout
pip install -r requirements.txt

Requires Python 3.10+ and Ollama installed.

Usage

ollama-scout                                # Interactive guided mode (default)
ollama-scout -i                             # Explicit interactive mode
python main.py                              # Full scan, grouped by use case
python main.py --use-case coding            # Filter by use case
python main.py --flat                       # Flat list instead of grouped
python main.py --top 20                     # Show top 20 results
python main.py --offline                    # Use built-in model list (no network)
python main.py --benchmark                  # Show inference speed estimates
python main.py --model deepseek-coder       # Detail view for a specific model
python main.py --export                     # Auto-export to Markdown report
python main.py --output ~/report.md         # Export to specific path
python main.py --pull llama3.2:latest       # Pull a model directly
python main.py --no-pull-prompt             # Skip interactive pull prompt
python main.py --config                     # Show current config
python main.py --config-set offline_mode=true  # Set a config value

See docs/USAGE.md for the full guide with platform-specific notes and FAQ.

Features

  • Interactive mode — Guided step-by-step session when run with no arguments
  • Hardware detection — GPU VRAM, CPU, RAM on Windows, macOS, Linux
  • Apple Silicon support — Treats unified memory as VRAM for accurate scoring
  • Live + offline modes — Fetches from Ollama API or uses built-in fallback list
  • Smart recommendations — Full GPU / partial CPU+GPU offload / CPU-only scoring
  • Use-case grouping — Coding, Reasoning, Chat
  • Benchmark estimates — Rough tokens/sec estimation per model
  • Model detail view — Deep dive into a specific model's variants and compatibility
  • Already-pulled detection — Highlights models you've downloaded
  • Auto-pull — Pull a recommended model interactively
  • Markdown export — Save results as a formatted report
  • Config file — Persistent defaults via ~/.ollama-scout.json

Requirements

Package Purpose
rich Terminal UI (tables, panels, spinners)
requests Fetch Ollama library API
psutil Cross-platform RAM detection

Roadmap

  • Config file support (~/.ollama-scout.json)
  • pip installable package (ollama-scout CLI command)
  • GPU benchmark integration (real ollama run timing)
  • Model comparison mode (side-by-side two models)
  • XDG config path support (~/.config/ollama-scout/)
  • Web UI version

Contributing

PRs welcome! Especially for:

  • Better use-case mapping for new models
  • Multi-GPU scoring improvements
  • Additional platform testing (Windows ARM, Linux ARM)

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ollama_scout-0.2.0.tar.gz (36.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ollama_scout-0.2.0-py3-none-any.whl (30.8 kB view details)

Uploaded Python 3

File details

Details for the file ollama_scout-0.2.0.tar.gz.

File metadata

  • Download URL: ollama_scout-0.2.0.tar.gz
  • Upload date:
  • Size: 36.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ollama_scout-0.2.0.tar.gz
Algorithm Hash digest
SHA256 1c10fbf8746f38a5b94a9aae4d94a05b6a09f8d2db86774bf70be5d636008b6b
MD5 28f523a846445ad03bb9dd6342705321
BLAKE2b-256 e9b7299d47fe0c5d347427e3bef2868310f673c9e3a21790c26b0ae2d59d050f

See more details on using hashes here.

Provenance

The following attestation bundles were made for ollama_scout-0.2.0.tar.gz:

Publisher: release.yml on sandy-sp/ollama-scout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file ollama_scout-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: ollama_scout-0.2.0-py3-none-any.whl
  • Upload date:
  • Size: 30.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ollama_scout-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 85e1e73699f3aa353b1446a5d755492a58a86e6fa913fd8c71083a3eb977296d
MD5 b41ef6edbce82323c012d717f8350093
BLAKE2b-256 67c38e37bfa0ac48f4680f38ff57300ec77389d5e4a7bd97d0e954f342c0d7e1

See more details on using hashes here.

Provenance

The following attestation bundles were made for ollama_scout-0.2.0-py3-none-any.whl:

Publisher: release.yml on sandy-sp/ollama-scout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page