Skip to main content

Scan your hardware and find compatible Ollama LLMs

Project description

ollama-scout

Python 3.10+ License: MIT Platform

Scan your hardware. Find the right LLMs. Pull them instantly.

ollama-scout is a cross-platform CLI tool that detects your GPU VRAM, CPU, and RAM, then recommends compatible Ollama models grouped by use case.

Demo

╭──────────────────────────────────────────────────────────╮
│  ollama-scout  |  LLM Hardware Advisor                   │
╰──────────────────────────────────────────────────────────╯

                   System Hardware
╭─────────────────┬────────────────────────────────────────╮
│ OS              │ Linux                                  │
│ CPU             │ AMD Ryzen 9 5900X                      │
│ Cores / Threads │ 12 cores / 24 threads                  │
│ RAM             │ 32.0 GB                                │
│ GPU             │ NVIDIA RTX 3080 (10.0 GB VRAM)         │
╰─────────────────┴────────────────────────────────────────╯

  Coding Models
  Model             Tag      Quant    Size    Fit        Mode     Status
  deepseek-coder    6.7b     Q4_K_M   3.8GB   Excellent  GPU      Available
  codellama         7b       Q4_K_M   3.8GB   Excellent  GPU      Pulled
  qwen2.5-coder    7b       Q4_K_M   4.4GB   Excellent  GPU      Available

  Reasoning Models
  deepseek-r1       7b       Q4_K_M   4.7GB   Excellent  GPU      Available
  phi4              14b      Q4_K_M   8.4GB   Good       CPU+GPU  Available

  Chat Models
  llama3.2          3b       Q4_K_M   2.0GB   Excellent  GPU      Pulled
  mistral           7b       Q4_K_M   4.1GB   Excellent  GPU      Available

Interactive Mode

Just run ollama-scout with no arguments for a guided experience:

ollama-scout

The interactive mode walks you through hardware scanning, use case selection, and model recommendations step by step — no flags needed. You can also launch it explicitly with ollama-scout -i.

How It Works

1. Scan       Detects GPU VRAM, CPU cores/threads, RAM
              Supports NVIDIA, AMD (ROCm), Apple Silicon unified memory
                                    |
2. Fetch      Pulls latest models from Ollama library API
              Falls back to built-in list if offline
                                    |
3. Score      Matches each model variant to your hardware
              GPU fit > CPU+GPU offload > CPU-only > excluded
                                    |
4. Recommend  Groups results by use case: Coding, Reasoning, Chat
              Shows fit label, run mode, and pull status

Installation

git clone https://github.com/sandy-sp/ollama-scout.git
cd ollama-scout
pip install -r requirements.txt

Requires Python 3.10+ and Ollama installed.

Usage

ollama-scout                                # Interactive guided mode (default)
ollama-scout -i                             # Explicit interactive mode
python main.py                              # Full scan, grouped by use case
python main.py --use-case coding            # Filter by use case
python main.py --flat                       # Flat list instead of grouped
python main.py --top 20                     # Show top 20 results
python main.py --offline                    # Use built-in model list (no network)
python main.py --benchmark                  # Show inference speed estimates
python main.py --model deepseek-coder       # Detail view for a specific model
python main.py --export                     # Auto-export to Markdown report
python main.py --output ~/report.md         # Export to specific path
python main.py --pull llama3.2:latest       # Pull a model directly
python main.py --no-pull-prompt             # Skip interactive pull prompt
python main.py --config                     # Show current config
python main.py --config-set offline_mode=true  # Set a config value

See docs/USAGE.md for the full guide with platform-specific notes and FAQ.

Features

  • Interactive mode — Guided step-by-step session when run with no arguments
  • Hardware detection — GPU VRAM, CPU, RAM on Windows, macOS, Linux
  • Apple Silicon support — Treats unified memory as VRAM for accurate scoring
  • Live + offline modes — Fetches from Ollama API or uses built-in fallback list
  • Smart recommendations — Full GPU / partial CPU+GPU offload / CPU-only scoring
  • Use-case grouping — Coding, Reasoning, Chat
  • Benchmark estimates — Rough tokens/sec estimation per model
  • Model detail view — Deep dive into a specific model's variants and compatibility
  • Already-pulled detection — Highlights models you've downloaded
  • Auto-pull — Pull a recommended model interactively
  • Markdown export — Save results as a formatted report
  • Config file — Persistent defaults via ~/.ollama-scout.json

Requirements

Package Purpose
rich Terminal UI (tables, panels, spinners)
requests Fetch Ollama library API
psutil Cross-platform RAM detection

Roadmap

  • Config file support (~/.ollama-scout.json)
  • pip installable package (ollama-scout CLI command)
  • GPU benchmark integration (real ollama run timing)
  • Model comparison mode (side-by-side two models)
  • XDG config path support (~/.config/ollama-scout/)
  • Web UI version

Contributing

PRs welcome! Especially for:

  • Better use-case mapping for new models
  • Multi-GPU scoring improvements
  • Additional platform testing (Windows ARM, Linux ARM)

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ollama_scout-0.1.1.tar.gz (34.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

ollama_scout-0.1.1-py3-none-any.whl (29.3 kB view details)

Uploaded Python 3

File details

Details for the file ollama_scout-0.1.1.tar.gz.

File metadata

  • Download URL: ollama_scout-0.1.1.tar.gz
  • Upload date:
  • Size: 34.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ollama_scout-0.1.1.tar.gz
Algorithm Hash digest
SHA256 b02df187819052deed57e27538c0615fc60898649e3471554e7f4048ef70b362
MD5 f3403a2e089405ec026c343549e45141
BLAKE2b-256 9f6674e9c77743f582dbb71b231d7899deeb84e7dbe7671d92168af0be1670fe

See more details on using hashes here.

Provenance

The following attestation bundles were made for ollama_scout-0.1.1.tar.gz:

Publisher: release.yml on sandy-sp/ollama-scout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file ollama_scout-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: ollama_scout-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 29.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for ollama_scout-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 2bcc14613bc342630c73e3200e16ed81ef709444c131cb99466cdb0812bc00e0
MD5 7adff45f1ac79cec20767fb7ee47e7b0
BLAKE2b-256 4f247dcff29b5572cbc1baa53443e282550136fe6b385b5fcd098c26bab72778

See more details on using hashes here.

Provenance

The following attestation bundles were made for ollama_scout-0.1.1-py3-none-any.whl:

Publisher: release.yml on sandy-sp/ollama-scout

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page