olmon

A lightweight CLI to monitor your Ollama models and usage in real time.


$ olmon top

╭─────────────────────────────────────── olmon top 01:09:32 ───────────────────────────────────────╮
│ Model                        VRAM              VRAM %            Expires In      Status          │
│ ornith:9b                    5.0 GB            ██░░░  41%        0m 22s          ● expiring      │
│ phi4-mini:latest             2.9 GB            █░░░░  23%        0m 17s          ● expiring      │
╰────────────────────────────── VRAM: 7.8 GB / 12.0 GB |  2 running ───────────────────────────────╯
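The Expires In and VRAM % columns above can be approximated with a few lines of Python. This is a minimal sketch, not olmon's actual code: it assumes the expiry time and per-model VRAM size come from Ollama's `GET /api/ps` endpoint (which reports `expires_at` and `size_vram` for each running model), that the percentage is truncated rather than rounded, and that the bar is five cells wide. The helper names `format_countdown` and `vram_bar` are hypothetical.

```python
from datetime import datetime, timedelta, timezone


def format_countdown(expires_at: datetime, now: datetime) -> str:
    """Format the time left until expiry as 'Xm Ys' (hypothetical helper)."""
    # Clamp at zero so an already-expired model never shows a negative time.
    remaining = max(int((expires_at - now).total_seconds()), 0)
    minutes, seconds = divmod(remaining, 60)
    return f"{minutes}m {seconds}s"


def vram_bar(used: float, total: float, width: int = 5) -> str:
    """Render a fixed-width usage bar followed by a truncated percentage."""
    fraction = used / total if total else 0.0
    filled = min(int(fraction * width), width)
    return "█" * filled + "░" * (width - filled) + f"  {int(fraction * 100)}%"


if __name__ == "__main__":
    now = datetime.now(timezone.utc)
    print(format_countdown(now + timedelta(seconds=22), now))
    print(vram_bar(5.0, 12.0))
```

The real tool presumably works from byte counts rather than the rounded GB figures shown in the table, so percentages computed from the displayed values can differ by a point.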




Features

  • 🔄 Real-time dashboard — live auto-refreshing view of running models
  • 📋 Model listing — browse all installed models with size, family, and quantization
  • 🔍 Model inspection — full details on any installed model
  • 🟢 Status indicators — green / blue / red at a glance
  • ⚙️ Configurable — set your API host and refresh interval
  • 🪶 Lightweight — minimal dependencies, works over SSH on headless servers
  • 🖥️ htop-style monitoring: olmon top with VRAM usage and expiry countdown
  • 🛑 Model control — force unload models from VRAM
  • ⚖️ Model comparison — side by side spec comparison of multiple models
  • 🔧 Scripting friendly: --json flag and exit codes on every command

Requirements

  • Python 3.13+
  • Ollama installed and running

Installation

pip

pip install olmon

curl (Linux / macOS)

curl -fsSL https://raw.githubusercontent.com/glemiu6/olmon/master/scripts/install.sh | sh

From source

git clone https://github.com/glemiu6/olmon.git
cd olmon
uv pip install -e .

Windows

A native Windows binary is not currently available.
Use WSL and follow the Linux installation instructions.


Usage

olmon status                              # quick health check
olmon models                              # list all installed models
olmon models --sort size                  # sort by size
olmon models --filter llama               # filter by name or family
olmon inspect llama3:latest               # full details on a model
olmon ps                                  # show currently running models
olmon watch                               # live auto-refreshing dashboard
olmon watch --interval 5                  # refresh every 5 seconds
olmon top                                 # htop-style live monitoring
olmon stop qwen2.5:7b                     # unload a model from VRAM
olmon compare qwen2.5:7b llama3.2:latest  # compare models
olmon --no-color models                   # pipe-friendly output
olmon models --json                       # output as JSON
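
The `--json` flag makes it possible to consume olmon output from another program. The sketch below assumes only that `olmon models --json` prints a JSON document on stdout. The README does not describe the schema, so the code parses the output without relying on any field names. The function name `olmon_json` and its injectable `run` parameter are illustrative, not part of olmon.

```python
import json
import subprocess
from typing import Any, Callable, Sequence


def olmon_json(
    args: Sequence[str],
    run: Callable[..., subprocess.CompletedProcess] = subprocess.run,
) -> Any:
    """Run `olmon <args> --json` and return the parsed JSON output."""
    result = run(
        ["olmon", *args, "--json"],
        capture_output=True,
        text=True,
        check=True,  # raise CalledProcessError on a non-zero exit code
    )
    return json.loads(result.stdout)
```

A call such as `olmon_json(["models"])` would return whatever structure olmon emits. Passing a stub for `run` lets the wrapper be tested on machines where olmon is not installed.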

Global flags

olmon --host http://192.168.1.10:11434 status   # connect to remote Ollama
olmon --version                                  # print version

Status Indicators

Indicator    Meaning
🟢 Green     Models are loaded and running
🔵 Blue      Ollama is idle, no models loaded
🔴 Red       Ollama is offline or unreachable
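
The three states in the table can be derived from a single request to Ollama. The following is a rough sketch rather than olmon's implementation: it assumes the status is based on Ollama's `GET /api/ps` endpoint, which lists currently loaded models, and the function names `classify` and `count_running` are hypothetical.

```python
import json
import urllib.error
import urllib.request
from typing import Optional


def classify(running_models: Optional[int]) -> str:
    """Map a running-model count to an indicator colour.

    None means Ollama could not be reached.
    """
    if running_models is None:
        return "red"
    return "green" if running_models > 0 else "blue"


def count_running(
    host: str = "http://localhost:11434", timeout: float = 2.0
) -> Optional[int]:
    """Return the number of loaded models, or None if Ollama is unreachable."""
    try:
        with urllib.request.urlopen(f"{host}/api/ps", timeout=timeout) as resp:
            payload = json.load(resp)
    except (urllib.error.URLError, OSError, ValueError):
        # Connection refused, timeout, or a non-JSON reply all count as offline.
        return None
    return len(payload.get("models") or [])


if __name__ == "__main__":
    print(classify(count_running()))
```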

Configuration

olmon init          # create default config file

Config is stored at ~/.config/olmon/config.json:

{
  "host": "http://localhost:11434",
  "interval": 2,
  "no_color": false,
  "default_sort": "name"
}
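
A script that wants to share olmon's settings can read the same file. This sketch assumes that keys missing from the file fall back to the defaults shown above and that a missing file means "use all defaults". The README does not confirm either behaviour for olmon itself, and `load_config` is a hypothetical helper.

```python
import json
from pathlib import Path

# Default values copied from the example config above.
DEFAULTS = {
    "host": "http://localhost:11434",
    "interval": 2,
    "no_color": False,
    "default_sort": "name",
}
CONFIG_PATH = Path.home() / ".config" / "olmon" / "config.json"


def load_config(path: Path = CONFIG_PATH) -> dict:
    """Return DEFAULTS overlaid with any keys found in the config file."""
    try:
        user = json.loads(path.read_text(encoding="utf-8"))
    except FileNotFoundError:
        user = {}  # no config file yet: fall back to defaults (assumption)
    return {**DEFAULTS, **user}


if __name__ == "__main__":
    print(load_config())
```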

Update & Uninstall

olmon update      # update to latest version
olmon uninstall   # remove olmon and config

Why olmon?

Most Ollama monitoring tools are GUI or system tray apps. olmon is built for:

  • Headless Linux servers — no GUI required
  • Remote monitoring — works over SSH
  • Shell scripting — pipe-friendly with --json flag and exit codes
  • DevOps workflows — integrate into scripts and cron jobs
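
As an illustration of the scripting and cron use cases, a health check can be reduced to an exit-code test. The README states that every command sets an exit code but does not list the values, so this sketch assumes only the usual convention that 0 means success and anything else means a problem. `ollama_healthy` is a hypothetical wrapper, and the `--host` flag is placed before the subcommand as in the Global flags example.

```python
import subprocess
import sys
from typing import Callable, Optional


def ollama_healthy(
    host: Optional[str] = None,
    run: Callable[..., subprocess.CompletedProcess] = subprocess.run,
) -> bool:
    """Return True if `olmon status` exits with code 0 (assumed to mean healthy)."""
    cmd = ["olmon"]
    if host:
        cmd += ["--host", host]  # global flag goes before the subcommand
    cmd.append("status")
    return run(cmd, capture_output=True, text=True).returncode == 0


if __name__ == "__main__":
    # Propagate the result so cron or a shell script can react to it.
    sys.exit(0 if ollama_healthy() else 1)
```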

GPU Support

  • NVIDIA — full VRAM monitoring via nvidia-smi
  • AMD — coming soon
  • CPU only — VRAM stats not available
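
For reference, total and used VRAM can be read from nvidia-smi in a machine-readable form. The sketch below uses nvidia-smi's `--query-gpu` and `--format=csv,noheader,nounits` options, which report memory in MiB. Whether olmon invokes nvidia-smi this way is an assumption, and `parse_vram` and `read_vram` are hypothetical names.

```python
import subprocess
from typing import List, Tuple

QUERY = [
    "nvidia-smi",
    "--query-gpu=memory.used,memory.total",
    "--format=csv,noheader,nounits",
]


def parse_vram(csv_text: str) -> List[Tuple[int, int]]:
    """Parse 'used, total' MiB pairs, one line per GPU."""
    pairs = []
    for line in csv_text.strip().splitlines():
        used, total = (int(field.strip()) for field in line.split(","))
        pairs.append((used, total))
    return pairs


def read_vram() -> List[Tuple[int, int]]:
    """Query nvidia-smi; raises if the tool is missing or exits with an error."""
    out = subprocess.run(QUERY, capture_output=True, text=True, check=True).stdout
    return parse_vram(out)
```

On a CPU-only machine `read_vram` raises `FileNotFoundError` because nvidia-smi is absent, which corresponds to the "VRAM stats not available" case above.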

Roadmap

See ROADMAP.md for the full plan.


Contributing

Contributions are welcome. Feel free to open an issue or submit a pull request.


License

MIT — see LICENSE for details.


Made with ❤️ by Vlad Digori
