htop for AI agents — liveness, CPU/mem/GPU usage, and a kill switch for headless agents (openclaw, hermes, ollama, vllm, claude-code).

These details have not been verified by PyPI

Project links

Project description

agent-usage-manager

A tiny, single-file web dashboard for headless AI agents running on a machine — OpenClaw, Hermes, Claude Code, Ollama, vLLM, llama.cpp, or anything you name. It shows which agents are alive and what they're costing you (CPU, memory, GPU), and gives you a kill button per agent.

No database, no auth layer, no dependencies beyond FastAPI + psutil. Runs on macOS and Linux. Meant to be cloned, configured, and run on any node in a fleet.

AGENT          PID    STATUS     CPU %   MEM MB   GPU MB   UPTIME   COMMAND          ┆
openclaw +3    48213  ● running    62.4    1840     7320     2h 11m  openclaw serve … [kill] [force]
claude-code +9 73590  ● running    97.4    7630        —     1h 02m  claude --chann … [kill] [force]
hermes         49001  ● running    18.0     512        —     44m     hermes worker …  [kill] [force]
ollama         50122  ● running     3.1    9210    14080     6h 02m  ollama runner …  [kill] [force]

(+N = child processes rolled up under the agent; CPU/mem/GPU are tree totals.)

A live web UI (auto-refreshing every 3s). The rendered GIF lands here once recorded — see demo.tape.

What it does

One row per agent. Agents are grouped by process tree — the spawned children of an agent (inference subprocesses, MCP servers, helpers) are rolled up under it with a +N badge instead of cluttering the list as separate rows.
Liveness — green dot = running, red = zombie/dead. Status column shows the OS state.
Usage — CPU %, resident memory (MB), GPU memory (MB, NVIDIA only), and uptime, refreshed every 3s. CPU/mem/GPU are tree totals — the agent's true cost including everything it spawned.
Kill the tree — kill sends SIGTERM to the agent and its children (so spawned helpers don't leak resources), force sends SIGKILL. SIGTERM auto-escalates to SIGKILL after 3s. The confirm dialog tells you how many child processes will stop.

Safety

This is the important part — a web page that can kill processes needs guardrails:

Allowlist only. Only processes matching a pattern in agents.yaml are ever listed or killable. The kill endpoint re-checks the match server-side before sending any signal, so the dashboard can never be used to kill an arbitrary PID.
Protected patterns. Anything matching protect: in agents.yaml — plus the monitor's own process and PID 1 — shows a disabled, greyed-out kill button and is refused server-side.
Secret redaction. Command lines often carry tokens/keys in env vars or flags (FOO_TOKEN=..., --api-key ..., sk-..., ghp_..., JWTs). The command column redacts these to *** before they ever reach the browser — safe to screenshot.
Bind local by default. It listens on 127.0.0.1. Don't expose it to a network without putting auth in front of it (reverse proxy + basic auth, SSH tunnel, etc.) — it has no built-in authentication.

Quick start

Run it without installing anything (needs uv):

uvx agent-usage-manager
# open http://127.0.0.1:8765

Or install it:

pipx install agent-usage-manager   # or: pip install agent-usage-manager
agent-usage-manager --port 8765

From a clone (for hacking on it):

git clone <this-repo> && cd agent-usage-manager
./run.sh                           # venv + editable install, serves on :8765

Flags: --host, --port, --config /path/to/agents.yaml.

Configure which processes are "agents"

Edit agents.yaml:

agents:
  - label: openclaw           # shown as the badge in the UI
    match: openclaw           # case-insensitive substring of the command line
  - label: hermes
    match: hermes
  - label: claude-code
    match: "claude(\\s|$|-code)"
    regex: true               # treat `match` as a regex instead of substring

protect:                      # never killable, even if matched above
  - uvicorn

A process matches if the pattern hits its full command line or its process name. Point at a different file with AGENTS_CONFIG=/path/to/agents.yaml.

GPU notes

Per-process GPU memory comes from nvidia-smi when it's on PATH (Linux / NVIDIA). Apple Silicon has no per-process GPU accounting API, so the GPU column stays blank on Macs — CPU and memory are the meaningful resource signals there.

API

GET /api/agents → { agents: [...], host, cpu_count, ts }
POST /api/kill/{pid}?force=false → SIGTERM (or SIGKILL with force=true)

Run as a service

Linux (systemd), ~/.config/systemd/user/agent-usage-manager.service:

[Unit]
Description=agent usage manager
[Service]
ExecStart=%h/agent-usage-manager/.venv/bin/uvicorn app:app --port 8765
WorkingDirectory=%h/agent-usage-manager
Restart=on-failure
[Install]
WantedBy=default.target

systemctl --user enable --now agent-usage-manager

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.0

Jun 13, 2026

0.1.3

Jun 8, 2026

0.1.2

Jun 8, 2026

0.1.1

Jun 8, 2026

This version

0.1.0

Jun 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_usage_manager-0.1.0.tar.gz (11.2 kB view details)

Uploaded Jun 8, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agent_usage_manager-0.1.0-py3-none-any.whl (12.7 kB view details)

Uploaded Jun 8, 2026 Python 3

File details

Details for the file agent_usage_manager-0.1.0.tar.gz.

File metadata

Download URL: agent_usage_manager-0.1.0.tar.gz
Upload date: Jun 8, 2026
Size: 11.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for agent_usage_manager-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`92e6d0a942e5f1c2328b1c881c39e4a94b49018951773851b3e707eb6e8d13be`
MD5	`79635abccdc972afc63d446658b41a42`
BLAKE2b-256	`1ede1713c766b02ef40af44b26ef31dea6f3a168fca427b23ad3f79257555c26`

See more details on using hashes here.

File details

Details for the file agent_usage_manager-0.1.0-py3-none-any.whl.

File metadata

Download URL: agent_usage_manager-0.1.0-py3-none-any.whl
Upload date: Jun 8, 2026
Size: 12.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for agent_usage_manager-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7a2294773744b5fb301efa79cd7507b1fa8a812390e1f2bbb6bc08c2ecf59fb0`
MD5	`d74c81e4354bf9528110a8d2aa01f748`
BLAKE2b-256	`815ded67c14f785a1cf2c320bd1a844db7d95459b5501b9ebf1a69ad017af460`

See more details on using hashes here.

agent-usage-manager 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

agent-usage-manager

What it does

Safety

Quick start

Configure which processes are "agents"

GPU notes

API

Run as a service

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes