Skip to main content

Restart-on-exit supervisor for autonomous CLI agents

Project description

中文 · English

CI PyPI Downloads License codecov

agent-runner

A restart-on-exit supervisor for autonomous coding CLIs. Tested with Claude Code and aider out of the box; any prompt-arg CLI via custom config. Spawn the agent round-after-round under defenses that prevent the failure modes that bite in production: stuck rounds, orphan commits, OAuth burn loops, full disks, runaway memory.

┌──────────────────────────────────────────┐
│ Layer 3: The Witness (monitor)           │  10 detectors + auto-stop
├──────────────────────────────────────────┤
│ Layer 2: The Loop (serve, ~120 LOC)      │  signal-trapping restart loop
├──────────────────────────────────────────┤
│ Layer 1: The Round (round)               │  one agent invocation
└──────────────────────────────────────────┘

Install

pip install cli-agent-runner

The installed CLI command is agent-runner (the PyPI distribution name is prefixed for namespace disambiguation; the import name and command are not).

Quick start

cd your-project
agent-runner init                 # scaffold agent-runner.toml + prompts/main.md
$EDITOR agent-runner.toml         # point agent.command at your CLI
agent-runner install --monitor    # systemd user units for serve + monitor
agent-runner status               # confirm running
agent-runner peek                 # snapshot of project state
agent-runner monitor              # live anomaly detection

Full walkthrough: docs/quickstart.md.

13 verbs

Lifecycle Observation
init / install / uninstall peek — state snapshot
start / stop / kill / cancel watch — peek in a refresh loop
restart / status monitor — 10 detectors, alerts, auto-stop
round / serve

Verb reference: docs/commands.md.

Defenses (built in)

11 named defenses, structured as data — see agent-runner peek --select defenses. Each carries the historical incident it codifies and the invariant test that guards it. Highlights:

  • round_timeout_s — hard wall, never the agent's word on when to stop
  • process_group_isolation — kill the round, not just the parent
  • orphan_stash_idempotency_s — no 3-stashes-per-second pile-ups
  • sha_locked_stashstash@{N} indices drift; SHAs don't
  • set_diff_classification — line-set comparison, not unified-diff +/- scan
  • startup_smoke_check — refuse to run with a clearly-truncated prompt

Full list and rationale: docs/architecture.md.

Monitor: 10 detectors

Notify only: timeout_rate, hung, orphan_chain, disk_warning, mem_pressure, smoke_fail_rate, network_fail, rate_limit_active.

Auto-stop the service (continuing is harmful):

  • oauth_fail — burning API quota on auth-rejected rounds
  • disk_critical — writing to a near-full disk risks corruption

Runs locally or against a remote host via ssh:

agent-runner monitor                  # local, 30s poll
agent-runner monitor --host pi        # remote, 60s poll
agent-runner monitor --json | jq -c   # pipe to downstream consumers

SSH trust boundary: monitor --host <alias> shells out via plain SSH using your local ~/.ssh/config (aliases, StrictHostKeyChecking behavior). A monitor with auto_stop enabled can issue agent-runner stop on the remote — verify your SSH config before pointing monitor at a remote service. See docs/runbook.md § "Remote monitor & SSH trust" for recommended hygiene.

Documentation

Development

git clone https://github.com/wan9yu/cli-agent-runner.git
cd cli-agent-runner
python3 -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"

./build.sh check                          # full local-CI sweep
./build.sh test                           # unit + integration only
AGENT_RUNNER_E2E_PI=1 ./build.sh e2e      # opt-in pi e2e (needs ssh alias `pi`)

Some docs/*.md blocks are generated from code — ./build.sh docs rewrites the <!-- gen:* --> regions, and ./build.sh check verifies they are fresh.

POSIX-only (Linux, macOS). Tested under Python 3.11+ on x86_64 and aarch64.

License

Apache License 2.0.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cli_agent_runner-0.1.30.tar.gz (253.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cli_agent_runner-0.1.30-py3-none-any.whl (99.7 kB view details)

Uploaded Python 3

File details

Details for the file cli_agent_runner-0.1.30.tar.gz.

File metadata

  • Download URL: cli_agent_runner-0.1.30.tar.gz
  • Upload date:
  • Size: 253.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for cli_agent_runner-0.1.30.tar.gz
Algorithm Hash digest
SHA256 c96017b3b74384d813fd4302e79bb80b9a87e560597e36ca1004eb42c2a648db
MD5 f0ddde1cfbf2b84c9160a0dd76b84de5
BLAKE2b-256 ffa9b97dccae61accab4c8eab5f6582b79a342d3aa3f5f4ece3b94385076e0e3

See more details on using hashes here.

Provenance

The following attestation bundles were made for cli_agent_runner-0.1.30.tar.gz:

Publisher: release.yml on wan9yu/cli-agent-runner

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file cli_agent_runner-0.1.30-py3-none-any.whl.

File metadata

File hashes

Hashes for cli_agent_runner-0.1.30-py3-none-any.whl
Algorithm Hash digest
SHA256 aaa707c44937caf6777e60714bc07f6435ae252c73abb9d3b43bc26dd59ddb0b
MD5 7c4a7385e000efece9d759b7f0ed258e
BLAKE2b-256 b02f94532594c0b4f2e5c45898192c3f077baee0328b6c7c7addaba5daca4271

See more details on using hashes here.

Provenance

The following attestation bundles were made for cli_agent_runner-0.1.30-py3-none-any.whl:

Publisher: release.yml on wan9yu/cli-agent-runner

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page