Outer-loop orchestrator for Claude Code — run iterative, self-improving coding sessions with automatic progress tracking, verification, and cost reporting.

These details have not been verified by PyPI

Project links

Project description

ralph-runner

Outer-loop orchestrator for Claude Code — run iterative, self-improving coding sessions with automatic progress tracking, verification, and cost reporting.

ralph-runner spawns repeated Claude Code sessions in a loop, feeding each one the accumulated progress from prior iterations. Claude works on your task, writes progress notes, and ralph-runner verifies the result, tracks costs, and decides whether to continue or stop. The result is autonomous, multi-hour coding runs that converge on a solution.

How It Works

┌─────────────────────────────────────────────────┐
│                  ralph-runner                    │
│                                                  │
│  ┌──────────┐   ┌──────────┐   ┌──────────┐    │
│  │ Iter 1   │──▶│ Iter 2   │──▶│ Iter N   │    │
│  │ Claude   │   │ Claude   │   │ Claude   │    │
│  │ Session  │   │ Session  │   │ Session  │    │
│  └────┬─────┘   └────┬─────┘   └────┬─────┘    │
│       │              │              │           │
│       ▼              ▼              ▼           │
│  ┌──────────────────────────────────────────┐   │
│  │            progress.md                    │   │
│  │  (shared memory across iterations)        │   │
│  └──────────────────────────────────────────┘   │
│       │              │              │           │
│       ▼              ▼              ▼           │
│  ┌──────────────────────────────────────────┐   │
│  │         verify command (optional)         │   │
│  │         e.g. make test, pytest            │   │
│  └──────────────────────────────────────────┘   │
└─────────────────────────────────────────────────┘

Each iteration is a fresh Claude Code session that reads the progress file, continues the work, and updates the file. Between iterations, ralph-runner runs your verification command, tracks token usage and costs, detects timeouts and crashes, and enforces minimum iteration counts before accepting completion.

Installation

As a CLI tool

pip install ralph-runner

Or install from source:

git clone https://github.com/gzxultra/ralph-runner.git
cd ralph-runner
pip install -e .

As a Claude Code plugin

Install directly from the repo — no pip needed for plugin use:

# Add the marketplace
/plugin marketplace add gzxultra/ralph-runner

# Install the plugin
/plugin install ralph-runner@ralph-runner

Or install from a local clone:

claude plugin add ./ralph-runner
# or
claude --plugin-dir ./ralph-runner

Prerequisites

Python 3.11+
Claude Code CLI (claude) installed and authenticated — see Claude Code docs

Quick Start

From the command line

# Basic run — 10 iterations on a task
ralph-runner --prompt "Refactor the auth module to use JWT tokens"

# With verification
ralph-runner --prompt "Fix the failing tests in src/api/" \
  --verify "pytest tests/ -x"

# Shorter run with internet access
ralph-runner --prompt "Add rate limiting to the API" \
  --iterations 5 --max-iterations 15 --internet

From within Claude Code (plugin)

/ralph-runner:run Refactor the auth module to use JWT tokens
/ralph-runner:run --verify "pytest" Fix the failing tests
/ralph-runner:status
/ralph-runner:resume

Usage

ralph-runner --prompt PROMPT [OPTIONS]

Required

Flag	Description
`--prompt TEXT`	The task description for Claude to work on

Options

Flag	Default	Description
`--iterations N`	`10`	Minimum iterations before completion is accepted
`--max-iterations N`	`50`	Hard stop after this many iterations
`--verify CMD`	(none)	Shell command to run after each iteration (exit 0 = pass)
`--mode {afk,hitl}`	`afk`	`afk` = fully autonomous, `hitl` = pause between iterations
`--model NAME`	`sonnet`	Claude model to use
`--timeout SECS`	`900`	Hard timeout per iteration (15 min)
`--idle-timeout SECS`	`120`	Kill iteration if no output for this long
`--internet`	off	Enable web access for Claude sessions
`--plan / --no-plan`	`--plan`	Create a plan.md file in the first iteration
`--resume DIR`	(none)	Resume a previous run from its output directory
`--debug`	off	Save prompts and enable verbose stderr logging

Modes

AFK Mode (default)

Fully autonomous. Claude runs with bypassPermissions — no confirmation prompts. Ideal for overnight or background runs on trusted codebases.

ralph-runner --prompt "Migrate the database schema to v2" --mode afk

HITL Mode (Human-in-the-Loop)

Pauses after each iteration so you can review progress before continuing. Press Enter to continue or q to quit.

ralph-runner --prompt "Redesign the dashboard layout" --mode hitl

Verification

The --verify flag runs a command after each iteration. If it exits 0, the iteration passes; otherwise it fails. ralph-runner tracks pass/fail trends and blocks completion if verification fails.

# Run tests
ralph-runner --prompt "Fix all type errors" --verify "mypy src/"

# Run a test suite
ralph-runner --prompt "Implement the search feature" --verify "pytest tests/test_search.py"

# Chain multiple checks
ralph-runner --prompt "Clean up the codebase" \
  --verify "ruff check src/ && mypy src/ && pytest tests/"

The verification trend is displayed as a convergence sequence:

✓✓✗✓✓  (4/5 passed, converging)

Output

All run artifacts are saved to ~/.ralph-runner/runs/<timestamp>-<prefix>/:

File	Description
`progress.md`	Cumulative progress notes from all iterations
`plan.md`	Task plan created in the first iteration
`stats.json`	Machine-readable stats (costs, tokens, timing)
`summary.md`	LLM-generated summary of accomplishments
`iter-N.jsonl`	Raw Claude stream output for iteration N
`iter-N.txt`	Claude's text output for iteration N

Resuming

If a run is interrupted, resume it:

ralph-runner --resume ~/.ralph-runner/runs/20260220-143000-fix-auth/ \
  --prompt "Fix the auth module"

Live Display

ralph-runner shows a rich terminal display during execution:

Spinner during Claude connection and MCP initialization
Live tool calls — see what Claude is doing in real time (reading files, running commands, editing code)
Heartbeat every 30 seconds showing elapsed time
Iteration summary with status, duration, cost, and token counts
Verification results with pass/fail trends
Final summary with total cost, token breakdown by model, and an LLM-generated accomplishment summary

Claude Code Plugin

ralph-runner ships as a Claude Code plugin, so you can invoke it directly from within Claude Code sessions. The repository doubles as both a pip-installable Python package and a Claude Code plugin marketplace.

Install from the marketplace

# Add the marketplace (one-time)
/plugin marketplace add gzxultra/ralph-runner

# Install the plugin
/plugin install ralph-runner@ralph-runner

# Update to latest version
/plugin marketplace update

Install from local clone

# Option A: add as a plugin
claude plugin add ./ralph-runner

# Option B: load for a single session
claude --plugin-dir ./ralph-runner

Available skills and commands

Command	Description
`/ralph-runner:run <task>`	Launch an outer-loop orchestration session
`/ralph-runner:status`	Check progress, costs, and results of runs
`/ralph-runner:resume`	Resume an interrupted run

Example plugin usage

> /ralph-runner:run Fix all failing tests and get to 100% pass rate
> /ralph-runner:run --verify "pytest" Refactor the database layer
> /ralph-runner:status
> /ralph-runner:resume

Architecture

ralph-runner/
├── .claude-plugin/
│   ├── plugin.json          # Plugin manifest (name, version, metadata)
│   └── marketplace.json     # Marketplace catalog for distribution
├── skills/
│   ├── run/SKILL.md         # Skill: launch an orchestration session
│   └── status/SKILL.md      # Skill: inspect run progress and costs
├── commands/
│   ├── run.md               # Slash command: /ralph-runner:run
│   ├── status.md            # Slash command: /ralph-runner:status
│   └── resume.md            # Slash command: /ralph-runner:resume
├── settings.json            # Default permission grants
├── src/ralph_runner/
│   ├── cli.py               # CLI entry point and main orchestration loop
│   ├── runner.py            # Core iteration engine — spawns and monitors Claude
│   ├── prompt.py            # Prompt construction with progress injection
│   ├── verify.py            # Verification command runner and trend tracking
│   ├── stats.py             # Stats, progress files, and summary generation
│   ├── display.py           # Terminal colors, spinners, formatting
│   ├── tools.py             # Tool-call description formatting
│   └── models.py            # Data classes (IterationResult)
├── tests/
│   └── test_basics.py       # Unit tests
├── pyproject.toml            # Python package configuration
├── LICENSE                   # MIT
└── README.md

License

MIT — see LICENSE.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Feb 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ralph_runner-0.1.0.tar.gz (4.7 MB view details)

Uploaded Feb 21, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ralph_runner-0.1.0-py3-none-any.whl (20.3 kB view details)

Uploaded Feb 21, 2026 Python 3

File details

Details for the file ralph_runner-0.1.0.tar.gz.

File metadata

Download URL: ralph_runner-0.1.0.tar.gz
Upload date: Feb 21, 2026
Size: 4.7 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.0rc1

File hashes

Hashes for ralph_runner-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`5d3226cd1ce42ddb15a9d3dda1928239cca674daa0167b36d29fc7594d862df5`
MD5	`7d94d37db011c01bf80c1e70137d0a1c`
BLAKE2b-256	`365d4d19eb2580c199deec212866dc55fc5bd9340180da9e15712f76e2f8d527`

See more details on using hashes here.

File details

Details for the file ralph_runner-0.1.0-py3-none-any.whl.

File metadata

Download URL: ralph_runner-0.1.0-py3-none-any.whl
Upload date: Feb 21, 2026
Size: 20.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.0rc1

File hashes

Hashes for ralph_runner-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f600516b743b6bb4d44d59a45f4b2a3e60c60e73a020ae92d92522c1ebe52a8f`
MD5	`13924dbfb3f0cae9445b21776cf1a4c1`
BLAKE2b-256	`666d9e3f1ae1d65a0059680e6f333fea43a5ee610cede24491e2087705a5e196`

See more details on using hashes here.

ralph-runner 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ralph-runner

How It Works

Installation

As a CLI tool

As a Claude Code plugin

Prerequisites

Quick Start

From the command line

From within Claude Code (plugin)

Usage

Required

Options

Modes

AFK Mode (default)

HITL Mode (Human-in-the-Loop)

Verification

Output

Resuming

Live Display

Claude Code Plugin

Install from the marketplace

Install from local clone

Available skills and commands

Example plugin usage

Architecture

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes