Multi-agent session handoff framework for Claude Code and Codex CLI

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

newblacc

These details have not been verified by PyPI

Project description

superharness

Multi-agent task coordination for Claude Code and Codex CLI

superharness lets AI coding assistants work on the same project without stepping on each other. It provides a shared contract, queue-based delegation, and handoff/ledger state so tasks survive across sessions.

AI agent installing this? Read docs/INSTALL-AGENT.md — it tells you exactly what to detect, what to ask the user (just two questions), and how to set everything up without human terminal interaction.

Using superharness

Via Claude Code or Codex CLI (recommended)

Step 1 — Install superharness once (terminal):

pipx install superharness

Alternative: install from source

curl -fsSL https://raw.githubusercontent.com/celstnblacc/superharness/main/scripts/install-remote.sh | bash
# export PATH="$HOME/.local/bin:$PATH"  # add to ~/.zshrc or ~/.bashrc if needed

Or clone manually:

git clone https://github.com/celstnblacc/superharness.git ~/.local/share/superharness
bash ~/.local/share/superharness/scripts/install-wrapper.sh

Step 2 — Go to your project and open Claude Code or Codex CLI.

Step 3 — Type these phrases directly to the agent:

shux init              # bootstrap .superharness/ for this project
shux doctor            # check prerequisites and protocol health
shux contract          # show all tasks with status and next-task suggestion
shux continue          # resume active contract automatically
shux delegate <id>     # create task + enqueue in one step (task must be plan_approved or later)
shux test-type <id>    # set mandatory test types for a task
shux verify <id>       # record verification result (pass/fail)
shux close <id>        # mark done (task must be report_ready or review_passed); use --force to bypass
shux task create       # create a task with --blocked-by, --tdd-red/green/refactor, --criteria flags
shux task status       # update task lifecycle status (todo → plan_proposed → plan_approved → in_progress → report_ready → done)
shux status            # dashboard: tasks, watcher, profile
shux recall <keywords> # search past handoffs and ledger
shux uninstall         # remove watcher and system artifacts for this project
shux hygiene           # validate protocol compliance (contract, handoffs, ledger)
shux hygiene --repair  # auto-fix missing handoffs, ledger entries, and stuck statuses
shux dashboard         # open browser dashboard
shux watch             # start continuous watcher in foreground
shux update            # pull latest superharness + refresh templates, hooks, and watcher
shux discuss           # start or manage a cross-agent discussion (topic, owners, optional ID)
shux install-hooks     # merge adapter hooks into ~/.claude/settings.json (portable, run once per machine)
shux init --skip-hooks # init without modifying ~/.claude/settings.json (for CI or conservative setups)
shux benchmark         # show dispatch cost/duration leaderboard (--top N, --agents)
shux diff <id>         # preview agent changes for a task before closing (--stat, --base)
shux daemon start      # start background watcher daemon (portable, no launchd/systemd needed)
shux daemon stop       # stop the daemon
shux daemon status     # show daemon running state and PID
shux pack export       # bundle .superharness/ into a portable .tar.gz for handoff
shux pack import       # restore a pack into a new project
shux help              # show all shux shortcuts in the terminal

That's it. Steps 1 and 2 are one-time. From then on, shux contract starts every session.

Intelligence layer (v1.7.0)

Dispatch is now smarter. These features activate automatically — no extra setup needed.

Feature	What it does
Pre-flight analysis	Validates task spec, TDD block, dependencies, and git state before dispatch. Blocks on unresolved deps, warns on missing criteria.
Complexity estimator	Scores acceptance criteria + TDD scope and suggests single/fanout/swarm mode.
Failure pattern matching	15 built-in classifiers (ImportError, timeout, git conflict, etc.) analyze errors and inject fix hints into the next dispatch.
Skill extraction	When a task completes, extracts category, techniques, and diff stats into `skills.yaml`. Future dispatches for similar tasks get technique hints.
Benchmark leaderboard	Tracks cost, duration, and outcome per dispatch in `benchmark.jsonl`. View with `shux benchmark`.
Parallel fan-out	Run N agents concurrently on isolated git worktrees. Use `fanout_dispatch()` from the SDK.
Swarm mode	N workers solve the same task, then an Opus reviewer picks the best solution. Optional auto-merge.

Via Terminal (alternative)

For scripting, CI, or users who prefer direct shell access.

Requires: bash, python3. See Prerequisites.

# Try first — no install needed
PYTHONPATH=src python3 -m superharness demo

# Install CLI
pipx install superharness && superharness --version

# Initialize project
cd /path/to/project
superharness init --interactive   # or: superharness init "Name" "Stack" "active"

# Verify
superharness doctor --project .

# Contract snapshot
superharness contract today --project .

# Delegate to agent
superharness delegate --to codex-cli --project .

# Queue management
superharness enqueue --project . --to codex-cli --task my-task --priority 1
superharness dispatch --project . --to codex-cli

# Protocol hygiene + browser dashboard
superharness hygiene --project .
superharness dashboard-ui --project .

Run tests:

uv sync --dev
pytest tests/ -q

Full terminal reference: docs/GUIDE.md

Quick Links

📘 User Guide — Commands, background watcher, troubleshooting 🏗️ Architecture — Why it exists, how it works, design decisions 🔒 Security — Threat model and operational safety notes

What You Get

shux shortcuts — Control superharness from inside Claude Code or Codex CLI
superharness init — Bootstrap protocol files (.superharness/); auto-installs Claude Code hooks and background watcher (macOS)
superharness task — Create and update tasks: --blocked-by <id> dependency tracking, --tdd-red/green/refactor TDD block, --criteria acceptance criteria; task status enforces the full lifecycle (todo → plan_proposed → plan_approved → in_progress → report_ready → done)
superharness delegate — Launch agent with contract context (requires task status ≥ plan_approved; auto model routing)
superharness verify — Record verification result before closing a task
superharness close — Close a verified task (requires report_ready or review_passed; use --force to bypass lifecycle gate)
superharness enqueue|dispatch|watch — Queue-based task routing
superharness hygiene — Protocol compliance checks
superharness watch --foreground — Cross-platform continuous watcher
superharness dashboard-ui — Browser dashboard: inbox, tasks, watcher state, enqueue with TDD instructions
superharness doctor — Prerequisite and setup health check
superharness uninstall — Clean removal of system artifacts
Background watcher — Unattended execution via macOS launchd or Linux systemd (opt-in)

Is this for me?

superharness is for you if any of these are true:

You use Claude Code or Codex CLI and find yourself re-explaining project context at the start of every session
You want to hand off a task to one agent while you work with another
You need an append-only audit trail of what each agent did and decided
You run agents unattended in the background (e.g. via launchd/systemd)

You probably don't need superharness if you only ever run a single agent interactively and don't switch between sessions.

What you need to use it

Feature	Requirements
Core protocol (contracts, handoffs, ledger)	`bash`, `python3`
Agent shortcuts (`shux`)	+ `claude` or `codex` CLI
Background auto-dispatch	+ launchd (macOS) or systemd (Linux)
Browser dashboard	+ `python3 -m http.server` (built-in)

You can start with just the core and add agent CLIs and background services later. --print-only mode lets you preview every dispatch without launching anything.

Platform Support

Cross-platform: macOS, Linux, Windows. All user-facing commands are Python and work everywhere python3 is available. CI runs on all three platforms.

Background watcher has automated service installers for macOS (launchd), Linux (systemd), and Windows (Task Scheduler via schtasks.exe). superharness watch --foreground works everywhere as an alternative.

Prerequisites

python3 3.11+ + pyyaml — uv sync --dev (or pip install pyyaml click ruamel.yaml)
bash — only needed for macOS/Linux watcher service install scripts; not required on Windows or for any core commands
claude CLI (for Claude delegation commands): npm install -g @anthropic-ai/claude-code
codex CLI (for Codex delegation commands): npm install -g @openai/codex
macOS launchd or Linux systemd for background watcher (see Platform Support); --foreground mode works everywhere

Project Runtime State

Per-project state lives in .superharness/:

.superharness/
├── contract.yaml          # tasks, decisions, failures
├── handoffs/              # session handoff state
├── ledger.md              # append-only event log
├── decisions.yaml         # cross-agent ADRs
├── failures.yaml          # failure memory
└── inbox.yaml             # dispatch queue

Architecture details: docs/ARCHITECTURE.md

Repository Layout

superharness/
├── superharness            # thin Bash shim → delegates to Python
├── src/superharness/       # Python CLI + engine + command modules
├── protocol/              # protocol spec + templates
├── adapters/              # Claude/Codex adapter assets
├── scripts/               # launchd installer + CI guard scripts
├── docs/                  # architecture and user guide
├── tests/                 # unit/integration/e2e tests
└── CHANGELOG.md

Security Note

The background watcher enables unattended execution (agents run without human supervision). This is powerful but requires explicit confirmation:

macOS (launchd):

bash scripts/install-launchd-inbox-watcher.sh \
  --project /path/to/project \
  --interval 30 \
  --confirm-non-interactive yes \
  --confirm-skip-permissions yes

Linux (systemd):

CONFIRM_NON_INTERACTIVE=yes bash scripts/install-systemd-inbox-watcher.sh \
  --project /path/to/project \
  --interval 30

Read the full threat model: SECURITY.md

Current Version

Current version: v1.1.1

See CHANGELOG.md for the full iteration log.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

newblacc

These details have not been verified by PyPI

Release history Release notifications | RSS feed

1.15.2

Apr 7, 2026

1.15.1

Apr 7, 2026

1.15.0

Apr 7, 2026

1.14.0

Apr 7, 2026

1.13.0

Apr 7, 2026

1.11.1

Apr 6, 2026

1.11.0

Apr 6, 2026

1.10.6

Apr 6, 2026

1.10.5

Apr 6, 2026

1.10.4

Apr 6, 2026

1.10.3

Apr 6, 2026

1.10.2

Apr 6, 2026

1.10.1

Apr 6, 2026

1.10.0

Apr 6, 2026

This version

1.8.0

Apr 6, 2026

1.5.0

Apr 3, 2026

1.3.4

Mar 30, 2026

1.2.8

Mar 28, 2026

1.2.7

Mar 26, 2026

1.2.4

Mar 26, 2026

1.2.3

Mar 26, 2026

1.2.2

Mar 26, 2026

1.2.1

Mar 26, 2026

1.1.1

Mar 21, 2026

1.0.0

Mar 19, 2026

0.9.23

Mar 18, 2026

0.9.22

Mar 18, 2026

0.9.21

Mar 16, 2026

0.9.16

Mar 16, 2026

0.9.10

Mar 15, 2026

0.9.9

Mar 15, 2026

0.9.8

Mar 15, 2026

0.9.7

Mar 15, 2026

0.9.6

Mar 15, 2026

0.9.5

Mar 15, 2026

0.9.4

Mar 15, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

superharness-1.8.0.tar.gz (222.5 kB view details)

Uploaded Apr 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

superharness-1.8.0-py3-none-any.whl (280.1 kB view details)

Uploaded Apr 6, 2026 Python 3

File details

Details for the file superharness-1.8.0.tar.gz.

File metadata

Download URL: superharness-1.8.0.tar.gz
Upload date: Apr 6, 2026
Size: 222.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for superharness-1.8.0.tar.gz
Algorithm	Hash digest
SHA256	`b1c20782c284908a5f7c841ea128d6ad1c41a8943a0c8780b759eaf4e9a66e7c`
MD5	`bc4813e111514d0415e14ed5ac62f668`
BLAKE2b-256	`06ea47afa25dd5844b1ff537550fd9fe3c5393757bc1da045f826092421c6f00`

See more details on using hashes here.

Provenance

The following attestation bundles were made for superharness-1.8.0.tar.gz:

Publisher: publish.yml on celstnblacc/superharness

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: superharness-1.8.0.tar.gz
- Subject digest: b1c20782c284908a5f7c841ea128d6ad1c41a8943a0c8780b759eaf4e9a66e7c
- Sigstore transparency entry: 1242692193
- Sigstore integration time: Apr 6, 2026
Source repository:
- Permalink: celstnblacc/superharness@a18fe2602d3a992d6f88b2db3ed39a0a879ee40a
- Branch / Tag: refs/tags/v1.8.0
- Owner: https://github.com/celstnblacc
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@a18fe2602d3a992d6f88b2db3ed39a0a879ee40a
- Trigger Event: release

File details

Details for the file superharness-1.8.0-py3-none-any.whl.

File metadata

Download URL: superharness-1.8.0-py3-none-any.whl
Upload date: Apr 6, 2026
Size: 280.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for superharness-1.8.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c1574cb9995867481d5dac3d715c94e04f8bdd1ff740f09545c64333e59b7428`
MD5	`a283e38f77cd4feca7988bca207cab14`
BLAKE2b-256	`5ca4da832a0aa047580adc5195413c7ba288f82e6ac4c250b776a7afcda55d36`

See more details on using hashes here.

Provenance

The following attestation bundles were made for superharness-1.8.0-py3-none-any.whl:

Publisher: publish.yml on celstnblacc/superharness

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: superharness-1.8.0-py3-none-any.whl
- Subject digest: c1574cb9995867481d5dac3d715c94e04f8bdd1ff740f09545c64333e59b7428
- Sigstore transparency entry: 1242692198
- Sigstore integration time: Apr 6, 2026
Source repository:
- Permalink: celstnblacc/superharness@a18fe2602d3a992d6f88b2db3ed39a0a879ee40a
- Branch / Tag: refs/tags/v1.8.0
- Owner: https://github.com/celstnblacc
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@a18fe2602d3a992d6f88b2db3ed39a0a879ee40a
- Trigger Event: release

superharness 1.8.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

superharness

Using superharness

Via Claude Code or Codex CLI (recommended)

Intelligence layer (v1.7.0)

Via Terminal (alternative)

Quick Links

What You Get

Is this for me?

What you need to use it

Platform Support

Prerequisites

Project Runtime State

Repository Layout

Security Note

Current Version

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance