Millrace runtime rewrite baseline package

These details have not been verified by PyPI

Project description

Millrace

Millrace signal mark

Other agents win sprints. Millrace wins marathons.

Raw agent harnesses - Codex, Claude Code, Gemini - are built for sprints. Give them a tight spec in a greenfield repo, let them rip, ship the result. They are very good at this.

Millrace starts where those runs end. When the work spans sessions, accumulates state, needs to survive a crash without losing context, or has to gate on real acceptance criteria rather than "the agent said it was done", that is the problem Millrace was built for.

Millrace is a governed runtime for long-running agent work. The harness still does the local stage work. Millrace owns the queue, the compiled plan, runtime state, recovery paths, and closure behavior around that work.

If you are an agent and your harness supports repo-local skills, for example Codex or Claude Code, install docs/skills/millrace-autonomous-delegation/SKILL.md when you have authority to choose whether substantial work should enter Millrace. Once Millrace is selected or explicitly requested, use docs/skills/millrace-ops-agent-manual/SKILL.md as the operator runbook.

Agent Install Examples

Codex: if your environment supports repo-local skills, load docs/skills/millrace-autonomous-delegation/ for autonomous delegation decisions and docs/skills/millrace-ops-agent-manual/ for Millrace operation. The entry files are each package's SKILL.md.
Claude Code: use the same package path or entry file as the local project skill. Use the autonomous delegation skill only when the user or workspace has granted that authority; otherwise follow the ops manual handshake.
Other harnesses: if the harness accepts a SKILL.md, point it at docs/skills/millrace-autonomous-delegation/SKILL.md for the decision layer and docs/skills/millrace-ops-agent-manual/SKILL.md for operation; if it supports directory packages, prefer the containing directories.

Runtime Lifecycle

Need the full implementation-accurate flow instead of the landing-page view? See the standalone lifecycle chart.

flowchart TD
    A["Initialize workspace, then compile the plan"] --> B{"Deterministic tick loop"}
    B --> C["Process control inputs:<br/>mailbox commands, watcher intake, reconciliation"]
    C --> D{"Scheduler claim decision"}
    D -- planning probe, incident, or spec --> E["Planning loop:<br/>classify probes,<br/>interpret specs and incidents,<br/>govern remediation, emit executable work"]
    D -- execution task --> F["Execution loop:<br/>build, verify, repair, recover, update"]
    D -- learning request --> K["Learning loop:<br/>analyze runtime evidence,<br/>prepare optional skills,<br/>curate accepted updates"]
    D -- nothing claimable --> G{"Completion behavior eligible?"}
    G -- yes --> H["Arbiter closure pass"]
    G -- no --> I["Idle until the next tick"]
    E --> J["Runtime applies results,<br/>persists state, and routes the next action"]
    F --> J
    K --> J
    H --> J
    J --> B
    I --> B

Millrace does not try to replace raw harness reasoning with a thicker prompt. It wraps long-horizon work in a runtime with a few hard contracts:

Explicit workspace lifecycle: operators initialize workspaces with millrace init, update the Python package with their package manager, and use millrace upgrade only to preview or apply managed workspace asset refreshes.
Compiler-owned runtime structure: startup and config reload compile a fingerprinted plan; if inputs drift and the persisted plan is stale, the daemon refuses to keep running on a last-known-good plan.
Runtime-owned execution: stage results are routed by the runtime, mutation stays single-writer and serialized, and daemon scheduling follows the compiled plane scheduler. Default modes are serial; learning-enabled modes may run one Learning stage alongside one foreground Planning or Execution stage.
Bounded autonomous recovery: when queued same-lineage tasks are stranded behind a blocked predecessor caused by a classified transient runner, network, provider, or timeout failure, the daemon can requeue that dependency through an audited retry path. Semantic blocked states and durable local setup failures still require operator review.
Audited operator intervention: when work is bad intake rather than a retryable failure, operators can cancel, supersede, retarget, resolve, archive, or retry supported blocked work items through CLI/control commands. Live daemons receive those changes through the mailbox and apply them only at safe no-active-run mutation boundaries.
Opt-in quality loops: integrated Codex modes use a more expensive builder -> integrator -> checker execution path. Integrator reviews the Builder diff, runs explicit or discoverable gates, and writes integration_report.md before normal Checker QA.
Probe-first intake: lightweight probe requests enter Planning through Recon, which researches enough repo evidence to route the request into a generated execution task, a generated planning spec, a no-op, or a blocked packet without letting stages mutate queues directly.
Closure-safe remediation: runtime-generated planning handoff incidents preserve source work-item lineage, so same-root remediation remains claimable while unrelated root specs stay backpressured. Arbiter activates only when no lineage work remains and closure behavior is ready.
Inspectable governance and evidence: usage governance can pause and auto-resume between stages when configured quota rules are reached. Compiled execution capability grants describe which stage powers are granted, denied, approval-gated, unsupported, enforced, or advisory, while typed terminal results, status/monitor output, and persisted run artifacts keep post-run inspection grounded in runtime evidence.

The shipped core includes separate Planning and Execution loops. Learning-enabled modes add Analyst, Professor, Curator, and Librarian stages for evidence-backed skill improvement and post-Planner optional-skill preparation. Generic success-triggered learning is Analyst-first, Planner completion can trigger Librarian to install relevant remote optional skills into the workspace, and reviewed no-change learning can close as no-op instead of being treated as blocked. Blueprint Planning is available through opt-in blueprint_codex and blueprint_learning_codex modes. Both keep implementation inside the standard Execution loop, but route Planner output through Manager Blueprint, Contractor Blueprint, and Evaluator Blueprint before approved generated tasks enter Execution. The learning-enabled variant keeps the normal post-Planner Librarian trigger.

For operational details, see docs/runtime/README.md, docs/runtime/millrace-cli-reference.md, and docs/runtime/millrace-workspace-baselines-and-upgrades.md.

Early Proof

Millrace's strongest early proof point is self-referential: Python millrace-ai built the first released Rust parity implementation of Millrace.

The campaign used Python millrace-ai v0.16.1 in learning_codex mode to drive the Rust millrace-ai v0.1.0 implementation from seeded parity ideas through planning, execution, QA, Arbiter closure, remediation, and release-ready workspace state. After the operator started the daemon, there were no pause/resume cycles, continuation prompts, or external code interventions. The run proceeded to completion with zero outside assistance. The only external post-run action was publication: Millrace's ops agent published the completed result to GitHub and as a Rust crate without touching the code Millrace had produced.

Headline evidence from the autonomous build campaign:

Metric	Value
Seeded parity slices	`8`
Completed specs	`11`
Completed tasks	`57`
Recorded runs	`99`
Recorded stage results	`261`
Resolved incidents/remediations	`5`
Wall-clock campaign span	`28h 9m 49.5s`
Input plus output tokens	`730,406,757`
Cached-input share	`95.47%`
Release tag	`v0.1.0`
Release commit	`4c82685`

The release moved the Rust crate from an initial claimed package to a parity runtime across 193 changed files and 87,992 insertions. The finished crate also passed a post-publish real daemon smoke: an installed millrace-ai v0.1.0 crate completed a real Codex-backed builder -> checker -> updater run in 6m 32.9s and produced the expected filesystem output.

The caveat is important and narrow: this proves that Python Millrace could autonomously build the Rust parity runtime. It does not claim that the Rust crate independently self-hosted the whole port campaign.

Read the full public evidence pack here:

millrace-rs-port-docs

How Millrace Fits With Raw Harnesses

Millrace is not a replacement for Codex, Claude Code, Aider, or similar raw agent harnesses. It is the runtime layer you put around them when the work is too long-running, stateful, or recovery-sensitive to trust to a single session.

Think of the split this way:

the raw harness reasons locally, edits code, and emits a stage result
Millrace decides which stage runs next and what contract that stage receives
Millrace persists queue state, runtime snapshots, artifacts, and recovery context after each handoff
the operator or ops agent decides when work enters the runtime and how the workspace is configured

If a direct Codex or Claude Code session is enough, use the direct session. Millrace matters when the work has crossed out of sprint territory.

When To Use Millrace

Use Millrace when:

the work will outlast a single agent session
you want explicit stage gates instead of "done enough" chat conclusions
recovery and resumability matter
you need durable state, queue artifacts, and run history under <workspace>/millrace-agents/
completion has to clear a real closure pass rather than informal optimism
an operator or ops agent is intentionally managing intake and runtime control

Do not use Millrace when:

the task is small, bounded, and cleanly handled in one direct session
the work is exploratory and governance would add more overhead than value
single-session throughput matters more than persistence and recovery
nobody is available to manage runtime configuration, intake, and workspace hygiene

60-Second Proof

Install:

pip install millrace-ai

Then point Millrace at a workspace:

export WORKSPACE=/absolute/path/to/your/workspace

millrace init --workspace "$WORKSPACE"
millrace compile validate --workspace "$WORKSPACE"
millrace compile graph --workspace "$WORKSPACE"
millrace run daemon --max-ticks 1 --workspace "$WORKSPACE"
millrace status --workspace "$WORKSPACE"

That flow proves seven things quickly:

workspace bootstrap is explicit and creates the managed baseline under millrace-agents/
the selected mode compiles into one persisted compiled_plan.json before execution
compile output fingerprints the selected mode, runtime config, and packaged assets so compile show / status can report whether the plan is current or stale
that compiled plan carries node bindings, execution capability grants, intake entries, recovery policies, closure-target activation, and post-stage routing
compile graph exposes that legal topology as a stable compiled-stage-graph export, while runs trace <run_id> shows the concrete path one run actually followed
the shipped default_codex mode freezes closure behavior directly into that single compiled artifact
status and run inspection carry compiled-plan identity so operators can tie runtime activity back to the compiled plan that produced it
the runtime can execute a deterministic tick and report persisted status

For a visible long-running session, use millrace run daemon --monitor basic. The default daemon remains quiet unless that monitor is requested explicitly. The basic monitor is a human-facing stream: it compacts stage labels, shortens long run ids for display, omits unknown token filler, and leaves full ids and artifacts to millrace runs ... inspection commands. The basic monitor prints the first idle reason=no_work line immediately, then throttles repeated no_work idles to a 6-hour heartbeat until runtime activity or a different idle reason appears. Use --monitor-log <path> when you want the same clean monitor stream written to a file without necessarily printing live monitor lines to stdout.

For an optional local dashboard, install the separate millrace-web package from PyPI and run millrace-web serve --workspace "$WORKSPACE". The web dashboard is a read-only observer with Detail and Flow views; it is not included in the millrace-ai wheel and does not acquire runtime ownership locks.

When the packaged workspace baseline changes, use millrace upgrade first to preview the managed-file classifications, then millrace upgrade --apply to apply safe baseline updates. This does not update the installed Python package; for runtime-code fixes, update millrace-ai through the environment's package manager first and verify with millrace --version or millrace version. If compile inputs drift and the persisted plan is stale, runtime startup and config reload refuse to keep running on the stale plan.

Stage config supports all execution, planning, and learning stage names. stages.<stage>.thinking_level sets a runner-neutral per-stage thinking level that the compiler freezes into node bindings, stage requests, runner artifacts, persisted stage results, and run inspection. Codex translates it to model_reasoning_effort="<value>"; Pi translates it to --thinking <value>. The older stages.<stage>.model_reasoning_effort field remains accepted as a Codex compatibility alias.

Execution capability policy is configured under [execution_capabilities]. Grant-affecting changes are recompile changes. Approval-gated grants are handled through millrace approvals ls/show/approve/deny, and millrace runs show prints compact per-stage grant/support summaries for completed or blocked runs.

Canonical shipped modes today:

default_codex
default_pi
default_codex_integrated
blueprint_codex

Learning-enabled shipped modes:

learning_codex
learning_pi
learning_codex_integrated
blueprint_learning_codex

The standard learning modes use the same execution and planning topology as the default modes, while blueprint_learning_codex uses the Blueprint Planning topology. All learning-enabled modes add learning.standard and freeze learning trigger rules into the compiled plan.

The integrated Codex modes are opt-in quality loops. They keep the same Planning and optional Learning behavior as their non-integrated counterparts, but select execution.with_integrator so every successful Builder result runs through Integrator before Checker. Existing workspaces receive those managed assets with millrace upgrade --apply after updating the installed package.

The Blueprint Codex modes are opt-in Planning loops. They select planning.blueprint, use standard Execution, and validate implementation plans before promoting generated tasks. blueprint_codex omits Learning; blueprint_learning_codex adds learning.standard.

Compatibility alias:

standard_plain -> default_codex

Read By Journey

Need the single dense system explainer first? Start with docs/millrace-technical-overview.md.

Start Here

docs/runtime/README.md
docs/skills/millrace-autonomous-delegation/SKILL.md if you are authorized to decide whether substantial work should use Millrace
docs/skills/millrace-ops-agent-manual/SKILL.md if you are operating Millrace as an agent

Run It

docs/runtime/millrace-cli-reference.md
docs/runtime/millrace-runtime-architecture.md
docs/runtime/millrace-usage-governance.md

Understand It

docs/runtime/millrace-compiler-and-frozen-plans.md
docs/runtime/millrace-modes-and-loops.md
docs/runtime/millrace-blueprint-planning.md
docs/runtime/millrace-arbiter-and-completion-behavior.md
docs/runtime/millrace-runner-architecture.md

Extend It

docs/runtime/millrace-entrypoint-mapping.md
docs/runtime/millrace-loop-authoring.md
docs/skills/millrace-loop-authoring/SKILL.md
docs/source-package-map.md

Status

Millrace ships as a maintained pre-1.0 runtime line. If you depend on exact behavior, pin to a patch version and verify against the current CLI and docs rather than assuming every newer build is identical.

License

See LICENSE.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.20.3

May 28, 2026

0.20.2

May 28, 2026

This version

0.20.1

May 23, 2026

0.20.0

May 21, 2026

0.19.0

May 18, 2026

0.18.6

May 13, 2026

0.18.5

May 13, 2026

0.18.4

May 13, 2026

0.18.3

May 12, 2026

0.18.2

May 10, 2026

0.18.1

May 9, 2026

0.18.0

May 5, 2026

0.17.4

May 3, 2026

0.17.3

May 3, 2026

0.17.2

May 3, 2026

0.17.1

May 3, 2026

0.17.0

May 3, 2026

0.16.3

Apr 29, 2026

0.16.2

Apr 28, 2026

0.16.1

Apr 28, 2026

0.16.0

Apr 28, 2026

0.15.9

Apr 28, 2026

0.15.8

Apr 28, 2026

0.15.7

Apr 27, 2026

0.15.6

Apr 27, 2026

0.15.5

Apr 27, 2026

0.15.4

Apr 27, 2026

0.15.3

Apr 26, 2026

0.15.2

Apr 26, 2026

0.15.1

Apr 26, 2026

0.15.0

Apr 25, 2026

0.14.1

Apr 25, 2026

0.14.0

Apr 24, 2026

0.13.0

Apr 21, 2026

0.12.5

Apr 21, 2026

0.12.4

Apr 21, 2026

0.12.3

Apr 20, 2026

0.12.2

Apr 20, 2026

0.12.1

Apr 19, 2026

0.12.0

Apr 19, 2026

0.11.5

Apr 17, 2026

0.11.4

Apr 17, 2026

0.11.2

Apr 17, 2026

0.11.1

Apr 17, 2026

0.11.0

Apr 17, 2026

0.10.1

Apr 14, 2026

0.10.0

Apr 12, 2026

0.9.3

Apr 11, 2026

0.9.2

Apr 11, 2026

0.9.1

Apr 11, 2026

0.9.0

Apr 10, 2026

0.8.1

Apr 10, 2026

0.8.0

Apr 10, 2026

0.7.1

Apr 10, 2026

0.7.0

Apr 9, 2026

0.6.3

Apr 8, 2026

0.6.2

Apr 8, 2026

0.6.1

Apr 8, 2026

0.6.0

Apr 7, 2026

0.5.1

Apr 7, 2026

0.5.0

Apr 7, 2026

0.4.3

Apr 6, 2026

0.4.2

Apr 6, 2026

0.4.1

Apr 5, 2026

0.4.0

Apr 5, 2026

0.3.1

Apr 5, 2026

0.3.0

Apr 5, 2026

0.2.1

Apr 1, 2026

0.2.0

Mar 31, 2026

0.1.2

Mar 30, 2026

0.1.1

Mar 30, 2026

0.1.0

Mar 30, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

millrace_ai-0.20.1.tar.gz (423.9 kB view details)

Uploaded May 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

millrace_ai-0.20.1-py3-none-any.whl (599.5 kB view details)

Uploaded May 23, 2026 Python 3

File details

Details for the file millrace_ai-0.20.1.tar.gz.

File metadata

Download URL: millrace_ai-0.20.1.tar.gz
Upload date: May 23, 2026
Size: 423.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for millrace_ai-0.20.1.tar.gz
Algorithm	Hash digest
SHA256	`7ce50b032800c901bbfae0c0f74acfd37d4531187a787d02dddb2a79a0b6d7bc`
MD5	`10e43cb5b9e19d78370b7468ca07dfb3`
BLAKE2b-256	`b292fd3b625e68a94f30623d73291c3daaf4a07eb955dbf6cd689bf40d747749`

See more details on using hashes here.

Provenance

The following attestation bundles were made for millrace_ai-0.20.1.tar.gz:

Publisher: publish-to-pypi.yml on tim-osterhus/millrace

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: millrace_ai-0.20.1.tar.gz
- Subject digest: 7ce50b032800c901bbfae0c0f74acfd37d4531187a787d02dddb2a79a0b6d7bc
- Sigstore transparency entry: 1610140981
- Sigstore integration time: May 23, 2026
Source repository:
- Permalink: tim-osterhus/millrace@83178d37497d7c299dbcaa50264ee0e51b150a18
- Branch / Tag: refs/tags/v0.20.1
- Owner: https://github.com/tim-osterhus
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yml@83178d37497d7c299dbcaa50264ee0e51b150a18
- Trigger Event: push

File details

Details for the file millrace_ai-0.20.1-py3-none-any.whl.

File metadata

Download URL: millrace_ai-0.20.1-py3-none-any.whl
Upload date: May 23, 2026
Size: 599.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for millrace_ai-0.20.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`fcaea94e5fdb8404c413f1af61d89a9294bbbed45baff384a22e7303d3863e4d`
MD5	`caaf77a73f51bb4005f0e30d2d87b6d8`
BLAKE2b-256	`ff430bf5edb427d29b545bd794db2d8e0811214de6e64fb03a559653b26b9ee5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for millrace_ai-0.20.1-py3-none-any.whl:

Publisher: publish-to-pypi.yml on tim-osterhus/millrace

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: millrace_ai-0.20.1-py3-none-any.whl
- Subject digest: fcaea94e5fdb8404c413f1af61d89a9294bbbed45baff384a22e7303d3863e4d
- Sigstore transparency entry: 1610141041
- Sigstore integration time: May 23, 2026
Source repository:
- Permalink: tim-osterhus/millrace@83178d37497d7c299dbcaa50264ee0e51b150a18
- Branch / Tag: refs/tags/v0.20.1
- Owner: https://github.com/tim-osterhus
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yml@83178d37497d7c299dbcaa50264ee0e51b150a18
- Trigger Event: push

millrace-ai 0.20.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Millrace

Other agents win sprints. Millrace wins marathons.

Agent Install Examples

Runtime Lifecycle

Early Proof

How Millrace Fits With Raw Harnesses

When To Use Millrace

60-Second Proof

Read By Journey

Start Here

Run It

Understand It

Extend It

Status

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance