autocontext control plane for iterative strategy evolution.
Project description
autocontext
autocontext is the Python control-plane package for running scenarios, carrying forward validated knowledge, exporting artifacts, and distilling stable behavior into cheaper runtimes over time.
The intended use is to hand the harness a real task in plain language, let it solve or simulate the problem mostly hands-off, and then inspect the resulting traces, reports, playbooks, datasets, and optional distilled model.
Install
pip install autocontext
The current PyPI release line is autocontext==0.4.0.
The PyPI package name is now autocontext. The CLI entrypoint remains autoctx.
Working Directory
Run the commands in this README from the autocontext/ directory. The Python package, CLI entrypoint, tests, and migrations all live here.
What It Does
- Runs iterative generation loops against game scenarios and agent-task scenarios
- Adds a first-class
simulatesurface for modeled-world exploration, replay, compare, and export - Persists playbooks, hints, tools, reports, and snapshots across runs
- Supports staged validation, harness synthesis, and harness-aware routing
- Exports training data and runs autoresearch-style local training loops
- Exposes evaluation, validation, artifact, and discovery operations over MCP and HTTP
Surface Summary
The Python package is the full control-plane surface in this repo. It currently includes:
- generation-loop execution via
autoctx run - plain-language simulation via
autoctx simulate - local training workflows via
autoctx export-training-dataandautoctx train - scenario creation and materialization via
autoctx new-scenario - HTTP API and MCP server surfaces via
autoctx serveandautoctx mcp-serve
Some newer operator-facing surfaces are currently TypeScript-first:
autoctx investigateautoctx analyze- the interactive terminal UI via
npx autoctx tui
campaign currently lives in that same bucket: it has partial TypeScript CLI/API/MCP support, but the Python package does not expose a campaign control-plane workflow yet.
Quick Start
From the repo root:
cd autocontext
uv venv
source .venv/bin/activate
uv sync --group dev
Use the repo-level .env.example as the reference for available AUTOCONTEXT_* settings.
operator-in-the-loop is a runnable scenario family for escalation and clarification experiments. Use it when you want executable operator-loop simulations, judgment evaluation, and live-agent escalation workflow testing.
Run a deterministic local scenario:
AUTOCONTEXT_AGENT_PROVIDER=deterministic \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3
Run with Anthropic:
AUTOCONTEXT_AGENT_PROVIDER=anthropic \
AUTOCONTEXT_ANTHROPIC_API_KEY=... \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3
Run with Pi CLI (local Pi agent runtime):
AUTOCONTEXT_AGENT_PROVIDER=pi \
AUTOCONTEXT_PI_COMMAND=pi \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3
autoctx simulate now follows the effective architect-role runtime surface, so AUTOCONTEXT_ARCHITECT_PROVIDER and other role-routing overrides apply to live simulation generation as well.
Run with Pi RPC (remote Pi agent via HTTP):
AUTOCONTEXT_AGENT_PROVIDER=pi-rpc \
AUTOCONTEXT_PI_RPC_ENDPOINT=http://localhost:3284 \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3
Run with Hermes (via OpenAI-compatible gateway):
AUTOCONTEXT_AGENT_PROVIDER=openai-compatible \
AUTOCONTEXT_AGENT_BASE_URL=http://localhost:8080/v1 \
AUTOCONTEXT_AGENT_API_KEY=no-key \
AUTOCONTEXT_AGENT_DEFAULT_MODEL=hermes-3-llama-3.1-8b \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3
Start the API server:
uv run autoctx serve --host 127.0.0.1 --port 8000
Inspect http://127.0.0.1:8000/ for the API index after the server starts. For an interactive terminal UI, use the TypeScript package: npx autoctx tui.
Start the MCP server:
uv sync --group dev --extra mcp
uv run autoctx mcp-serve
Main CLI Commands
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3
uv run autoctx simulate --description "simulate deploying a web service with rollback"
uv run autoctx simulate --replay deploy_sim --variables threshold=0.9
uv run autoctx list
uv run autoctx status <run_id>
uv run autoctx replay <run_id> --generation 1
uv run autoctx run --scenario support_triage --gens 3
uv run autoctx benchmark --scenario support_triage --runs 5
uv run autoctx new-scenario --template prompt-optimization --name support_triage
uv run autoctx export-training-data --scenario support_triage --all-runs --output training/support_triage.jsonl
uv run autoctx train --scenario support_triage --data training/support_triage.jsonl --time-budget 300
uv run autoctx serve --host 127.0.0.1 --port 8000
uv run autoctx mcp-serve
uv run autoctx wait <condition_id> --json
Useful variants:
AUTOCONTEXT_AGENT_PROVIDER=anthropic AUTOCONTEXT_ANTHROPIC_API_KEY=... \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3
AUTOCONTEXT_AGENT_PROVIDER=deterministic AUTOCONTEXT_RLM_ENABLED=true \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3
Training Workflow
Export JSONL training data from completed runs:
uv run autoctx export-training-data \
--scenario support_triage \
--all-runs \
--output training/support_triage.jsonl
Launch the autoresearch-style training loop:
uv sync --group dev --extra mlx
uv run autoctx train \
--scenario support_triage \
--data training/support_triage.jsonl \
--time-budget 300
MLX training is host-only. It must run on an Apple Silicon macOS machine with Metal access. It will not run correctly inside a Docker sandbox on macOS.
If you only want to inspect generated training data first, export without training and open the JSONL directly.
For host setup details and OpenClaw automation via a file-based watcher bridge, see docs/mlx-training.md.
Configuration
Configuration is loaded from AUTOCONTEXT_* environment variables in src/autocontext/config/settings.py.
Common settings:
AUTOCONTEXT_AGENT_PROVIDERAUTOCONTEXT_EXECUTOR_MODEAUTOCONTEXT_MODEL_COMPETITORAUTOCONTEXT_MATCHES_PER_GENERATIONAUTOCONTEXT_MAX_RETRIESAUTOCONTEXT_JUDGE_PROVIDERAUTOCONTEXT_PI_TIMEOUT(defaults to 300 seconds for Pi-backed live runs)AUTOCONTEXT_RLM_ENABLEDAUTOCONTEXT_HARNESS_PREFLIGHT_ENABLEDAUTOCONTEXT_STAGED_VALIDATION_ENABLED
See the repo-level .env.example for a working starting point.
Repository Structure
autocontext/
src/autocontext/ Python package
tests/ Pytest suite
docs/ Package-specific documentation
migrations/ SQLite migrations
ts/ TypeScript package
infra/ Docker, Fly.io, bootstrap scripts
Validation and Development
uv run ruff check src tests
uv run mypy src
uv run pytest
If you change protocol messages, regenerate the derived protocol artifacts from the repo root:
cd ..
uv run --directory autocontext python scripts/generate_protocol.py
OpenClaw / ClawHub
autocontext exposes:
- artifact contracts for harnesses, policies, and distilled models
- REST and MCP operations for evaluate, validate, publish, import, and discover
- ClawHub skill manifests and scenario discovery metadata
- an adapter layer for running OpenClaw agents inside the harness
Additional Docs
- Canonical concept model
- Agent integration guide — CLI-first integration for external agents, MCP fallback, JSON output reference
- Sandbox modes
- MLX host training
- TypeScript package guide —
investigate,analyze, and interactive TUI surfaces - Demo data notes
- Copy-paste examples
- Change history
- Repository overview
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file autocontext-0.4.0.tar.gz.
File metadata
- Download URL: autocontext-0.4.0.tar.gz
- Upload date:
- Size: 1.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d1602626d00d4be9a301fa164927be0448bc2e9679663e80bf31de9c2746c319
|
|
| MD5 |
edaa5aa9120fe4d2c629687d3f763460
|
|
| BLAKE2b-256 |
ac30145b7ff8a06774e587c83d25efba3def0a66ae13d1389363b856cd59ed20
|
Provenance
The following attestation bundles were made for autocontext-0.4.0.tar.gz:
Publisher:
publish-python.yml on greyhaven-ai/autocontext
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
autocontext-0.4.0.tar.gz -
Subject digest:
d1602626d00d4be9a301fa164927be0448bc2e9679663e80bf31de9c2746c319 - Sigstore transparency entry: 1292900325
- Sigstore integration time:
-
Permalink:
greyhaven-ai/autocontext@b95952b139ace23eb12bda0c093c44b6313e535f -
Branch / Tag:
refs/tags/py-v0.4.0 - Owner: https://github.com/greyhaven-ai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-python.yml@b95952b139ace23eb12bda0c093c44b6313e535f -
Trigger Event:
push
-
Statement type:
File details
Details for the file autocontext-0.4.0-py3-none-any.whl.
File metadata
- Download URL: autocontext-0.4.0-py3-none-any.whl
- Upload date:
- Size: 789.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
aa041d45227d7c6200835e2f6a35556b918bf6b72680dffabd660c6ac2489db9
|
|
| MD5 |
67f89c2d14cf97ac14829082cb7ae3c9
|
|
| BLAKE2b-256 |
e4766643cd6c59c75615e2129d3c78e39cd21b49c0ec6ef6fc253d759634859a
|
Provenance
The following attestation bundles were made for autocontext-0.4.0-py3-none-any.whl:
Publisher:
publish-python.yml on greyhaven-ai/autocontext
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
autocontext-0.4.0-py3-none-any.whl -
Subject digest:
aa041d45227d7c6200835e2f6a35556b918bf6b72680dffabd660c6ac2489db9 - Sigstore transparency entry: 1292900336
- Sigstore integration time:
-
Permalink:
greyhaven-ai/autocontext@b95952b139ace23eb12bda0c093c44b6313e535f -
Branch / Tag:
refs/tags/py-v0.4.0 - Owner: https://github.com/greyhaven-ai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-python.yml@b95952b139ace23eb12bda0c093c44b6313e535f -
Trigger Event:
push
-
Statement type: