autocontext control plane for iterative strategy evolution.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

autocontext

autocontext is the Python control-plane package for running scenarios, carrying forward validated knowledge, exporting artifacts, and distilling stable behavior into cheaper runtimes over time.

The intended use is to hand the harness a real task in plain language, let it solve or simulate the problem mostly hands-off, and then inspect the resulting traces, reports, playbooks, datasets, and optional distilled model.

Install

pip install autocontext

The current PyPI release line is autocontext==0.4.4. The PyPI package name is now autocontext. The CLI entrypoint remains autoctx.

Working Directory

Run the commands in this README from the autocontext/ directory. The Python package, CLI entrypoint, tests, and migrations all live here.

What It Does

Runs iterative generation loops against game scenarios and agent-task scenarios
Adds a first-class simulate surface for modeled-world exploration, replay, compare, and export
Persists playbooks, hints, tools, reports, and snapshots across runs
Supports staged validation, harness synthesis, and harness-aware routing
Exports training data and runs autoresearch-style local training loops
Exposes evaluation, validation, artifact, and discovery operations over MCP and HTTP

Surface Summary

The Python package is the full control-plane surface in this repo. It currently includes:

generation-loop execution via autoctx run
plain-language simulation via autoctx simulate
plain-language investigation via autoctx investigate
local training workflows via autoctx export-training-data and autoctx train
scenario creation and materialization via autoctx new-scenario
HTTP API and MCP server surfaces via autoctx serve and autoctx mcp-serve

Some newer operator-facing surfaces are currently TypeScript-first:

autoctx analyze
the interactive terminal UI via npx autoctx tui

campaign currently lives in that same bucket: it has partial TypeScript CLI/API/MCP support, but the Python package does not expose a campaign control-plane workflow yet.

Quick Start

From the repo root:

cd autocontext
uv venv
source .venv/bin/activate
uv sync --group dev

Use the repo-level .env.example as the reference for available AUTOCONTEXT_* settings and supported provider-native credential aliases such as ANTHROPIC_API_KEY.

operator-in-the-loop is a runnable scenario family for escalation and clarification experiments. Use it when you want executable operator-loop simulations, judgment evaluation, and live-agent escalation workflow testing.

Run a deterministic local scenario:

AUTOCONTEXT_AGENT_PROVIDER=deterministic \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3

Run with Anthropic:

AUTOCONTEXT_AGENT_PROVIDER=anthropic \
ANTHROPIC_API_KEY=... \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3

ANTHROPIC_API_KEY is the preferred Anthropic credential env var. AUTOCONTEXT_ANTHROPIC_API_KEY remains supported as a compatibility alias.

Run with Claude CLI (claude -p via a local authenticated Claude Code runtime):

AUTOCONTEXT_AGENT_PROVIDER=claude-cli \
AUTOCONTEXT_CLAUDE_MODEL=sonnet \
AUTOCONTEXT_CLAUDE_TIMEOUT=300 \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3

For longer live prompts, autoctx solve, autoctx judge, and autoctx improve all accept --timeout <seconds>. autoctx solve also accepts --generation-time-budget <seconds> to cap per-generation solve runtime. You can still use provider env vars such as AUTOCONTEXT_CLAUDE_TIMEOUT or AUTOCONTEXT_PI_TIMEOUT.

Run with Codex CLI (codex exec via a local authenticated Codex runtime):

AUTOCONTEXT_AGENT_PROVIDER=codex \
AUTOCONTEXT_CODEX_MODEL=o4-mini \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3

Run with Pi CLI (local Pi agent runtime):

AUTOCONTEXT_AGENT_PROVIDER=pi \
AUTOCONTEXT_PI_COMMAND=pi \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3

autoctx simulate now follows the effective architect-role runtime surface, so AUTOCONTEXT_ARCHITECT_PROVIDER, other role-routing overrides, and per-call --provider <name> overrides all apply to live simulation generation.

autoctx investigate now ships as a first-class Python CLI surface as well. It uses the architect runtime for investigation-spec synthesis and the analyst runtime for hypothesis generation, so role-routing overrides apply there too.

Run with Pi RPC (local Pi subprocess using pi --mode rpc JSONL):

AUTOCONTEXT_AGENT_PROVIDER=pi-rpc \
AUTOCONTEXT_PI_COMMAND=pi \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3

For deterministic evals where Pi should ignore repo-local AGENTS.md / CLAUDE.md, add:

AUTOCONTEXT_PI_NO_CONTEXT_FILES=true

Run with Hermes (via OpenAI-compatible gateway):

AUTOCONTEXT_AGENT_PROVIDER=openai-compatible \
AUTOCONTEXT_AGENT_BASE_URL=http://localhost:8080/v1 \
AUTOCONTEXT_AGENT_API_KEY=no-key \
AUTOCONTEXT_AGENT_DEFAULT_MODEL=hermes-3-llama-3.1-8b \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3

Start the API server:

uv run autoctx serve --host 127.0.0.1 --port 8000

Inspect http://127.0.0.1:8000/ for the API index after the server starts. For an interactive terminal UI, use the TypeScript package: npx autoctx tui.

Start the MCP server:

uv sync --group dev --extra mcp
uv run autoctx mcp-serve

Main CLI Commands

uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3
uv run autoctx simulate --description "simulate deploying a web service with rollback"
uv run autoctx simulate --description "simulate deploying a web service with rollback" --provider claude-cli
uv run autoctx investigate --description "why did conversion drop after Tuesday's release"
uv run autoctx queue add --task-prompt "Write a 1-line fact about primes" --rubric "correct" --threshold 0.8 --rounds 2
uv run autoctx simulate --replay deploy_sim --variables threshold=0.9
uv run autoctx list
uv run autoctx status <run_id>
uv run autoctx replay <run_id> --generation 1
uv run autoctx run --scenario support_triage --gens 3
uv run autoctx benchmark --scenario support_triage --runs 5
uv run autoctx new-scenario --template prompt-optimization --name support_triage
uv run autoctx export-training-data --scenario support_triage --all-runs --output training/support_triage.jsonl
uv run autoctx train --scenario support_triage --data training/support_triage.jsonl --time-budget 300
uv run autoctx serve --host 127.0.0.1 --port 8000
uv run autoctx mcp-serve
uv run autoctx wait <condition_id> --json

Saved custom scenarios under knowledge/_custom_scenarios/ can be rerun and benchmarked by name once their spec.json has been persisted, so the new-scenario / solve workflow lines up with the named run and benchmark surfaces.

Useful variants:

AUTOCONTEXT_AGENT_PROVIDER=anthropic ANTHROPIC_API_KEY=... \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3

AUTOCONTEXT_AGENT_PROVIDER=anthropic \
ANTHROPIC_API_KEY=sk-ant-primary \
AUTOCONTEXT_COMPETITOR_PROVIDER=openai-compatible \
AUTOCONTEXT_COMPETITOR_API_KEY=sk-role \
AUTOCONTEXT_COMPETITOR_BASE_URL=http://localhost:8000/v1 \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3

AUTOCONTEXT_AGENT_PROVIDER=deterministic AUTOCONTEXT_RLM_ENABLED=true \
uv run autoctx solve --description "improve customer-support replies for billing disputes" --gens 3

Training Workflow

Export JSONL training data from completed runs:

uv run autoctx export-training-data \
  --scenario support_triage \
  --all-runs \
  --output training/support_triage.jsonl

Launch the autoresearch-style training loop:

uv sync --group dev --extra mlx
uv run autoctx train \
  --scenario support_triage \
  --data training/support_triage.jsonl \
  --time-budget 300

MLX training is host-only. It must run on an Apple Silicon macOS machine with Metal access. It will not run correctly inside a Docker sandbox on macOS.

If you only want to inspect generated training data first, export without training and open the JSONL directly.

For host setup details and OpenClaw automation via a file-based watcher bridge, see docs/mlx-training.md.

Configuration

Configuration is loaded from AUTOCONTEXT_* environment variables in src/autocontext/config/settings.py.

Common settings:

AUTOCONTEXT_AGENT_PROVIDER
AUTOCONTEXT_EXECUTOR_MODE
AUTOCONTEXT_MODEL_COMPETITOR
AUTOCONTEXT_MATCHES_PER_GENERATION
AUTOCONTEXT_MAX_RETRIES
AUTOCONTEXT_JUDGE_PROVIDER
AUTOCONTEXT_PI_TIMEOUT (defaults to 300 seconds for Pi-backed live runs)
AUTOCONTEXT_RLM_ENABLED
AUTOCONTEXT_HARNESS_PREFLIGHT_ENABLED
AUTOCONTEXT_STAGED_VALIDATION_ENABLED

See the repo-level .env.example for a working starting point.

Repository Structure

autocontext/
  src/autocontext/   Python package
  tests/             Pytest suite
  docs/              Package-specific documentation
  migrations/        SQLite migrations
ts/                  TypeScript package
infra/               Docker, Fly.io, bootstrap scripts

Validation and Development

uv run ruff check src tests
uv run mypy src
uv run pytest

If you change protocol messages, regenerate the derived protocol artifacts from the repo root:

cd ..
uv run --directory autocontext python scripts/generate_protocol.py

OpenClaw / ClawHub

autocontext exposes:

artifact contracts for harnesses, policies, and distilled models
REST and MCP operations for evaluate, validate, publish, import, and discover
ClawHub skill manifests and scenario discovery metadata
an adapter layer for running OpenClaw agents inside the harness

Additional Docs

Canonical concept model
Agent integration guide — CLI-first integration for external agents, MCP fallback, JSON output reference
Sandbox modes
MLX host training
TypeScript package guide — analyze, mission control, and interactive TUI surfaces
Demo data notes
Copy-paste examples
Change history
Repository overview

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

jayscambler

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.5.1

May 14, 2026

0.5.0

May 1, 2026

0.4.9

Apr 30, 2026

0.4.8

Apr 30, 2026

0.4.7

Apr 29, 2026

0.4.6

Apr 23, 2026

0.4.5

Apr 21, 2026

This version

0.4.4

Apr 20, 2026

0.4.3

Apr 17, 2026

0.4.2

Apr 16, 2026

0.4.1

Apr 14, 2026

0.4.0

Apr 14, 2026

0.3.7

Apr 8, 2026

0.3.6

Apr 7, 2026

0.3.5

Apr 6, 2026

0.3.4

Apr 4, 2026

0.3.3

Apr 4, 2026

0.3.2

Apr 2, 2026

0.3.1

Apr 1, 2026

0.0.0

Apr 2, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autocontext-0.4.4.tar.gz (1.4 MB view details)

Uploaded Apr 20, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

autocontext-0.4.4-py3-none-any.whl (846.4 kB view details)

Uploaded Apr 20, 2026 Python 3

File details

Details for the file autocontext-0.4.4.tar.gz.

File metadata

Download URL: autocontext-0.4.4.tar.gz
Upload date: Apr 20, 2026
Size: 1.4 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for autocontext-0.4.4.tar.gz
Algorithm	Hash digest
SHA256	`42b39a47e5e014b48730b3a26fe3e16086044c6c636c2f34a45c2fce81101d7a`
MD5	`5b4b70814caffb0af8be392b8ff30632`
BLAKE2b-256	`ffc35b3cb21b6423855b21797af23acc9ff1b130941385c627b3d8cfa5f91032`

See more details on using hashes here.

Provenance

The following attestation bundles were made for autocontext-0.4.4.tar.gz:

Publisher: publish-python.yml on greyhaven-ai/autocontext

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: autocontext-0.4.4.tar.gz
- Subject digest: 42b39a47e5e014b48730b3a26fe3e16086044c6c636c2f34a45c2fce81101d7a
- Sigstore transparency entry: 1343492523
- Sigstore integration time: Apr 20, 2026
Source repository:
- Permalink: greyhaven-ai/autocontext@65c54435306ee86d189040294fe89b27b6122605
- Branch / Tag: refs/tags/py-v0.4.4
- Owner: https://github.com/greyhaven-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-python.yml@65c54435306ee86d189040294fe89b27b6122605
- Trigger Event: push

File details

Details for the file autocontext-0.4.4-py3-none-any.whl.

File metadata

Download URL: autocontext-0.4.4-py3-none-any.whl
Upload date: Apr 20, 2026
Size: 846.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for autocontext-0.4.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ebcfa81858aeafc2e6d2653ce16ab0f004861a08ac210b234c51da40a35de831`
MD5	`79e478a23b9fd54183fc29c616b355b5`
BLAKE2b-256	`7066312d9580b01ee81682393fc2461252627baad75dc02b05ba8c2d9bd8f98f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for autocontext-0.4.4-py3-none-any.whl:

Publisher: publish-python.yml on greyhaven-ai/autocontext

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: autocontext-0.4.4-py3-none-any.whl
- Subject digest: ebcfa81858aeafc2e6d2653ce16ab0f004861a08ac210b234c51da40a35de831
- Sigstore transparency entry: 1343492529
- Sigstore integration time: Apr 20, 2026
Source repository:
- Permalink: greyhaven-ai/autocontext@65c54435306ee86d189040294fe89b27b6122605
- Branch / Tag: refs/tags/py-v0.4.4
- Owner: https://github.com/greyhaven-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-python.yml@65c54435306ee86d189040294fe89b27b6122605
- Trigger Event: push

autocontext 0.4.4

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

autocontext

Install

Working Directory

What It Does

Surface Summary

Quick Start

Main CLI Commands

Training Workflow

Configuration

Repository Structure

Validation and Development

OpenClaw / ClawHub

Additional Docs

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance