AutoContext control plane for iterative strategy evolution.
Project description
AutoContext
AutoContext is a control plane for improving agent behavior over repeated runs. It combines multi-agent candidate generation, staged validation, scenario execution, knowledge accumulation, optional local distillation, and OpenClaw-facing APIs.
Working Directory
Run the commands in this README from the autocontext/ directory. The Python package, CLI entrypoint, tests, migrations, and dashboard assets all live here.
What It Does
- Runs iterative generation loops against game scenarios and agent-task scenarios
- Persists playbooks, hints, tools, reports, and snapshots across runs
- Supports staged validation, harness synthesis, and harness-aware routing
- Exports training data and runs autoresearch-style local training loops
- Exposes evaluation, validation, artifact, and discovery operations over MCP and HTTP
Quick Start
From the repo root:
cd autocontext
uv venv
source .venv/bin/activate
uv sync --group dev
Use the repo-level .env.example as the reference for available AUTOCONTEXT_* settings.
Run a deterministic local scenario:
AUTOCONTEXT_AGENT_PROVIDER=deterministic \
uv run autoctx run --scenario grid_ctf --gens 3 --run-id quickstart
Run with Anthropic:
AUTOCONTEXT_AGENT_PROVIDER=anthropic \
AUTOCONTEXT_ANTHROPIC_API_KEY=... \
uv run autoctx run --scenario grid_ctf --gens 3
Start the API server and dashboard:
uv run autoctx serve --host 127.0.0.1 --port 8000
Open http://127.0.0.1:8000 after the server starts.
Start the MCP server:
uv sync --group dev --extra mcp
uv run autoctx mcp-serve
Main CLI Commands
uv run autoctx run --scenario grid_ctf --gens 3
uv run autoctx list
uv run autoctx status <run_id>
uv run autoctx replay <run_id> --generation 1
uv run autoctx benchmark --scenario grid_ctf --runs 5
uv run autoctx serve --host 127.0.0.1 --port 8000
uv run autoctx mcp-serve
Useful variants:
AUTOCONTEXT_AGENT_PROVIDER=anthropic AUTOCONTEXT_ANTHROPIC_API_KEY=... \
uv run autoctx run --scenario grid_ctf --gens 3
AUTOCONTEXT_AGENT_PROVIDER=deterministic AUTOCONTEXT_RLM_ENABLED=true \
uv run autoctx run --scenario grid_ctf --gens 3
Training Workflow
Export JSONL training data from completed runs:
uv run autoctx export-training-data \
--scenario grid_ctf \
--all-runs \
--output training/grid_ctf.jsonl
Launch the autoresearch-style training loop:
uv sync --group dev --extra mlx
uv run autoctx train \
--scenario grid_ctf \
--data training/grid_ctf.jsonl \
--time-budget 300
MLX training is host-only. It must run on an Apple Silicon macOS machine with Metal access. It will not run correctly inside a Docker sandbox on macOS.
If you only want to inspect generated training data first, export without training and open the JSONL directly.
For host setup details and OpenClaw automation via a file-based watcher bridge, see docs/mlx-training.md.
Configuration
Configuration is loaded from AUTOCONTEXT_* environment variables in src/autocontext/config/settings.py.
Common settings:
AUTOCONTEXT_AGENT_PROVIDERAUTOCONTEXT_EXECUTOR_MODEAUTOCONTEXT_MODEL_COMPETITORAUTOCONTEXT_MATCHES_PER_GENERATIONAUTOCONTEXT_MAX_RETRIESAUTOCONTEXT_JUDGE_PROVIDERAUTOCONTEXT_RLM_ENABLEDAUTOCONTEXT_HARNESS_PREFLIGHT_ENABLEDAUTOCONTEXT_STAGED_VALIDATION_ENABLED
See the repo-level .env.example for a working starting point.
Repository Structure
autocontext/
src/autocontext/ Python package
tests/ Pytest suite
dashboard/ Static dashboard assets
docs/ Package-specific documentation
migrations/ SQLite migrations
ts/ TypeScript package
tui/ Interactive terminal UI
infra/ Docker, Fly.io, bootstrap scripts
Validation and Development
uv run ruff check src tests
uv run mypy src
uv run pytest
If you change protocol messages, regenerate the derived protocol artifacts from the repo root:
cd ..
uv run --directory autocontext python scripts/generate_protocol.py
OpenClaw / ClawHub
AutoContext exposes:
- artifact contracts for harnesses, policies, and distilled models
- REST and MCP operations for evaluate, validate, publish, import, and discover
- ClawHub skill manifests and scenario discovery metadata
- an adapter layer for running OpenClaw agents inside the harness
Additional Docs
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file autoctx-0.1.0.tar.gz.
File metadata
- Download URL: autoctx-0.1.0.tar.gz
- Upload date:
- Size: 747.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
403006ee8227df9aa8d2eb88383c25d93317c207a67cd158dc64423656c26a52
|
|
| MD5 |
d235bf08beda214e3c660e0fd444e828
|
|
| BLAKE2b-256 |
03d1416a77874ef5804cdc6732029b9d4d905f297abad7790066494a361aed4a
|
File details
Details for the file autoctx-0.1.0-py3-none-any.whl.
File metadata
- Download URL: autoctx-0.1.0-py3-none-any.whl
- Upload date:
- Size: 417.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.6.17
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
367e250b46b4b1f0c09b0b17d28f0f511394a611dc5093dc10e26304d000722d
|
|
| MD5 |
555534601700e9de9adb23f50741cd06
|
|
| BLAKE2b-256 |
a75d9a457dcae6d3dd32194d77fcf15cd7e4be3fa441004e4a30798cff22a6da
|