Structured experiment-driven code optimization with git worktrees, traces, and a local dashboard. CLI for the evo plugin (Claude Code, Codex).

Project description

evo banner

evo

A Claude Code plugin that optimizes code through experiments. You give it a codebase. It discovers metrics to optimize, sets up the evaluation, and starts running experiments in a loop -- trying things, keeping what improves the score, throwing away what doesn't.

Inspired by Karpathy's autoresearch -- where an LLM runs training experiments autonomously to beat its own best score. Autoresearch is a pure hill climb: try something, keep or revert, repeat on a single branch. Evo adds structure on top of that idea:

  • Tree search over greedy hill climb. Multiple directions can fork from any committed node, so exploration doesn't collapse to one path (see the sketch after this list).
  • Parallel semi-autonomous agents. Evo spawns multiple subagents and runs them simultaneously, each in its own git worktree. Each subagent reads traces, formulates hypotheses, and can run multiple iterations within its branch.
  • Shared state. Failure traces, annotations, and discarded hypotheses are accessible to every agent before it decides what to try next.
  • Gating. Regression tests or safety checks can be wired up as a gate. Experiments that don't pass get discarded.
  • Observability. A dashboard to monitor your experiments.
  • Benchmark discovery. /evo:discover explores the repo, figures out what to measure, and instruments the evaluation.
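
For intuition, here is a minimal, hypothetical sketch of that tree structure in Python -- none of these names come from evo's code. Any committed node can act as a fork point, and an experiment is only kept if it beats its parent and passes the gate:

# Illustrative sketch only -- these names are not part of evo's API.
from dataclasses import dataclass, field

@dataclass
class Node:
    """A committed experiment; any node can serve as a fork point."""
    name: str
    score: float
    children: list["Node"] = field(default_factory=list)

def try_experiment(parent: Node, name: str, score: float, gate_ok: bool) -> Node | None:
    """Commit the experiment only if it beats its parent and passes the gate."""
    if not gate_ok or score <= parent.score:
        return None          # discarded; parent remains open for other directions
    child = Node(name, score)
    parent.children.append(child)
    return child

root = Node("baseline", score=0.61)
# Two directions fork from the same committed node instead of one greedy path:
a = try_experiment(root, "cache-eval-results", 0.64, gate_ok=True)
b = try_experiment(root, "vectorize-inner-loop", 0.63, gate_ok=True)

A greedy hill climb would keep only one of those paths alive; here both remain in the tree as future fork points.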

Install

Inside Claude Code:

/plugin marketplace add evo-hq/evo
/plugin install evo@evo-hq-evo

Then reload Claude Code. The /evo:discover and /evo:optimize slash commands become available in any repo, and the evo CLI is on the Bash tool's PATH for the duration of each session.

If you already have a different evo on your system (e.g. the PyPI evo SLAM evaluation tool), the plugin's bin/evo takes precedence while the plugin is enabled, so there's no collision.

Requirements: Claude Code, Python 3.12+, git, uv.

Usage

Two slash commands:

  • /evo:discover -- explores the repo, instruments the benchmark, runs baseline
  • /evo:optimize -- runs the optimization loop with parallel subagents until interrupted

/evo:optimize accepts optional parameters:

Parameter   Default   Description
subagents   5         Number of parallel subagents per round
budget      5         Max iterations each subagent can run within its branch
stall       5         Consecutive rounds with no improvement before auto-stopping

Example: /evo:optimize subagents=3 budget=10 stall=3

Typical flow:

you: /evo:discover
evo: explores repo, instruments benchmark, runs baseline

you: /evo:optimize
evo: spawns 5 subagents in parallel, each exploring a different direction
     each subagent can run up to 5 iterations within its branch
     orchestrator collects results, prunes dead branches, adjusts strategy
     repeats until interrupted or stalled

Under the hood, each experiment gets its own git worktree branching from its parent. If the score improves and the gate passes, the experiment is committed. Otherwise it's discarded and the worktree is cleaned up.
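
In shape, that lifecycle looks roughly like this (a hypothetical Python sketch; the worktree location, branch naming, and commit message are assumptions, not evo's actual implementation):

# Hypothetical sketch of the per-experiment worktree lifecycle described above.
import subprocess

def git(*args: str, cwd: str | None = None) -> None:
    subprocess.run(["git", *args], cwd=cwd, check=True)

def run_experiment(parent_ref: str, name: str, improved: bool, gate_passed: bool) -> None:
    path = f"../evo-worktrees/{name}"                        # assumed location, not evo's actual layout
    git("worktree", "add", "-b", name, path, parent_ref)     # isolated checkout branching off the parent
    # ... a subagent edits files and runs the benchmark inside `path` ...
    if improved and gate_passed:
        git("add", "-A", cwd=path)
        git("commit", "-m", f"experiment: {name}", cwd=path)  # keep the node
    else:
        git("worktree", "remove", "--force", path)            # discard the attempt
        git("branch", "-D", name)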

Architecture

Orchestrator (main agent)
  - reads state, identifies failure patterns, picks strategic directions
  - gives each subagent a direction + specific ideas + iteration budget
  - collects results, prunes dead branches, adjusts strategy for next round

  Subagent 1 (background, budget: 5 iterations)
    - reads traces, analyzes failures in its focus area
    - formulates hypothesis, edits target, runs benchmark
    - if budget remains and sees a follow-up, iterates on its branch
    - returns: what it tried, what worked, what it learned

  Subagent 2 (background, budget: 5 iterations)
    ...up to N subagents in parallel
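
As a mental model, the round structure looks something like the sketch below. This is conceptual only -- the real orchestrator is the Claude Code agent itself, and none of these function names exist in evo:

# Conceptual sketch of the round structure; names invented for illustration.
from concurrent.futures import ThreadPoolExecutor

def run_subagent(direction: str, budget: int) -> dict:
    """Stand-in for one subagent: up to `budget` iterations in its own branch."""
    return {"direction": direction, "best_score": 0.0, "notes": "what it tried and learned"}

def optimize(directions: list[str], subagents: int = 5, budget: int = 5, stall: int = 5) -> None:
    best, stalled_rounds = 0.0, 0
    while stalled_rounds < stall:
        batch = directions[:subagents]
        with ThreadPoolExecutor(max_workers=subagents) as pool:
            results = list(pool.map(lambda d: run_subagent(d, budget), batch))
        round_best = max((r["best_score"] for r in results), default=best)
        if round_best > best:
            best, stalled_rounds = round_best, 0
        else:
            stalled_rounds += 1
        # here the real orchestrator would read traces, prune dead branches,
        # and choose new directions for the next round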

Dashboard

The dashboard starts automatically when you run /evo:discover (or evo init). When it comes up, Claude surfaces the URL in the chat:

Dashboard live: http://127.0.0.1:8080 (pid 12345)

If 8080 is busy, evo auto-increments (8081, 8082, ...) and prints the actual port. You can also start it manually:

uv run --project /path/to/evo evo dashboard --port 8080

The chosen port is persisted to .evo/dashboard.port so repeat runs re-use it.
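
The port handling described above amounts to something like the following (assumed logic, shown for illustration; evo's actual implementation may differ):

# Rough sketch of dashboard port selection; not evo's actual code.
import socket
from pathlib import Path

def pick_dashboard_port(preferred: int = 8080, attempts: int = 20) -> int:
    port_file = Path(".evo/dashboard.port")
    if port_file.exists():                          # re-use the port from a previous run
        preferred = int(port_file.read_text().strip())
    for port in range(preferred, preferred + attempts):
        with socket.socket() as s:
            try:
                s.bind(("127.0.0.1", port))         # free -> take it
            except OSError:
                continue                            # busy -> auto-increment
        port_file.parent.mkdir(parents=True, exist_ok=True)
        port_file.write_text(str(port))
        return port
    raise RuntimeError("no free port found for the dashboard")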

Dev install

For working on evo itself (not just using it):

git clone https://github.com/evo-hq/evo
uv run --project /path/to/evo evo status

uv run resolves dependencies on first use -- no pip install step.

TODO

  • Distributed evaluation via Harbor -- run benchmarks in containers instead of locally, use Harbor's cloud providers to parallelize.

License

Licensed under the Apache License 2.0.

Download files

Download the file for your platform.

Source Distribution

evo_hq_cli-0.2.0.tar.gz (27.8 kB)

Built Distribution

evo_hq_cli-0.2.0-py3-none-any.whl (27.8 kB)

File details

Details for the file evo_hq_cli-0.2.0.tar.gz.

File metadata

  • Download URL: evo_hq_cli-0.2.0.tar.gz
  • Size: 27.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for evo_hq_cli-0.2.0.tar.gz
Algorithm    Hash digest
SHA256       ec563dc1c9f0ffd96dee5016978c706289ad4e62e281a9dfb489023c6440f892
MD5          05578722a72ee13fdf349683d8439974
BLAKE2b-256  5df9a552afe4c1c26335ecca06d07ce9dd306ac65955e68a46653848943cd07a


File details

Details for the file evo_hq_cli-0.2.0-py3-none-any.whl.

File metadata

  • Download URL: evo_hq_cli-0.2.0-py3-none-any.whl
  • Size: 27.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.2

File hashes

Hashes for evo_hq_cli-0.2.0-py3-none-any.whl
Algorithm    Hash digest
SHA256       47f1d215ce64496260408f95679beb3e83f394cab85d7163fe478a66ae935966
MD5          104989b3c2e4d0a3fe29ac0bff65ab2d
BLAKE2b-256  67ba1ddd8a1b2168be9f679b3905856a1df58b7f3bc364f0219c70c57e78c2d8

