Run a prompt against multiple coding agents in parallel and compare results
Project description
AgentTester
⚠️ Experimental — This project is under active development. APIs, config format, and CLI flags may change without notice.
Send a single prompt to multiple coding agents running in parallel and compare the results. Each agent works in its own git worktree on a separate branch so they never interfere with each other.
Install
uv pip install -e ".[dev]"
Quick Start
# List built-in agents
agent-tester agents
# Run two agents on the same prompt
agent-tester run "Add unit tests for the auth module" --agents claude,aider
# Use a prompt file
agent-tester run --prompt-file task.md --agents claude,codex,aider
# Keep worktrees for manual inspection
agent-tester run "Refactor logging" --agents claude,aider --keep-worktrees
How It Works
- You provide a prompt and select agents
- AgentTester creates a git worktree + branch for each agent from the current HEAD
- All agents run concurrently (up to 5), each in its own worktree
- Agent output streams to the terminal with colored prefixes
- A markdown comparison report is generated with diff stats and timing
- Worktrees are cleaned up (branches are preserved for
git diff)
Branches are named agenttester/<run-id>/<agent-name> so you can compare results:
git diff agenttester/a3f2c1d0/claude agenttester/a3f2c1d0/aider
Configuration
Copy config.example.yaml to agent-tester.yaml in your target repo to customize agents. Built-in presets are available for claude, aider, and codex.
You can also pass a config file explicitly:
agent-tester run "Fix the bug" --agents claude --config /path/to/custom.yaml
A global config at ~/.config/agenttester/config.yaml or ~/.agenttester/config.yaml is merged automatically — local project config takes precedence.
Command Placeholders
{prompt}— replaced with the shell-escaped prompt text{prompt_file}— replaced with a path to a temp file containing the prompt- If neither placeholder is present, the prompt is piped to the agent via stdin
Agent Settings
| Field | Description | Default |
|---|---|---|
command |
Shell command template | (required) |
commit_style |
auto (agent commits) or manual (agenttester commits) |
auto |
timeout |
Max seconds before the agent is killed | 600 |
env |
Extra environment variables (key-value map) | {} |
Interactive Model REPL
For comparing responses from vLLM model servers interactively, with persistent conversation history within a session:
agent-tester repl # auto-discovers agent-tester.yaml
agent-tester repl --config custom.yaml # explicit config path
The REPL discovers any agent in your config whose command uses query_model.py,
fans out each prompt to all of them in parallel, and maintains separate
conversation history per model. Use /reset to clear history or exit to quit.
See config.example.yaml for example vLLM agent entries.
Development
uv pip install -e ".[dev]"
ruff check src/ tests/
ruff format src/ tests/
pytest
Docker
# Run against the current directory
docker compose run --rm agent-tester run "Fix the bug" --agents claude
# Run against a different repo
REPO_PATH=/path/to/repo docker compose run --rm agent-tester run "Add tests" --agents claude,aider
Library Usage
import asyncio
from pathlib import Path
from rich.console import Console
from agenttester import Orchestrator, load_config
async def main():
agents = load_config()
selected = [agents["claude"], agents["aider"]]
orch = Orchestrator(Path(".").resolve(), Console())
results = await orch.run("Add unit tests", selected)
for r in results:
print(f"{r.agent_name}: exit={r.exit_code} duration={r.duration:.1f}s")
asyncio.run(main())
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agenttester-0.3.4.tar.gz.
File metadata
- Download URL: agenttester-0.3.4.tar.gz
- Upload date:
- Size: 45.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cc252b2a03632d837a30ab69999e26d04a68ac14ebcf12685d291e0226dede68
|
|
| MD5 |
803f52a267692ac9ec84b821c733b229
|
|
| BLAKE2b-256 |
44f7903480da53fad120b910d441673b4ad1308f98285a21b31157f7cc9276a6
|
Provenance
The following attestation bundles were made for agenttester-0.3.4.tar.gz:
Publisher:
publish.yml on sroomberg/agenttester
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
agenttester-0.3.4.tar.gz -
Subject digest:
cc252b2a03632d837a30ab69999e26d04a68ac14ebcf12685d291e0226dede68 - Sigstore transparency entry: 1510931365
- Sigstore integration time:
-
Permalink:
sroomberg/agenttester@641f5d6d692e12e1e111a922fad94b352d1e9cb6 -
Branch / Tag:
refs/tags/v0.3.4 - Owner: https://github.com/sroomberg
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@641f5d6d692e12e1e111a922fad94b352d1e9cb6 -
Trigger Event:
push
-
Statement type:
File details
Details for the file agenttester-0.3.4-py3-none-any.whl.
File metadata
- Download URL: agenttester-0.3.4-py3-none-any.whl
- Upload date:
- Size: 19.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
74608f49cf3d00e8d3440bc017e6d3f96cb993751fa070e27578a07b67519857
|
|
| MD5 |
2fcda6f1648afaad3d7308d224cd2ae2
|
|
| BLAKE2b-256 |
5dbed30659f9f87bb11c539662e572e4d3a542e42dd31b0cc0cc7ad3e8bd8835
|
Provenance
The following attestation bundles were made for agenttester-0.3.4-py3-none-any.whl:
Publisher:
publish.yml on sroomberg/agenttester
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
agenttester-0.3.4-py3-none-any.whl -
Subject digest:
74608f49cf3d00e8d3440bc017e6d3f96cb993751fa070e27578a07b67519857 - Sigstore transparency entry: 1510931680
- Sigstore integration time:
-
Permalink:
sroomberg/agenttester@641f5d6d692e12e1e111a922fad94b352d1e9cb6 -
Branch / Tag:
refs/tags/v0.3.4 - Owner: https://github.com/sroomberg
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@641f5d6d692e12e1e111a922fad94b352d1e9cb6 -
Trigger Event:
push
-
Statement type: