Skip to main content

Multi-agent AI debate orchestrator — spawns Claude, Gemini, Codex (etc.) CLI agents in tmux panes and runs structured debates, with a web UI and archive browser.

Project description

aidebate

Multi-agent AI debate orchestrator. Spins up Claude Code, Gemini CLI, Codex CLI (and anything else with an interactive REPL) inside tmux panes, runs a structured debate between them, and serves a live web UI plus an archive browser.

status: alpha license: Apache 2.0

Why

You have subscriptions to several AI coding assistants and occasionally want to pit them against each other — or let them cross-examine each other — on a concrete question ("Rust vs Go for this CLI", "Which of these 5 libraries should we pick?", "Is this migration worth it?"). Doing this manually is tedious; aidebate wires it up end to end:

  • Each agent is a long-lived REPL in its own tmux pane (so you can watch them think live, or just see the final verdict).
  • The orchestrator coordinates turns via flag files — agents never step on each other's keystrokes.
  • A moderator agent runs the same way, reads everyone's answers at the end, and writes a verdict.
  • Everything is persisted to disk as rendered Markdown, so every debate leaves a permanent, browsable transcript.

Install

You need tmux, Python 3.10+, and at least one authenticated AI CLI:

Then:

# One-shot (npx-style) with uv:
uvx aidebate serve

# Or persistent install:
pipx install aidebate
debate serve

Or develop from source:

git clone https://github.com/orlenko/aidebate && cd aidebate
python -m venv .venv && .venv/bin/pip install -e ".[dev]"
.venv/bin/debate --help

Quickstart

Web UI (recommended)

debate serve     # http://127.0.0.1:8765

Fill in the form — topic, moderator engine, and one row per debater (role + engine + stance). Click Start. The tiled tmux session spins up in the background and every pane streams into the browser. When it finishes, the verdict renders as proper Markdown and every run joins the "Prior debates" list for future reference.

Command line

debate run \
  --topic "Should we migrate from Framework A to Framework B?" \
  --side pro@claude:"Argue for migrating." \
  --side con@gemini:"Argue for staying." \
  --side auditor@codex:"Neutral cost analysis — no advocacy." \
  --moderator claude \
  --watch

--watch pops open an iTerm2/Terminal window already attached to the debate's tmux session so you can follow along. Without it, use the printed tmux attach -t debate-<timestamp> command.

Other subcommands

Command What it does
debate ls List prior session IDs.
debate attach <id> Print the tmux attach command for a session.
debate kill-all Kill every tmux session whose name starts with debate-.
debate smoke Walking-skeleton test: one agent, one turn, no moderator.

What gets saved

Each run writes to ~/.aidebate/sessions/<id>/. Override with AIDEBATE_HOME=<dir> or --sessions-dir <dir> on any subcommand. (We deliberately avoid macOS's ~/Library/Application Support/ path — the space breaks too many of the ad-hoc shell commands that AI agents emit mid-debate.)

~/.aidebate/sessions/2026-04-14-120000/
├── session.json                    # manifest: topic, sides, moderator, status, timings
├── chat.jsonl                      # group chat log (one JSON object per line)
├── verdict.md                      # moderator's final ruling
├── agents/
│   ├── pro/                        # each agent's cwd
│   ├── con/
│   └── moderator/
├── phase-1-opening/
│   ├── pro.prompt.md
│   ├── pro.answer.md
│   ├── pro.done
│   └── …
├── phase-2-rebuttal/
└── phase-3-verdict/

How it works

Agents coordinate via flag files, not terminal scraping:

  1. Orchestrator writes phase-N/<role>.prompt.md.
  2. It tells the agent (via tmux send-keys) to read that file and follow it.
  3. The prompt instructs the agent to write its response to <role>.answer.md and then touch '<role>.done'.
  4. Orchestrator polls for the .done flag; when every agent's .done lands, the phase is complete.

The debate runs in three phases:

Phase 1: Opening    — each debater writes their position (parallel)
Phase 2: Rebuttal   — each debater attacks every opponent's opening (parallel)
Phase 3: Verdict    — moderator reads all answers + chat, renders ruling

startup_keys in the adapter config auto-dismiss "trust this folder?" dialogs for each CLI. permission_prompts handle anything else interactive.

Adapters

Adding a new AI CLI is a YAML file under src/aidebate/adapters/:

# src/aidebate/adapters/mynewcli.yaml
name: mynewcli
cmd: "mynewcli --yolo --workspace {session_root}"
submit_key: "Enter"
submit_delay: 0.8
permission_prompts:
  - { match: "Trust this", respond: "" }  # empty respond → just press Enter
  - { match: "\\[y/N\\]",  respond: "y" }
answer_instruction: |
  When you have finished this task, write your full response to:

      {answer_path}

  Then run:

      touch '{done_path}'
startup_keys:
  - { delay: 3.0, key: "Enter" }   # dismiss startup dialogs

The adapter's cmd can reference {session_root} and {agent_cwd}, substituted at spawn time.

Development

.venv/bin/pytest                # run the test suite
.venv/bin/ruff check src tests  # lint
scripts/hostile-eyes            # adversarial self-review + mark current HEAD
scripts/pre-pr-check            # fail fast if hostile review is missing
scripts/restart-server          # bounce the local web server on :8765

The tests don't require tmux or any AI CLI — they cover adapter loading, side parsing, prompt building, and the FastAPI endpoints that read session state from disk. For end-to-end runs you need the real thing.

License

Apache-2.0. See LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aidebate-0.4.7.tar.gz (62.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aidebate-0.4.7-py3-none-any.whl (61.1 kB view details)

Uploaded Python 3

File details

Details for the file aidebate-0.4.7.tar.gz.

File metadata

  • Download URL: aidebate-0.4.7.tar.gz
  • Upload date:
  • Size: 62.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aidebate-0.4.7.tar.gz
Algorithm Hash digest
SHA256 12ac9096ee4b5dbe69e3872e0d25023ef80d6123e6c92e28b1b5bd0d5c8f89d0
MD5 7afe9eb4bbf6fab9d4614d0593e4613d
BLAKE2b-256 c31cc97c267f9ea55bef13ccb3e17500507f7aad83c07b08504c7dd225ef15bf

See more details on using hashes here.

Provenance

The following attestation bundles were made for aidebate-0.4.7.tar.gz:

Publisher: release.yml on orlenko/aidebate

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file aidebate-0.4.7-py3-none-any.whl.

File metadata

  • Download URL: aidebate-0.4.7-py3-none-any.whl
  • Upload date:
  • Size: 61.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for aidebate-0.4.7-py3-none-any.whl
Algorithm Hash digest
SHA256 d5e8653348a43072fa08d7ad870e0642b029b988c60b97dd174c64f71fc8c328
MD5 ccccdfa62143d33b3bbedcc9fa9dd010
BLAKE2b-256 37a5ccc983d686de7ed8dcc66b8475de493c1ab2e6e46d138a37615bffedbd8f

See more details on using hashes here.

Provenance

The following attestation bundles were made for aidebate-0.4.7-py3-none-any.whl:

Publisher: release.yml on orlenko/aidebate

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page