Multi-model code review with ranked todos and subagent dispatch, inside Claude Code CLI
Project description
compare-mcp
Multi-model code review with ranked todos and subagent dispatch, inside Claude Code CLI.
Claude Code is great at code review — but it only talks to one model. Copilot CLI recently shipped multi-model debug, letting you bounce a problem off GPT, Claude, and Gemini in one shot. Claude Code can't do that natively. This MCP server adds it: bring your own API keys, fan out to any combination of models, and get back a diffed, ranked list of what they each found.
Fan out any bug or task to multiple LLMs simultaneously, diff their unique insights, optionally run a debate round where models critique each other, then dispatch parallel subagents to implement the combined best fixes — each with its own git commit.
Demo
https://github.com/user-attachments/assets/8990dabb-bc61-4625-8930-c914cffe75da
/compare models→/compare review config.py for security issues→/compare --debate→/compare status
Architecture
Install
pip install compare-mcp
Add to Claude Code:
claude mcp add -s user compare-mcp -- python -m compare_mcp
Quick start
-
Copy the example config:
mkdir -p ~/.compare cp compare-mcp/.compare/config.example.json ~/.compare/config.json
-
Enable at least 2 providers: set
"enabled": trueand add your API key — either as a$ENV_VARreference or paste the key directly into the JSON -
In Claude Code:
/compare memory leak in the tile rendering loop /compare race condition in the connection pool --debate --providers claude,openai /compare status /compare models
Config reference
Config lives at ~/.compare/config.json. API keys use $ENV_VAR syntax — expanded at load time.
Provider types
| Type | SDK | Use for |
|---|---|---|
anthropic |
anthropic-python | Claude models directly |
openai_compat |
openai-python with custom base_url |
OpenAI, Kimi, Minimax, Gemini, Ollama API, any compatible endpoint |
cli |
subprocess stdin/stdout | Ollama CLI, Codex CLI, any binary |
Compare settings
| Key | Default | Description |
|---|---|---|
max_tokens |
2048 | Max tokens per provider response |
timeout_seconds |
120 | Per-provider timeout (see note below) |
db_path |
~/.compare/todos.sqlite |
SQLite todo store location |
dedup_threshold |
0.65 | Fuzzy match threshold (0-1). Higher = stricter |
max_file_lines |
1000 | Warn before sending files larger than this |
Timeout note: Some models (e.g. Kimi's kimi-k2.5) are significantly slower than GPT-4o on large prompts and will time out at 60s. We default to 120s. If a provider consistently times out, try a faster model variant — for Kimi, moonshot-v1-auto is faster than kimi-k2.5 and auto-selects the right context window.
Adding providers
Any OpenAI-compatible endpoint
{
"my_provider": {
"enabled": true,
"type": "openai_compat",
"api_key": "$MY_API_KEY",
"model": "model-name",
"base_url": "https://api.example.com/v1"
}
}
Works with: OpenAI, Kimi (api.moonshot.ai), Minimax (api.minimax.io), Gemini (generativelanguage.googleapis.com/v1beta/openai/), Ollama API (localhost:11434/v1), OpenRouter, Together AI, Groq, etc.
CLI subprocess model
{
"ollama_local": {
"enabled": true,
"type": "cli",
"cli_command": "ollama",
"cli_args": ["run", "codellama"],
"cli_parser": "text"
}
}
cli_parser options: "text" (raw stdout), "json" (parse as JSON), "jsonl" (last complete JSON line).
Commands
In Claude Code, type any of these:
| Command | What it does |
|---|---|
/compare <issue> |
Fan out to all enabled models, diff findings, save ranked todos |
/compare <issue> --debate |
Same as above, plus a debate round where models critique each other |
/compare <issue> --providers openai,kimi |
Compare specific providers only |
/compare models |
Show configured providers and their status |
/compare status |
Show all todos grouped by status (pending/in_progress/done) |
/compare update <id> <status> |
Change a todo's status |
After /compare runs, you'll be asked whether to dispatch subagents to fix the findings in parallel. Each subagent gets one todo, implements the fix, and commits.
How it works
-
Dispatch —
compare_runfans out the code + issue to all enabled providers viaasyncio.gather. Providers that timeout or error are excluded, never crash the whole run. -
Diff —
compare_diffuses rapidfuzz (token sort ratio) to deduplicate findings across providers. Findings seen by 2+ providers are "shared"; the rest are "unique". Agreement rate = shared / total unique groups. -
Debate (optional) —
compare_debatesends each provider's findings to every other provider for critique. A synthesis call merges the results. Capped at 4 providers to limit API calls (N*(N-1)+1). -
Todos —
compare_todoswrites ranked findings to SQLite. High severity first, then by provider count. -
Execute — The
/compareskill dispatches parallel Claude Code subagents, one per todo. Each implements the fix and commits.
MCP tools (7)
| Tool | Description |
|---|---|
compare_models |
List configured providers (no API keys exposed) |
compare_run |
Fan out code review to providers in parallel |
compare_diff |
Extract unique vs shared insights with fuzzy dedup |
compare_debate |
Models critique each other, then synthesize |
compare_todos |
Write ranked findings to SQLite |
compare_status |
Read todos grouped by status |
compare_todo_update |
Update a todo's status |
vs multi_mcp
multi_mcp does parallel dispatch well. compare-mcp builds the workflow layer on top:
| Feature | multi_mcp | compare-mcp |
|---|---|---|
| Parallel dispatch | yes | yes |
| OpenAI-compat providers | yes | yes |
| CLI subprocess models | yes | yes |
| Debate / critique round | raw | structured + merged output |
| Insight diff (unique vs shared) | no | rapidfuzz dedup |
| Agreement rate metric | no | yes |
| SQLite ranked todo store | no | yes |
| Subagent dispatch per todo | no | yes |
| Git commit per fix | no | yes |
| CC skill + /compare | no | yes |
| pip install | no (git clone + make) | yes |
vs Copilot CLI multi-model
Copilot CLI routes through GitHub's API proxy — no BYO keys, no Kimi/Minimax/local models. compare-mcp calls provider APIs directly: full context windows, your own rate limits, any model with an HTTP endpoint or CLI binary.
Development
git clone https://github.com/carolinacherry/compare-mcp.git
cd compare-mcp
pip install -e ".[dev]"
pytest
ruff check .
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file compare_mcp-0.1.0.tar.gz.
File metadata
- Download URL: compare_mcp-0.1.0.tar.gz
- Upload date:
- Size: 21.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0d07e7e13e55bdb35e6c6601b75b313f600e87b4e589c045f845e09abe6af786
|
|
| MD5 |
57281dfbda3f6e7fc215f433a73f5b89
|
|
| BLAKE2b-256 |
2af916654a9fe1c4bb2bf051897e617f22fc2aa91948059ebb28bc5d5fec45a7
|
Provenance
The following attestation bundles were made for compare_mcp-0.1.0.tar.gz:
Publisher:
publish.yml on carolinacherry/compare-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
compare_mcp-0.1.0.tar.gz -
Subject digest:
0d07e7e13e55bdb35e6c6601b75b313f600e87b4e589c045f845e09abe6af786 - Sigstore transparency entry: 1217915078
- Sigstore integration time:
-
Permalink:
carolinacherry/compare-mcp@3b6318716177616beec1a4accf593fa29273ac25 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/carolinacherry
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@3b6318716177616beec1a4accf593fa29273ac25 -
Trigger Event:
push
-
Statement type:
File details
Details for the file compare_mcp-0.1.0-py3-none-any.whl.
File metadata
- Download URL: compare_mcp-0.1.0-py3-none-any.whl
- Upload date:
- Size: 16.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3232994252dac4f26e6620505f0d2f1937eee9738966f109605a19da0b6783b0
|
|
| MD5 |
48c3568bc61f676b0c279ffc1c30594e
|
|
| BLAKE2b-256 |
9bb41ac2ac0f7d0e4fde696f789915aac27be0c2460998ef434bc777b8e5d080
|
Provenance
The following attestation bundles were made for compare_mcp-0.1.0-py3-none-any.whl:
Publisher:
publish.yml on carolinacherry/compare-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
compare_mcp-0.1.0-py3-none-any.whl -
Subject digest:
3232994252dac4f26e6620505f0d2f1937eee9738966f109605a19da0b6783b0 - Sigstore transparency entry: 1217915099
- Sigstore integration time:
-
Permalink:
carolinacherry/compare-mcp@3b6318716177616beec1a4accf593fa29273ac25 -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/carolinacherry
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@3b6318716177616beec1a4accf593fa29273ac25 -
Trigger Event:
push
-
Statement type: