Modern multi-agent CLI orchestrating multiple AI models via OpenRouter

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

firo2525

These details have not been verified by PyPI

Project description

🌐 konklave.info

🏛️ K O N K L A V E

A council of AI models that thinks, codes, reviews, and ships — together.

🚀 Releases June 15, 2026

Stop talking to one AI. Run a team.

Konklave orchestrates six specialized agents — Architect, Coder, Reviewer, Tester, Researcher, Auditor — across any mix of models from OpenRouter. They catch each other's mistakes so you don't have to.

🎬 Screenshots

Full pipeline run — Agents · Chat · Pipeline graph · Live cost tracking

Swarm mode — 6 Researchers running in parallel, all writing findings simultaneously Swarm of researchers

Architect planning a large feature — 6 Coders spawned in parallel Architect with parallel coders

Web Dashboard — live runs, eval comparison, settings Konklave dashboard with eval comparison

Eval results in the terminal — solo vs council, head to head Eval results table in the CLI

✨ How It Works

Instead of one model doing everything, Konklave runs an actual pipeline of specialists:

┌─────────────────────────────────────────────────────────────────┐
│                                                                 │
│   📝 You describe the task                                      │
│          │                                                      │
│          ▼                                                      │
│   🏛️  Architect  ──spawns──►  🔍 Researcher × N (parallel)    │
│          │                                                      │
│          ▼  concrete plan                                       │
│   💻  Coder       (different model family — blind spots cancel) │
│          │                                                      │
│          ▼  implementation                                      │
│   🔎  Reviewer  ──╮                                            │
│                   ├──► consensus vote                          │
│   🛡️  Auditor   ──╯                                            │
│          │                                                      │
│          ▼  approved                                            │
│   🧪  Tester     (runs real commands, reads real output)       │
│          │                                                      │
│          ▼                                                      │
│   ✅  Result                                                    │
│                                                                 │
└─────────────────────────────────────────────────────────────────┘

Each role runs on the model you choose. Mix providers freely — a DeepSeek coder with Anthropic reviewers means genuinely independent cross-checks, not just one model second-guessing itself.

✨ What's New in This Release

Feature	Details
Model-diverse review panel	The review step now fields a third voter from a different model family (`reviewer@google/gemini-2.5-pro` by default). Independent models catch each other's blind spots instead of one model voting twice.
Deliberation round	On a split vote, each reviewer sees the others' full reasoning before casting a FINAL verdict. Unanimous panels pay nothing extra — the second round only fires on a real disagreement.
Image & video generation	New `image_gen` and `video_gen` tools let agents produce and save images/videos via any OpenAI-compatible generation API (DALL-E, Sora, …). Configure endpoint + model in settings.
Configurable from TUI/GUI	All three features above are tunable from the Settings screen — no YAML editing required.

🧠 Six Specialized Roles

Role	What it does
🏛️ Architect	Reads your codebase, spawns parallel Researchers, writes a concrete step-by-step plan
💻 Coder	Implements the plan — intentionally assigned a different model family than the reviewers
🔎 Reviewer	Checks the implementation against the plan and your coding standards
🧪 Tester	Runs actual commands and verifies the result with real evidence, not guesses
🛡️ Auditor	Final pass — does the output actually solve the original request?
🔍 Researcher	Lightweight parallel lookups, runs in swarms to gather context fast

⚡ Work Modes

One flag to control how deep the pipeline goes:

konklave run "your task" --work very_fast    # ⚡ instant — straight to code, no research
konklave run "your task" --work fast         # 🚀 quick research + light review
konklave run "your task"                     # ⚖️  balanced default
konklave run "your task" --work deep         # 🔬 multi-round review and testing
konklave run "your task" --work autonomous   # 🤖 autonomous loop until PERFECT

Mode	Speed	What runs
`very_fast`	⚡⚡⚡	No research → code → done
`fast`	⚡⚡	One researcher, quick review
`balanced`	⚖️	Research → plan → code → review → test
`deep`	🔬	Multi-researcher, multiple review/test rounds
`exhaustive`	🧬	Exhaustive edge cases, trade-off exploration
`autonomous`	🤖	Loop until Auditor approves (with budget cap)
`auto-endless`	♾️	Loop until you stop it

🌊 Swarm — Parallel Sub-Agents

Agents can spawn their own helpers for genuinely independent sub-tasks. Three Researchers run at once instead of sequentially. You control the aggressiveness:

:swarm off            →  each agent works alone
:swarm encouraged     →  agents use parallel helpers when it makes sense  (default)
:swarm aggressive     →  maximally fan out work to sub-agents

🎮 Human-in-the-Loop

Inject a message mid-run, steer the direction, or just watch. Slash commands work in the TUI at any time:

Command	What it does
`:say "focus on auth"`	Inject a message into the active agent
`:work autonomous`	Switch work mode on the fly
`:swarm aggressive`	Turn up parallel agent spawning
`:ask proactive`	Make the Architect ask you more questions
`:cost`	Show live USD spend per agent
`:models`	Reassign any role to a different model
`:exit`	Clean stop at the next step boundary

📦 Built-in Pipelines

Pipelines are plain YAML — no Python required. Write your own or use the built-ins:

🏗️  build_feature    →  research + plan + code + review + test + audit
⚡  quick            →  one-shot coder, no review overhead
♻️  refactor         →  architect-led refactor with before/after review
🐛  debug_issue      →  reproduction → root cause → fix → regression test
🔍  research         →  deep research swarm, no code changes
🤖  autonomous_loop  →  loop until done (budget-capped)

🏗️ Architecture

 You (TUI / CLI)
       │
       ▼
 ┌─────────────┐     ┌──────────────────────────────────────┐
 │  Conductor  │────►│  Pipeline YAML                       │
 │             │     │  • sequential steps                  │
 │             │     │  • parallel groups                   │
 │             │     │  • consensus votes                   │
 │             │     │  • loop steps                        │
 │             │     │  • human-approval gates              │
 └─────────────┘     └──────────────┬───────────────────────┘
                                    │
              ┌─────────────────────┼────────────────────┐
              ▼           ▼         ▼          ▼          ▼
        Architect      Coder    Reviewer    Tester    Auditor
        (Sonnet)    (DeepSeek)  (Sonnet)   (Haiku)   (Sonnet)
            │
            └──spawns──► Researcher × N
                         (parallel swarm)
                              │
                              ▼
                       OpenRouter API
                  (any model, any provider)

Key design decisions:

🔒 Each agent has isolated conversation history — no cross-contamination
🔄 Event Bus decouples UI, Conductor, agents, and tools asynchronously
🛡️ Fallback models — if a model fails, the agent auto-retries on a backup
💾 SQLite persistence — every step saved, resume from any checkpoint

🤖 Default Model Panel

Role	Model	Why
🏛️ Architect	`anthropic/claude-sonnet-4.6`	Strong planning and tool use
💻 Coder	`deepseek/deepseek-chat-v3`	Fast, cheap, excellent at code
🔎 Reviewer	`anthropic/claude-sonnet-4.6`	Different family from coder → independent check
🧪 Tester	`anthropic/claude-haiku-4.5`	Fast and cheap for test gates
🔍 Researcher	`anthropic/claude-haiku-4.5`	High-volume parallel queries
🛡️ Auditor	`anthropic/claude-sonnet-4.6`	Strict final quality gate

Swap any role to any model on OpenRouter — use :models in the TUI or konklave init.

📊 Benchmarks

We run Konklave against a suite of self-contained coding tasks (each with a hidden test file) and score by the fraction of tests that pass. To keep it fair, every configuration runs the exact same model — google/gemini-2.5-flash-lite — so the comparison measures the orchestration, not a model upgrade. Three configurations, head to head:

solo — minimal pipeline, straight to code (no review)
council_same — the full pipeline: architect → coder → reviewer → tester → auditor
council_diverse — the full pipeline with the orchestration turned up: swarm high + autonomous work mode

Latest run — 2026-06-07 · 5 tasks per config · all on gemini-2.5-flash-lite

Configuration	Pass rate	Mean score	Mean cost	Mean time
`solo`	20% (1/5)	54%	$0.0026	8s
`council_same`	40% (2/5)	83%	$0.018	5m 8s
`council_diverse`	80% (4/5)	98%	$0.072	57m

Eval results table in the CLI

Takeaway: with the same model in every seat, just adding the multi-agent pipeline (review + test + audit) doubles the pass rate over a single call (solo 20% → council_same 40%), and turning the orchestration up — swarm high + autonomous looping — doubles it again to 80%, lifting the mean test score from 54% → 98%. The gains here come purely from how the agents work together, not from a stronger model.

🤖 Model under test

All three configurations ran on a single, cheap model so no config has a model advantage:

Configuration	Model	What's different
`solo`	`google/gemini-2.5-flash-lite`	minimal pipeline — straight to code
`council_same`	`google/gemini-2.5-flash-lite`	full pipeline — review, test, audit
`council_diverse`	`google/gemini-2.5-flash-lite`	full pipeline + swarm high + autonomous work mode

This isolates the value of the orchestration itself. In real use you can go further and assign a different model family per role (see Default Model Panel) so independent models catch each other's blind spots on top of these gains.

Cost and time scale with depth — deeper orchestration thinks longer and harder. Use the work modes to dial the trade-off for your task. Full per-task results: eval_results/.

💡 Why This Works — Buy Quality With Cheap Tokens

Here's the whole idea in one sentence: spend more cheap tokens instead of paying for an expensive model — and let it happen on its own.

A single call to a small, cheap model gets you a small, cheap answer (solo: 20% pass rate). But the same cheap model, when it's allowed to review its own work, run the tests, see the failures, fix them, and loop again — autonomously — climbs to 80%. Same model. The only thing that changed is that it kept working.

💸 Cheap tokens, premium results. A frontier model charges a premium per token for one shot. Konklave takes a model that costs a fraction of that and simply uses more of it — many small, cheap passes add up to an answer that rivals (or beats) the expensive one-shot, at a lower model tier.
🤖 Fully autonomous — you don't babysit it. You don't re-prompt, you don't paste error messages back in, you don't say "now write tests." The pipeline reviews, tests, audits, and re-tries by itself until the Auditor is satisfied (or the budget cap stops it). The extra token burn is automatic.
🔁 More tokens are the feature, not a bug. Yes — council_diverse uses far more tokens than a single call. That's the point: those tokens are the quality. You're trading something cheap (tokens from a small model) for something valuable (a correct, tested result) — and you set the ceiling with work modes and a budget cap.

Bottom line: don't pay for a smarter model — let a cheaper one think longer, automatically. More token burn, fully autonomous, better output.

🚀 Installation

Requirements: Python 3.11+ · OpenRouter API key (free tier available)

# 1. Install
pip install konklave

# 2. First-time setup (language, API key, default models)
konklave init

# 3. Launch the interactive TUI
konklave

Windows: double-click start.bat — activates the environment and launches Konklave automatically.

Desktop GUI (optional)

pip install "konklave[gui]"   # adds PySide6
konklave-gui                  # or: python -m konklave.gui

🏁 Quickstart

# Interactive TUI (recommended)
konklave

# One-shot headless run
konklave run "Add input validation to the login form"

# Pick a pipeline depth
konklave run "Refactor the payment module" --work deep

# Resume an interrupted session
konklave resume

# Test your OpenRouter connection
konklave ping

# Show current config
konklave config

⚙️ Configuration

konklave init      # re-run the setup wizard (API key, models, language)
konklave config    # print current settings and their config-file path

Config is stored per-platform — never inside the project folder:

Platform	Path
Windows	`%APPDATA%\konklave\config.toml`
macOS	`~/Library/Application Support/konklave/config.toml`
Linux	`~/.config/konklave/config.toml`

Your API key is stored in the OS keyring (Windows Credential Manager / macOS Keychain / Linux SecretService) — never in a plain file.

Quick-start with custom settings — copy and edit the bundled template:

# find the correct path first
konklave config --path

# then copy config.example.toml from the repo there and edit it

config.example.toml (included in the repo) documents every available setting with inline comments, including model assignments, review-panel tuning, web search, semantic index, media generation, and dashboard options.

📊 Web Dashboard

A FastAPI + browser dashboard for everything that's hard to see in a terminal: live runs, full conversation history, eval comparisons, and settings.

# Launch the dashboard on http://localhost:8000
python -m uvicorn konklave.dashboard.app:app --host 127.0.0.1 --port 8000

Windows: double-click start_dashboard.bat — it sets up the environment, opens your browser, and starts the server.

📜 Browse and export any past session (Markdown / JSON)
📈 Compare eval runs side by side (the screenshot above)
⚙️ Edit configuration and per-role models from the browser
🔒 Binds to 127.0.0.1 only, with a same-origin CSRF guard — local by default

🧠 Persistent Memory

Konklave remembers across sessions. Two human-editable files plus full-text search over every past run:

Store	What it holds
`MEMORY.md`	Workspace-scoped facts the agents learn while working (auto-compacted when it grows too large)
`USER.md`	Global facts about you and your preferences, applied to every workspace
Session DB	SQLite + FTS5 full-text index over all past messages — agents can search what happened before

Memory lives under ~/.konklave/ and is never committed (it's in .gitignore) — it can contain conversation history and personal context. Agents read it at the start of a run and can write new facts via the memory tool.

💻 Platform Support

Platform	Status
Windows 10 / 11	✅
macOS 13+	✅
Linux	✅

🧪 Development & Tests

Konklave ships with a 628-test suite (unit, smoke, and integration) covering the pipeline engine, agents, tools, memory, the dashboard API, and the eval harness.

# Install with dev extras
pip install -e ".[dev]"

# Run the full suite
pytest

# Skip the TUI tests (need a real terminal) and the slow ones
pytest -m "not tui and not slow"

# Run the benchmarks yourself (real OpenRouter calls — costs money)
konklave eval run --configs solo,council_same,council_diverse

Windows: test.bat runs the suite; eval_run.bat runs the benchmarks.

📄 License

Konklave is dual-licensed.

🆓 Open source — GNU AGPL-3.0-or-later (see LICENSE). Free for personal, academic, and open-source use. Note: the AGPL is strong copyleft — if you modify Konklave or offer it to others over a network, you must release your full source under the AGPL too.
💼 Commercial license — for companies that want to use Konklave in a closed-source product or internal tool without the AGPL's source-disclosure obligations. See COMMERCIAL-LICENSE.md.

Built with ❤️ using OpenRouter · UI powered by Textual

⭐ Star this repo if you find it useful! ⭐

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

firo2525

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.0

Jun 14, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

konklave-0.1.0.tar.gz (358.7 kB view details)

Uploaded Jun 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

konklave-0.1.0-py3-none-any.whl (455.2 kB view details)

Uploaded Jun 14, 2026 Python 3

File details

Details for the file konklave-0.1.0.tar.gz.

File metadata

Download URL: konklave-0.1.0.tar.gz
Upload date: Jun 14, 2026
Size: 358.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for konklave-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`23a7f66c64a861989cb344373bbf98eafc59cec41358106118c29294420426be`
MD5	`0c346748d5bc3fc46efe60ae036528b8`
BLAKE2b-256	`b5ee5840535d818d387b8a13bcfa32f1ee3862c328aa7661caa548ba98c9664d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for konklave-0.1.0.tar.gz:

Publisher: publish.yml on firo2525/konklave

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: konklave-0.1.0.tar.gz
- Subject digest: 23a7f66c64a861989cb344373bbf98eafc59cec41358106118c29294420426be
- Sigstore transparency entry: 1819869105
- Sigstore integration time: Jun 14, 2026
Source repository:
- Permalink: firo2525/konklave@b45c48d158fd05b9719935f1243ece46140125c5
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/firo2525
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@b45c48d158fd05b9719935f1243ece46140125c5
- Trigger Event: release

File details

Details for the file konklave-0.1.0-py3-none-any.whl.

File metadata

Download URL: konklave-0.1.0-py3-none-any.whl
Upload date: Jun 14, 2026
Size: 455.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for konklave-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0da6444300e01dbe94d2a79791284eb9e6b151716f33f747e7c7db79a8701fdb`
MD5	`1ecf7baf9f257e87c32dd8c21de9f33a`
BLAKE2b-256	`d78f029042f07d480fc74114a4b81e2af23a3cc92cd4eeaa47a3530e8ec2c18e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for konklave-0.1.0-py3-none-any.whl:

Publisher: publish.yml on firo2525/konklave

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: konklave-0.1.0-py3-none-any.whl
- Subject digest: 0da6444300e01dbe94d2a79791284eb9e6b151716f33f747e7c7db79a8701fdb
- Sigstore transparency entry: 1819869149
- Sigstore integration time: Jun 14, 2026
Source repository:
- Permalink: firo2525/konklave@b45c48d158fd05b9719935f1243ece46140125c5
- Branch / Tag: refs/tags/v0.1.0
- Owner: https://github.com/firo2525
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@b45c48d158fd05b9719935f1243ece46140125c5
- Trigger Event: release

konklave 0.1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

🌐 konklave.info

🏛️ K O N K L A V E

A council of AI models that thinks, codes, reviews, and ships — together.

🚀 Releases June 15, 2026

🎬 Screenshots

✨ How It Works

✨ What's New in This Release

🧠 Six Specialized Roles

⚡ Work Modes

🌊 Swarm — Parallel Sub-Agents

🎮 Human-in-the-Loop

📦 Built-in Pipelines

🏗️ Architecture

🤖 Default Model Panel

📊 Benchmarks

🤖 Model under test

💡 Why This Works — Buy Quality With Cheap Tokens

🚀 Installation

Desktop GUI (optional)

🏁 Quickstart

⚙️ Configuration

📊 Web Dashboard

🧠 Persistent Memory

💻 Platform Support

🧪 Development & Tests

📄 License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance