Multi-LLM Council Framework for adversarial debate, cross-validation, and structured decision-making

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

The LLM Council

$ council run architect "Design a mass hallucination prevention system"

                    ╔══════════════════════════════════════════════════════════╗
                    ║             ⚖️  THE LLM COUNCIL CONVENES  ⚖️              ║
                    ╚══════════════════════════════════════════════════════════╝

      ┌─────────────────┐      ┌─────────────────┐      ┌─────────────────┐
      │  ┌───────────┐  │      │  ┌───────────┐  │      │  ┌───────────┐  │
      │  │ ╭───────╮ │  │      │  │ ╭───────╮ │  │      │  │ ╭───────╮ │  │
      │  │ │GPT5.2│ │  │      │  │ │ CLAUDE│ │  │      │  │ │GEMINI │ │  │
      │  │ ╰───────╯ │  │      │  │ ╰───────╯ │  │      │  │ ╰───────╯ │  │
      │  │   ◉ ◉     │  │      │  │   ◉ ◉     │  │      │  │   ◉ ◉     │  │
      │  │    ⌣      │  │      │  │    ▽      │  │      │  │    ○      │  │
      │  └───────────┘  │      │  └───────────┘  │      │  └───────────┘  │
      │    JUDGE #1     │      │    JUDGE #2     │      │    JUDGE #3     │
      └────────┬────────┘      └────────┬────────┘      └────────┬────────┘
               │                        │                        │
               │ "I propose we use      │ "Actually, I must      │ "Interesting, but
               │  a vector database..." │  respectfully disagree" │  what about...?"
               │                        │                        │
               └────────────────────────┼────────────────────────┘
                                        ▼
                         ┌──────────────────────────────┐
                         │     🔥 ADVERSARIAL DEBATE 🔥   │
                         │                              │
                         │  GPT5.2: "Your approach has  │
                         │          a cold start issue" │
                         │                              │
                         │  CLAUDE: "Fair, but yours    │
                         │          doesn't scale"      │
                         │                              │
                         │  GEMINI: "Both valid. What   │
                         │          if we combine..."   │
                         └──────────────┬───────────────┘
                                        ▼
                         ┌──────────────────────────────┐
                         │      ✅ VERDICT REACHED ✅     │
                         │                              │
                         │   Synthesized best ideas     │
                         │   Schema-validated output    │
                         │   Confidence: 94%            │
                         └──────────────────────────────┘

[Council] Task completed in 45.2s | 3 judges | 2 debate rounds | Cost: $0.12

The LLM Council - Multiple AI models debating as judges

A Multi-LLM Council Framework that orchestrates multiple LLM backends to enable adversarial debate, cross-validation, and structured decision-making.

Why Use a Council?

Single-model outputs have blind spots. By running multiple models in parallel and having them critique each other, the council:

Catches errors that any single model might miss
Reduces hallucination through cross-validation
Produces higher-quality outputs via adversarial refinement
Validates structure with JSON schema enforcement and retry logic

Features

Feature	Description
Multi-Model Council	Run Claude, GPT-4, and Gemini in parallel via single OpenRouter key
Adversarial Critique	Built-in critique phase identifies weaknesses and blind spots
Schema Validation	JSON schema validation with automatic retry for structured outputs
Provider Agnostic	Swap between OpenRouter, direct APIs, or CLI-based providers
Health Checks	Preflight provider health checks with latency tracking
Graceful Degradation	Automatic retry, fallback, and skip strategies for failures
Artifact Store	Persistent storage of drafts with tiered summarization
Secret-Safe Logging	Redaction pipeline prevents credential leakage

Requirements

Requirement	Details
Python	3.10, 3.11, or 3.12
OS	macOS, Linux, Windows (native or WSL)
API Key	At least one provider key (see below)

Supported Providers

Provider	Environment Variable	Notes
OpenRouter	`OPENROUTER_API_KEY`	Recommended - single key for all models
OpenAI	`OPENAI_API_KEY`	Direct GPT access
Anthropic	`ANTHROPIC_API_KEY`	Direct Claude access
Google	`GOOGLE_API_KEY` or `GEMINI_API_KEY`	Direct Gemini access
Vertex AI	`GOOGLE_CLOUD_PROJECT` or `ANTHROPIC_VERTEX_PROJECT_ID` + ADC	Enterprise GCP - Gemini + Claude

Installation

pip install the-llm-council

With specific providers:

# OpenRouter (recommended - single API key for all models)
pip install the-llm-council

# Direct APIs
pip install the-llm-council[anthropic,openai,google]

# Vertex AI (Enterprise GCP)
pip install the-llm-council[vertex]

# All providers
pip install the-llm-council[all]

# Development
pip install the-llm-council[dev]

Agent Skills (Claude Code, OpenAI Codex, Cursor, etc.)

The LLM Council is available as an Agent Skill following the open Agent Skills standard. This works across Claude Code, OpenAI Codex, Cursor, VS Code, and other skill-compatible agents.

Claude Code

# Step 1: Add the repo as a marketplace
/plugin marketplace add sherifkozman/the-llm-council

# Step 2: Install the plugin
/plugin install llm-council@the-llm-council

Once installed, the council skill is auto-invoked when relevant, or use the /council command:

/council implementer "Build a login page with OAuth"

OpenAI Codex

# Copy skills directory to Codex skills location
cp -r skills/council ~/.codex/skills/

Other Agents (Cursor, VS Code, GitHub, etc.)

Copy the skills/council/ directory to your agent's skills folder. The skill follows the open Agent Skills spec and works with any compatible agent.

Quick Start

CLI Usage

# Set your API key
export OPENROUTER_API_KEY="your-key"

# Run a council task (v0.5.0 syntax with modes)
council run drafter --mode impl "Build a login page with OAuth"

# Multi-model council (Claude + GPT-5 + Gemini debating)
council run drafter --mode arch "Design a caching layer" \
  --models "anthropic/claude-opus-4-5,openai/gpt-5.1,google/gemini-3-flash-preview"

# Or set via environment variable
export COUNCIL_MODELS="anthropic/claude-opus-4-5,openai/gpt-5.1,google/gemini-3-flash-preview"
council run drafter "Build a login page"

# Code review with security analysis
council run critic --mode review "Review auth changes" --verbose

# Disable artifact storage for faster runs
council run drafter "Quick fix" --no-artifacts

# Get structured JSON output
council run planner "Add user authentication" --json

# Legacy syntax still works (shows deprecation warning)
council run implementer "Build a login page"  # → drafter --mode impl

Python API

from llm_council import Council
from llm_council.protocol.types import CouncilConfig

# With mode configuration
config = CouncilConfig(providers=["openrouter"], mode="impl")
council = Council(config=config)
result = await council.run(
    task="Build a login page with OAuth",
    subagent="drafter"
)
print(result.output)

Check Provider Health

council doctor

Architecture

┌─────────────────────────────────────────────────────────────────────┐
│                           LLM Council                               │
├─────────────────────────────────────────────────────────────────────┤
│                                                                     │
│  ┌─────────────┐    ┌─────────────┐    ┌─────────────────────────┐ │
│  │    CLI      │───▶│  Council    │───▶│     Orchestrator        │ │
│  │  (typer)    │    │   (API)     │    │                         │ │
│  └─────────────┘    └─────────────┘    │  ┌───────────────────┐  │ │
│                                        │  │  Health Checker   │  │ │
│  ┌─────────────────────────────────┐   │  ├───────────────────┤  │ │
│  │        Provider Registry        │◀──│  │ Degradation Policy│  │ │
│  │  ┌─────────┐ ┌─────────┐       │   │  ├───────────────────┤  │ │
│  │  │OpenRouter│ │Anthropic│ ...  │   │  │  Artifact Store   │  │ │
│  │  └─────────┘ └─────────┘       │   │  └───────────────────┘  │ │
│  └─────────────────────────────────┘   └─────────────────────────┘ │
│                                                                     │
│  ┌─────────────────────────────────────────────────────────────┐   │
│  │                    Subagent Configs                          │   │
│  │  router | planner | architect | implementer | reviewer | ... │   │
│  └─────────────────────────────────────────────────────────────┘   │
│                                                                     │
│  ┌─────────────────────────────────────────────────────────────┐   │
│  │                     JSON Schemas                             │   │
│  │  Validation & retry logic for structured outputs             │   │
│  └─────────────────────────────────────────────────────────────┘   │
│                                                                     │
└─────────────────────────────────────────────────────────────────────┘

Pipeline Flow

0. HEALTH CHECK (optional)
   └── Preflight check of all providers, skip unhealthy ones

1. PARALLEL DRAFTS
   ├── Provider A generates draft
   ├── Provider B generates draft
   └── Provider C generates draft
   └── (Graceful degradation on failures)

2. ADVERSARIAL CRITIQUE
   └── Critic identifies weaknesses, contradictions, blind spots

3. SYNTHESIS
   └── Merge best elements, address critique, validate schema

4. VALIDATION
   └── JSON schema check with retry on failure

5. ARTIFACT STORAGE (optional)
   └── Store drafts and outputs for context management

Subagents (v0.5.0)

Core Agents

Subagent	Modes	Purpose	Example
`drafter`	`impl`, `arch`, `test`	Generate code, designs, tests	"Build the login page"
`critic`	`review`, `security`	Review and analyze	"Review this PR for security"
`synthesizer`	-	Merge and finalize	"Generate changelog for v1.2"
`researcher`	-	Technical research	"Research OAuth providers"
`planner`	`plan`, `assess`	Roadmaps and decisions	"Plan the auth implementation"
`router`	-	Classify and route tasks	"Is this a bug or feature?"

Agent Modes

# drafter modes
council run drafter --mode impl "Build login page"     # Implementation (default)
council run drafter --mode arch "Design caching layer" # Architecture
council run drafter --mode test "Design test suite"    # Test design

# critic modes
council run critic --mode review "Review PR"           # Code review (default)
council run critic --mode security "Analyze auth"      # Security analysis

# planner modes
council run planner --mode plan "Plan implementation"  # Planning (default)
council run planner --mode assess "Redis vs Memcached" # Build vs buy

Deprecated Aliases (Backwards Compatible)

The following legacy names still work but show a deprecation warning:

Old Name	Use Instead	Removed In
`implementer`	`drafter --mode impl`	v1.0
`architect`	`drafter --mode arch`	v1.0
`test-designer`	`drafter --mode test`	v1.0
`reviewer`	`critic --mode review`	v1.0
`red-team`	`critic --mode security`	v1.0
`assessor`	`planner --mode assess`	v1.0
`shipper`	`synthesizer`	v1.0

Writing a Provider

Providers are pluggable via Python entry points. See the full Provider Development Guide for detailed instructions.

Quick Example

from llm_council.providers.base import ProviderAdapter, GenerateRequest, GenerateResponse

class MyProvider(ProviderAdapter):
    name = "myprovider"

    async def generate(self, request: GenerateRequest) -> GenerateResponse:
        # Your implementation
        return GenerateResponse(text="...", content="...")

    async def doctor(self) -> DoctorResult:
        return DoctorResult(ok=True, message="Healthy")

[project.entry-points."llm_council.providers"]
myprovider = "my_package.providers:MyProvider"

Reference Implementations

Provider	Type	File
OpenRouter	HTTP API	`src/llm_council/providers/openrouter.py`
Anthropic	Native SDK	`src/llm_council/providers/anthropic.py`
OpenAI	Native SDK	`src/llm_council/providers/openai.py`
Google	Native SDK	`src/llm_council/providers/google.py`
Vertex AI	Native SDK	`src/llm_council/providers/vertex.py`
Codex CLI	Subprocess	`src/llm_council/providers/cli/codex.py`

Configuration

Environment Variables

# OpenRouter (recommended - single key for all models)
export OPENROUTER_API_KEY="your-key"

# Direct APIs
export ANTHROPIC_API_KEY="sk-ant-..."
export OPENAI_API_KEY="sk-..."
export GOOGLE_API_KEY="..."

# Vertex AI - Gemini (Enterprise GCP)
export GOOGLE_CLOUD_PROJECT="your-project-id"
export GOOGLE_CLOUD_LOCATION="us-central1"  # optional
export VERTEX_AI_MODEL="gemini-2.5-pro"     # optional, default: gemini-2.0-flash

# Vertex AI - Claude (Enterprise GCP)
export ANTHROPIC_VERTEX_PROJECT_ID="your-project-id"
export CLOUD_ML_REGION="global"              # Claude uses global region
export ANTHROPIC_MODEL="claude-opus-4-5@20251101"  # model with version

# Auth for Vertex AI: gcloud auth application-default login OR
# export GOOGLE_APPLICATION_CREDENTIALS="/path/to/sa.json"

# Multi-model council: comma-separated OpenRouter model IDs
export COUNCIL_MODELS="anthropic/claude-opus-4-5,openai/gpt-5.1,google/gemini-3-flash-preview"

# Optional: Model pack overrides for specific task types
export COUNCIL_MODEL_FAST="anthropic/claude-3-5-haiku"    # Quick tasks
export COUNCIL_MODEL_REASONING="anthropic/claude-opus-4-5" # Deep analysis
export COUNCIL_MODEL_CODE="openai/gpt-5.1"                # Code generation
export COUNCIL_MODEL_CRITIC="anthropic/claude-sonnet-4-5" # Adversarial critique

Per-Subagent Reasoning Configuration (v0.3.0+)

Subagents can be configured with provider preferences, model overrides, and extended reasoning/thinking budgets in their YAML configs:

# src/llm_council/subagents/red-team.yaml
name: red-team
model_pack: harsh_critic

# Provider preferences
providers:
  preferred: [anthropic, openai]
  fallback: [openrouter]
  exclude: [google]

# Model overrides per provider
models:
  anthropic: claude-opus-4-5
  openai: o3-mini
  google: gemini-3-pro

# Extended reasoning/thinking configuration
reasoning:
  enabled: true
  effort: high           # OpenAI o-series: low/medium/high
  budget_tokens: 32768   # Anthropic: 1024-128000
  thinking_level: high   # Google Gemini 3.x: minimal/low/medium/high

Provider	Parameter	Values	Description
OpenAI	`effort`	low/medium/high	Reasoning effort for o-series models
Anthropic	`budget_tokens`	1024-128000	Extended thinking token budget
Google	`thinking_level`	minimal/low/medium/high	Gemini 3.x thinking level

Default Reasoning Tiers (v0.4.0+)

All subagents have pre-configured reasoning defaults based on task complexity:

Tier	Subagents	Config	Use Case
High	architect, assessor, planner, reviewer, red-team	`effort: high`, `budget_tokens: 16384`	Deep analysis, critical decisions
Medium	implementer, researcher	`effort: medium`, `budget_tokens: 8192`	Balanced code/research tasks
Disabled	router, shipper, test-designer	`enabled: false`	Fast tasks, no overhead

Config File

# ~/.config/llm-council/config.yaml
providers:
  - name: openrouter
    api_key: ${OPENROUTER_API_KEY}
    default_model: anthropic/claude-opus-4-5

defaults:
  timeout: 120
  max_retries: 3
  summary_tier: actions

CLI Reference

council run <subagent> "<task>"    # Run a council task
council doctor                      # Check provider health
council config                      # Show configuration

# Options
--mode             Agent mode (impl/arch/test for drafter, review/security for critic, etc.)
--providers, -p    Comma-separated provider list
--models, -m       Comma-separated OpenRouter model IDs for multi-model council
--no-artifacts     Disable artifact storage
--json             Output structured JSON
--verbose, -v      Verbose output

# Config file options (moved from CLI in v0.5.0)
# Set these in ~/.config/llm-council/config.yaml under 'defaults:'
#   timeout: 120
#   max_retries: 3
#   enable_degradation: true

Development

# Clone the repository
git clone https://github.com/sherifkozman/the-llm-council.git
cd the-llm-council

# Install with dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Run linting
ruff check src/
mypy src/llm_council

Contributing

Contributions are welcome! See our Roadmap for planned features and Contributing Guide for details.

Quick Start

# Fork and clone
git clone https://github.com/YOUR_USERNAME/the-llm-council.git
cd the-llm-council

# Install dev dependencies
pip install -e ".[dev]"

# Run tests
pytest

# Run linting
ruff check src/ && mypy src/llm_council

Contribution Workflow

Fork the repository
Create a feature branch (git checkout -b feature/amazing-feature)
Make your changes
Test your changes (pytest)
Lint your code (ruff check src/ && mypy src/llm_council)
Commit with a clear message (git commit -m 'Add amazing feature')
Push to your branch (git push origin feature/amazing-feature)
Open a Pull Request

What We're Looking For

New Providers: Add support for more LLM backends
New Subagents: Create specialized agents for specific tasks
Bug Fixes: Found a bug? We'd love a fix!
Documentation: Improvements to docs are always welcome
Tests: More test coverage is great

Security

For security concerns, please see our Security Policy or email vibecode@sherifkozman.com.

Key security features:

CLI adapters use exec-style subprocess (no shell injection)
Environment variable allowlisting prevents secret leakage
Path traversal protection in artifact storage
Configurable secret redaction in logs

License

MIT License - see LICENSE for details.

Acknowledgments

Built with:

Pydantic - Data validation
Typer - CLI framework
Rich - Terminal formatting
httpx - Async HTTP client

When one model isn't enough, convene a council.

_{~ vibe coded by Sherif Kozman & The LLM Council ~}

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

sherifkozman

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.7.16

Apr 10, 2026

0.7.15

Apr 8, 2026

0.7.14

Apr 8, 2026

0.7.13

Apr 7, 2026

0.7.12

Apr 2, 2026

0.7.11

Apr 2, 2026

0.7.10

Apr 2, 2026

0.7.9

Apr 1, 2026

0.7.8

Apr 1, 2026

0.7.7

Mar 31, 2026

0.7.6

Mar 31, 2026

0.7.5

Mar 29, 2026

0.7.4

Mar 29, 2026

0.7.3

Mar 29, 2026

0.7.2

Mar 28, 2026

0.7.1

Mar 28, 2026

0.7.0

Mar 28, 2026

0.6.0

Mar 14, 2026

0.5.3

Jan 4, 2026

0.5.2

Jan 3, 2026

This version

0.5.1

Jan 3, 2026

0.5.0

Dec 26, 2025

0.4.13

Dec 24, 2025

0.4.12

Dec 24, 2025

0.4.11

Dec 24, 2025

0.4.10

Dec 24, 2025

0.4.9

Dec 23, 2025

0.4.8

Dec 23, 2025

0.4.7

Dec 23, 2025

0.4.6

Dec 23, 2025

0.4.5

Dec 23, 2025

0.4.4

Dec 23, 2025

0.4.3

Dec 23, 2025

0.4.2

Dec 23, 2025

0.4.1

Dec 23, 2025

0.4.0

Dec 23, 2025

0.3.1

Dec 23, 2025

0.3.0

Dec 23, 2025

0.2.3

Dec 23, 2025

0.2.2

Dec 23, 2025

0.2.1

Dec 23, 2025

0.2.0

Dec 20, 2025

0.1.0

Dec 20, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

the_llm_council-0.5.1.tar.gz (122.1 kB view details)

Uploaded Jan 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

the_llm_council-0.5.1-py3-none-any.whl (152.1 kB view details)

Uploaded Jan 3, 2026 Python 3

File details

Details for the file the_llm_council-0.5.1.tar.gz.

File metadata

Download URL: the_llm_council-0.5.1.tar.gz
Upload date: Jan 3, 2026
Size: 122.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for the_llm_council-0.5.1.tar.gz
Algorithm	Hash digest
SHA256	`d7cedc7d478ff21800eb54d12253aaf34de4f09b8a6a2be8ce9d4a939a254d21`
MD5	`4a5bd5bfc3dc473a0d5b24d162e20126`
BLAKE2b-256	`a7e665e2b5024b604895a40e5cd2510e00a5a162b644a70080355122a824b257`

See more details on using hashes here.

Provenance

The following attestation bundles were made for the_llm_council-0.5.1.tar.gz:

Publisher: publish.yml on sherifkozman/the-llm-council

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: the_llm_council-0.5.1.tar.gz
- Subject digest: d7cedc7d478ff21800eb54d12253aaf34de4f09b8a6a2be8ce9d4a939a254d21
- Sigstore transparency entry: 789182456
- Sigstore integration time: Jan 3, 2026
Source repository:
- Permalink: sherifkozman/the-llm-council@e7b40fd19794fea457e2d3e33b0428302e534291
- Branch / Tag: refs/tags/v0.5.1
- Owner: https://github.com/sherifkozman
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@e7b40fd19794fea457e2d3e33b0428302e534291
- Trigger Event: release

File details

Details for the file the_llm_council-0.5.1-py3-none-any.whl.

File metadata

Download URL: the_llm_council-0.5.1-py3-none-any.whl
Upload date: Jan 3, 2026
Size: 152.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for the_llm_council-0.5.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7a23014eecd80587111f89cad76ebb8b26a600ff5b307b874aae6ec5ba0ba2c0`
MD5	`d1dcb4c20afd92c8b36e05dc025a43bf`
BLAKE2b-256	`36bac18f906835db1d58bdd6c1719c821a8820377bd9f4aedcbbf4ae904bd957`

See more details on using hashes here.

Provenance

The following attestation bundles were made for the_llm_council-0.5.1-py3-none-any.whl:

Publisher: publish.yml on sherifkozman/the-llm-council

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: the_llm_council-0.5.1-py3-none-any.whl
- Subject digest: 7a23014eecd80587111f89cad76ebb8b26a600ff5b307b874aae6ec5ba0ba2c0
- Sigstore transparency entry: 789182458
- Sigstore integration time: Jan 3, 2026
Source repository:
- Permalink: sherifkozman/the-llm-council@e7b40fd19794fea457e2d3e33b0428302e534291
- Branch / Tag: refs/tags/v0.5.1
- Owner: https://github.com/sherifkozman
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@e7b40fd19794fea457e2d3e33b0428302e534291
- Trigger Event: release

the-llm-council 0.5.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

The LLM Council

Why Use a Council?

Features

Requirements

Supported Providers

Installation

Agent Skills (Claude Code, OpenAI Codex, Cursor, etc.)

Claude Code

OpenAI Codex

Other Agents (Cursor, VS Code, GitHub, etc.)

Quick Start

CLI Usage

Python API

Check Provider Health

Architecture

Pipeline Flow

Subagents (v0.5.0)

Core Agents

Agent Modes

Deprecated Aliases (Backwards Compatible)

Writing a Provider

Quick Example

Reference Implementations

Configuration

Environment Variables

Per-Subagent Reasoning Configuration (v0.3.0+)

Default Reasoning Tiers (v0.4.0+)

Config File

CLI Reference

Development

Contributing

Quick Start

Contribution Workflow

What We're Looking For

Security

License

Acknowledgments

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance