Reference implementation of the Symposium protocol — structured, sequential, adversarial multi-agent deliberation.
Project description
Symposium
An opinionated protocol for structured, sequential, adversarial multi-agent deliberation.
What is this?
Symposium is a protocol specification + a reference Python runtime that orchestrates a small panel of LLM-backed agents through a structured, turn-based deliberation, producing a single, replayable, schema-validated artifact.
It is not a generic agent framework. It enforces exactly one conversation topology — fixed panel, one primary turn per agent per round, one structurally-separated coordinator, bounded forks — and trades topology flexibility for testable scheduler invariants and byte-identical replay of any past session.
Two things ship together in this repo:
docs/specification.md— the normative protocol. Implementable in any language. The spec is what conformance means.symposium/— the reference Python runtime. Today: full scheduler, persistence, replay, the deterministicFakeProvideradapter, an OpenAI-shaped HTTP adapter (real OpenAI plus self-hosted OpenAI-compatible endpoints), and an Anthropic-shaped HTTP adapter (real Anthropic plus self-hosted Anthropic-compatible endpoints).
Why one more protocol?
Most multi-agent stacks expose enough flexibility (group chat, arbitrary handoffs, nested supervisors) that any two implementations diverge on the parts that matter — when does the conversation stop, what exactly is replayed, what fails the run, how is delegation routed. Each implementation invents its own answers, and operators end up debugging the framework instead of the agents.
Symposium goes the opposite way: one opinionated topology, sharp boundaries, closed enums. What you get in exchange:
| Symposium | |
|---|---|
| Topology | Fixed deliberation_panel, one primary_turn per agent per round, single coordination_turn from a structurally-separated coordinator_agent. |
| Inter-agent routing | Schema-validated direct_request only. Inline @AgentName in prose is never routing — prompt-injection resistant by construction. |
| Roles | Three-way separation: Selector chooses who, CoordinatorAgent recommends what next (LLM, no executive power), OrchestratorRuntime schedules and terminates (deterministic code, sole party that decides when a session stops). |
| Failure surface | Closed 7-value termination-reason enum; closed 12-value adapter error.kind enum; closed 3-value on_agent_failure policy. |
| Replayability | Four distinct contracts documented separately: transcript_replay (unconditional byte identity), execution_replay (conditional on ten pinning conditions), golden-test byte identity, fake_provider determinism. No "it should be deterministic" hand-waving. |
| Persistence | Canonical Artifact (§5.10) with RFC-8785 JCS-canonicalized transcript_digest (SHA-256). Tamper-evident. |
| Execution mode | MVP is batch-only (ADR-004). Interactive / event-stream / async are explicitly v1+. |
Full discussion in §10 Competitive Positioning of the spec.
Quick start
The reference runtime ships three adapters out of the box: the
deterministic FakeProvider (for tests and reproducible demos), an
OpenAI-shaped HTTP adapter (for real-model sessions against
api.openai.com or any OpenAI-Chat-Completions-compatible endpoint),
and an Anthropic-shaped HTTP adapter (for real-model sessions against
api.anthropic.com or any Anthropic-Messages-compatible endpoint).
Every flow produces a persisted, byte-identically replayable artifact.
The distribution name is symposium-protocol; the import package is
symposium (cf. scikit-learn → sklearn).
# Stable install (PyPI)
pip install symposium-protocol # then: import symposium
# Released tag, straight from GitHub (works without PyPI)
pip install "git+https://github.com/terrordrummer/symposium@v1.5.0"
# Development install (editable, from a clone)
git clone https://github.com/terrordrummer/symposium
cd symposium
pip install -e ".[test]"
Fake-driven session (no API key, no network)
symposium run \
--config examples/configs/walking-skeleton.yaml \
--script examples/scripts/walking-skeleton.json \
--output runs/ \
examples/problem.md
# Replay (byte-identity check on the stored canonical_transcript)
symposium replay runs/demo-walking-skeleton-001
# Validate the artifact against the v1.0.0 JSON Schemas
symposium validate runs/demo-walking-skeleton-001/artifact.json
OpenAI-driven session
export OPENAI_API_KEY=sk-...
# Optional: point at a self-hosted OpenAI-compatible endpoint
# export OPENAI_BASE_URL=https://my-llm-proxy.internal/v1
symposium run \
--config examples/configs/openai.yaml \
--output runs/ \
examples/problem.md
Anthropic-driven session
export ANTHROPIC_API_KEY=sk-ant-...
# Optional: point at a self-hosted Anthropic-compatible endpoint
# export ANTHROPIC_BASE_URL=https://my-llm-proxy.internal/v1
symposium run \
--config examples/configs/anthropic.yaml \
--output runs/ \
examples/problem.md
Selecting the panel
Before round 1 the §4.1 selector chooses the active deliberation
panel and binds the coordinator. Config.selector.strategy picks one of
three strategies, each emitting a schema-valid SelectorOutput
(§5.11) written to <run_dir>/selector_output.json on every run:
fixed(default, MVP/R3) — degenerate: the panel is the declareddefault_deliberation_paneland the coordinator is the declaredcoordinator_agent. Makes no provider call.rules— pure, deterministic. Matches each agent's persona metadata (reasoning_scope/domain_scope) against theproblem_statementvia a transparent keyword table; records dropped agents inexcluded_agents. No provider call, so the same(config)yields a byte-identical decision (and stays replayable under §7.6).llm— one bounded provider invocation (the §6.2expected_output_schema = nullfree-text path, driven by the coordinator agent'sprovider/model) parsed into aSelectorOutput. Requires aselector_budget(§5.2); its usage is budgeted separately and never entersArtifact.cumulative_usageor thetranscript_digest. For fake sessions, script the single selector call with--selector-script(mirrors--script).
# rules: deterministic, no model call
symposium run \
--config examples/configs/rules-selector.yaml \
--script examples/scripts/walking-skeleton.json \
--output runs/
# llm: one bounded selector call (separate fake script) + deliberation
symposium run \
--config examples/configs/llm-selector.yaml \
--selector-script examples/scripts/llm-selector.json \
--script examples/scripts/walking-skeleton.json \
--output runs/
# → stdout: selector_strategy=… / selected_agents=…
# → <run_dir>/selector_output.json
The selector is a distinct ADR-005 role: it chooses who deliberates,
emits no canonical_transcript message, and an empty/malformed selection
terminates the session with reason = schema_error before round 1.
Inspecting metrics
Every persisted run directory can be analysed offline with symposium metrics, which computes the §7.9 MVP observability set (token / cost
usage per agent and per (provider, model), latency per invocation,
participation per round, branch depth, deferred-queue length, panel
contractions, schema-failure counts, termination reason, the
usage_estimated flag) and writes metrics.json next to the
artifact:
symposium metrics runs/demo-walking-skeleton-001
# → runs/demo-walking-skeleton-001/metrics.json (full breakdown)
# → stdout: one-screen human-readable summary
The §7.9 set is deliberately MVP — role_purity_score,
disagreement_frequency, interaction_graph,
delegation_frequency, per-invocation provider-retry counts and a
live observability_event stream are §7.10 v1+ extensions and
formally deferred. The MVP set is fully derivable from the persisted
artifact.json alone; no live event bus required.
The CLI resolves each agent's provider string through the adapter
registry (§6.11). Built-in registrations: openai, anthropic, and
— when --script is given — fake. Plug your own adapter in by
registering a factory before the run.
Re-running a session
symposium replay (above) is the §7.5 transcript_replay — it
re-renders the stored canonical_transcript and is byte-identical
unconditionally (no model call). symposium execution-replay is the
§7.6 execution_replay — it re-runs the orchestrator against the
original problem_statement / Config to regenerate a fresh transcript,
and is reproducible only when every non-deterministic source is pinned
(the ten pinning conditions of §7.6: runtime, adapter, provider,
model, sampling, cache, tool_env, wallclock, persona, transcript_prefix).
symposium execution-replay runs/demo-walking-skeleton-001 \
--script examples/scripts/walking-skeleton.json \
--output runs/
# → runs/demo-walking-skeleton-001-replay/ (fresh run, distinct session id)
# → digest=match | digest=MISMATCH (first_divergence=…)
Before touching the runtime it checks every pinning condition decidable
offline and aborts with a pinning_violation diagnostic (naming the
exact condition) on the first one that cannot be satisfied — §7.6
forbids silent best-effort replay. Exit codes: 0 digest match, 3
pinning violation, 4 digest mismatch, 1 any other error.
Reproducibility is conditional, not free (§7.8: replayable ≠
reproducible). Two runtime-allocated fields feed the digest but aren't
produced by the provider — Message.id (uuid4) and Message.timestamp
(wall-clock). execution-replay pins both to the values recorded in the
original transcript (§7.6 condition #8's fixed clock source + §9.4.1's
deterministic id allocator), so a deterministic FakeProvider run
reproduces its digest exactly — no special recording step required. A
re-execution that genuinely diverges (different content, count, or
routing) desyncs from the recorded sequence and reports a mismatch with
the first diverging message id, never a spurious match. A caller can
override the timestamp source with fixed_clock (a library knob).
Library use
from symposium import Config, FakeProviderScript
from symposium.providers import FakeProvider, default_registry
from symposium.scheduler import run_session
# Fake-driven: pass an explicit per-agent map
artifact = run_session(config, {"default": FakeProvider(script=script)},
runs_root="runs/")
# OpenAI-driven: build providers from the registry
providers = default_registry().build_session_providers(config)
artifact = run_session(config, providers, runs_root="runs/")
print(artifact.transcript_digest) # 64-hex JCS-SHA-256 digest
print(artifact.outcome.kind) # "synthesis" or "termination"
# §7.6 execution_replay — re-execute under the ten pinning conditions and
# compare the fresh digest. ids/timestamps are replayed from the recording,
# so a deterministic run reproduces its digest with no extra setup.
from symposium.replay import execution_replay, PinningViolation
try:
result = execution_replay("runs/" + config.session_id,
providers={"default": FakeProvider(script=script)})
print(result.digest_matches) # True — every pinning condition satisfied
print(result.conditions_checked, result.conditions_assumed)
except PinningViolation as exc:
print("aborted on §7.6 condition:", exc.condition)
Use in Claude Code (MCP server)
Symposium ships an optional MCP server that exposes the runtime as
tools, so a Claude client (Claude Code, Claude Desktop, claude.ai) can
launch a structured deliberation and read back its result, replay status,
and metrics — over the same run_session(...) API, with no changes to the
runtime or the protocol.
# Install with the optional MCP extra
pip install "symposium-protocol[mcp]"
# …or from the released tag:
pip install "symposium-protocol[mcp] @ git+https://github.com/terrordrummer/symposium@v1.6.0"
# Register the stdio server with Claude Code
claude mcp add symposium -- symposium-mcp
For Claude Desktop, add the server to your mcpServers config
(claude_desktop_config.json). Set ANTHROPIC_API_KEY (or
OPENAI_API_KEY) in env when you want real-provider deliberations; omit
it for fake-driven, deterministic runs:
{
"mcpServers": {
"symposium": {
"command": "symposium-mcp",
"env": { "ANTHROPIC_API_KEY": "sk-ant-..." }
}
}
}
The server exposes six tools:
deliberate(problem, …)— build aConfigfrom arguments (panel persona ids resolved into inline personas exactly as the CLI does), run a session, and return{outcome, synthesis_answer | termination_reason, selected_agents, transcript_digest, cumulative_usage, run_dir, rounds}.deliberate_streaming(problem, …)— same arguments and same final result asdeliberate, but streams each turn live as the panel produces it (every agent turn, each coordinator verdict, the final synthesis) via MCP progress + log notifications, so you can follow the discussion as it evolves instead of waiting for the whole session.deliberate_adaptive(problem, *, experts=None, max_expansions=2, …)— deliberate with dynamic agent generation. Early-start: each capability inexperts(free-text needs) becomes a generated domain persona added to the panel before the first session. Runtime: if a session terminates asking for help (user_input_required/external_research_required), a persona is generated for that need and the deliberation continues in a fresh session with the augmented panel (up tomax_expansions). Returns{final, sessions, generated_agents, expansions, panel_final}. Host-orchestrated over the frozen runtime.generate_persona(need, …)— design one new expertPersonafor a capability gap (constrained to thePersonaJSON Schema, validated) and return it, to use as apanelmember.get_run_summary(run_dir)— load a persisted run, recompute the §7.9 metrics, verify the §7.5 transcript replay, and return a compact summary.list_personas()— the six built-in personas (R3 default panel + coordinator) to use aspanel/coordinatorarguments.
A typical deliberate call from a Claude client:
// default: route each persona across the installed terminal CLIs — NO API
// key (provider="cli-auto"): visionary → codex, the rest → claude, with
// fallback to whichever CLI is installed
deliberate(problem="Should we adopt a structured deliberation protocol?")
// force a single terminal CLI for all agents
deliberate(problem="…", provider="claude-cli") // or "codex-cli"
// real HTTP API instead (reads ANTHROPIC_API_KEY from the env)
deliberate(problem="…", provider="anthropic")
// deterministic, network-free (used by the tests and demos)
deliberate(
problem="demo",
provider="fake",
fake_script_path="examples/scripts/walking-skeleton.json"
)
No API key needed. The default provider="cli-auto" runs each panel
turn through a locally-installed terminal CLI, reusing its existing login
(OAuth/keychain) — no ANTHROPIC_API_KEY/OPENAI_API_KEY. It routes by
persona — the lateral/creative visionary to codex-cli
(codex exec --output-schema …), the technical/systematic personas
(logician, engineer, researcher, critic, coordinator) to claude-cli
(claude -p --output-format json --json-schema …) — and falls back to
whichever CLI is actually installed (only claude installed → the whole
panel runs on claude, and vice-versa). Force one CLI with
provider="claude-cli" / "codex-cli".
Billing. When a CLI is logged in with a subscription (Claude
Pro/Max for claude, a ChatGPT plan for codex), turns run against that
subscription's usage and rate limits — not metered, per-token API
billing. There is no separate dollar charge to an API account; you are
spending subscription quota, so a full panel (≈ one call per turn) and
especially deliberate_adaptive (multiple linked sessions) consume that
quota faster and can hit plan limits. The cost_usd Symposium records for
a CLI turn is an API-equivalent reference (what the tokens would cost
at API rates), reported as estimated — not a bill. (Only if a CLI is
authenticated via an API key instead of a subscription login is the
usage metered.) Use provider="fake" for free, deterministic, offline
demos. The HTTP adapters (anthropic, openai) call the metered API and
do read an API key. Both CLI providers also work from the plain CLI:
provider: claude-cli / codex-cli in a config's agents.
The mcp dependency is optional: import symposium and the symposium
CLI work without it. See symposium/integrations/mcp_server.py.
What's in this repo
.
├── docs/
│ ├── specification.md # The protocol (normative, ~6440 lines)
│ ├── repository-strategy.md # Reference-impl conventions (non-normative)
│ └── schemas/v1.0.0/ # 16 JSON Schemas (Draft 2020-12)
│ └── examples/ # 28 positive + 36 negative fixtures + validators
├── symposium/ # Reference Python runtime
│ ├── models.py # Pydantic models mirroring the JSON Schemas
│ ├── providers/ # ProviderAdapter + registry + Fake/OpenAI/Anthropic/Claude-CLI/Codex-CLI adapters
│ ├── selector/ # §4.1 selector: fixed / rules / llm strategies
│ ├── scheduler/ # §4.11 pseudocode → executable loop
│ ├── storage/ # Run directory layout + JCS digest
│ ├── replay/ # transcript_replay (§7.5) + execution_replay (§7.6)
│ ├── observability/ # §7.9 MVP metric set (offline)
│ ├── personas/ # MVP default panel (R3)
│ ├── integrations/ # Host integrations — MCP server (`symposium-mcp`)
│ └── cli/ # `symposium` command
├── examples/ # Walking-skeleton + rules/llm selector configs + scripts
├── tests/ # pytest suite (FakeProvider determinism,
│ # scheduler invariants, e2e schema
│ # validation, replay byte-identity)
├── pyproject.toml
├── .github/workflows/ # validate (CI) + release (publish on tag)
├── CONTRIBUTING.md
├── ROADMAP.md # thin pointer to spec §12 (normative roadmap)
├── LICENSE # Apache 2.0
└── README.md
What's normative: docs/specification.md §1–§9 + the JSON Schemas
under docs/schemas/v1.0.0/. A conformant Symposium runtime satisfies
every MUST / MUST NOT there and validates against the schemas. Sections
§10–§13 are positioning, integration, roadmap, and vision (non-binding).
§14 is a thin pointer to the non-normative companion.
What's reference, not normative: everything under symposium/,
examples/, and tests/. The Python package is one valid implementation
of the protocol; a different runtime in a different language is equally
valid as long as it conforms to the spec.
Conformance check
Two validators ship with the schemas. Any contributor or implementor can re-run them locally:
cd docs/schemas/v1.0.0/examples
pip install "jsonschema==4.26.0" "referencing>=0.35" "rfc8785>=0.1.4"
python3 validate.py # 28/28
python3 validate_negative.py # 36/36
The reference runtime's own test suite (pytest) cross-checks the artifact it emits against those same schemas:
pip install -e ".[test]"
pytest -q
CI runs both on every push and every pull request (see badge above).
Reading order
If you only want the gist, the first 200 lines of the spec are enough: §1 (conformance surface), §2 (vocabulary), §3 (overview + non-goals).
If you intend to implement: §1 → §2 → §4 (runtime + scheduler) → §5 (schemas) → §6 (provider/tool adapter contract) → §7 (persistence + replay) → §8 (budget + failure + security) → §9 (testing harness). §4.11 is the canonical pseudocode.
If you want to compare against existing frameworks: §10 covers AutoGen, CrewAI, LangGraph, and OpenAI Agents SDK.
Status
v1.0 — specification frozen 2026-05-26. Ratified by joint adversarial
review (10 passes, bilateral sign-off). The 16 JSON Schemas under
docs/schemas/v1.0.0/ are pinned at this version. Forward-compatible
changes will publish under docs/schemas/v1.1.0/ etc., per the
versioning policy in §5.1.
Issues, errata, and discussion: use the GitHub issue tracker.
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file symposium_protocol-1.10.0.tar.gz.
File metadata
- Download URL: symposium_protocol-1.10.0.tar.gz
- Upload date:
- Size: 313.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
858d2f6801e48a478a13740662eede5fd2dd8c1f76eace6088f6a12bb7818d78
|
|
| MD5 |
a535a4fdb0cbdfc448b91ef70179b7ed
|
|
| BLAKE2b-256 |
228a4af5879f2cbdd260cdae78d85b6d33a5dbda2b16b4c1ef0776586d905274
|
Provenance
The following attestation bundles were made for symposium_protocol-1.10.0.tar.gz:
Publisher:
release.yml on terrordrummer/symposium
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
symposium_protocol-1.10.0.tar.gz -
Subject digest:
858d2f6801e48a478a13740662eede5fd2dd8c1f76eace6088f6a12bb7818d78 - Sigstore transparency entry: 1635294368
- Sigstore integration time:
-
Permalink:
terrordrummer/symposium@7c5215e14fd6c2cad4bef80d61afb0a88fa80323 -
Branch / Tag:
refs/tags/v1.10.0 - Owner: https://github.com/terrordrummer
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@7c5215e14fd6c2cad4bef80d61afb0a88fa80323 -
Trigger Event:
push
-
Statement type:
File details
Details for the file symposium_protocol-1.10.0-py3-none-any.whl.
File metadata
- Download URL: symposium_protocol-1.10.0-py3-none-any.whl
- Upload date:
- Size: 131.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6ee90357a6237c0ed02c6095266855d162c96ae88465260defa923ae6939e855
|
|
| MD5 |
05eb9814ade2075489daf10a1e182df1
|
|
| BLAKE2b-256 |
8ba649866293f21b3f26d030e98f44304a6858997c4b3a1cef29c4126a8baf0e
|
Provenance
The following attestation bundles were made for symposium_protocol-1.10.0-py3-none-any.whl:
Publisher:
release.yml on terrordrummer/symposium
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
symposium_protocol-1.10.0-py3-none-any.whl -
Subject digest:
6ee90357a6237c0ed02c6095266855d162c96ae88465260defa923ae6939e855 - Sigstore transparency entry: 1635294387
- Sigstore integration time:
-
Permalink:
terrordrummer/symposium@7c5215e14fd6c2cad4bef80d61afb0a88fa80323 -
Branch / Tag:
refs/tags/v1.10.0 - Owner: https://github.com/terrordrummer
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@7c5215e14fd6c2cad4bef80d61afb0a88fa80323 -
Trigger Event:
push
-
Statement type: