MCP server for multi-model AI discussions — cloud, agentic CLIs, local GPU models, soul memory

These details have not been verified by PyPI

Project links

Repository

Project description

Chitta Bridge

MCP server for multi-model AI discussions — works with Claude Code and Codex. Connect to any AI backend: cloud providers, agentic CLIs, and local GPU models.

Quick Start

uv pip install git+https://github.com/genomewalker/chitta-bridge.git

chitta-bridge-install               # both Claude Code + Codex
chitta-bridge-install claude-code   # Claude Code only
chitta-bridge-install codex         # Codex CLI only

Skills (Codex): /review, /rescue, /room, /soul — plus all mcp__chitta_bridge__* tools.

Features

Multiple backends: OpenCode, Codex CLI, and local GPU models (Ollama/vLLM)
Continuous sessions: Conversation history persists across messages
Session warmup: background ping captures session ID — subsequent calls skip cold start
Multiple models: OpenCode (GPT-5.x, Claude, Gemini) + Codex (o3, o4-mini, gpt-4.1)
Agent support: plan, build, explore, general agents (OpenCode)
Agentic execution: Full-auto mode with sandboxed file operations (Codex)
Variant control: Set reasoning effort (minimal to max)
File/image attachment: Share code files and images for context
Session continuity: Conversations continue across tool calls
Discussion rooms: async multi-agent roundtables — any mix of backends respond in parallel, see the full thread, synthesize into one answer

Installation

With uv (recommended)

uv pip install git+https://github.com/genomewalker/chitta-bridge.git

With pip

pip install git+https://github.com/genomewalker/chitta-bridge.git

From source

git clone https://github.com/genomewalker/chitta-bridge.git
cd chitta-bridge
pip install -e .

Register

chitta-bridge-install               # install for both Claude Code and Codex
chitta-bridge-install claude-code   # Claude Code only (registers MCP server)
chitta-bridge-install codex         # Codex CLI only (plugin + skills + MCP)
chitta-bridge-uninstall             # uninstall from both
chitta-bridge-uninstall codex       # uninstall from Codex only

Verify: claude mcp list (Claude Code) or check ~/.codex/plugins/ (Codex)

OpenCode Backend

Tool	Description
`opencode_start`	Start a new session (auto-warms up, captures session ID)
`opencode_discuss`	Send a message
`opencode_plan`	Start planning discussion
`opencode_brainstorm`	Open-ended brainstorming
`opencode_review`	Review code
`opencode_ping`	Check if model is reachable
`opencode_models`	List available models
`opencode_agents`	List available agents
`opencode_model`	Change session model
`opencode_agent`	Change session agent
`opencode_variant`	Change reasoning effort
`opencode_config`	Show current configuration
`opencode_configure`	Set defaults (persisted)
`opencode_history`	Show conversation history
`opencode_sessions`	List all sessions
`opencode_switch`	Switch to another session
`opencode_end`	End current session
`opencode_health`	Server health check

Discussion Rooms

Async multi-agent roundtable with agent souls — participants get persistent identity, memory, tools, and structured challenge rounds.

Basic Room

room_create(
    room_id="my-room",
    topic="What's the best way to design a cache invalidation strategy?",
    participants='[
        {"name":"Codex","backend":"codex","session_id":"codex-1"},
        {"name":"Gemini","backend":"opencode","session_id":"gemini-1"},
        {"name":"Llama","backend":"local","model":"qwen2.5:32b","base_url":"http://gpunode:11434/v1"}
    ]'
)

room_run(room_id="my-room", rounds=2)
room_synthesize(room_id="my-room")

Soul-Powered Room

Each participant can have a soul — a system prompt, memory namespace, tools, challenge bias, and response format:

room_create(
    room_id="expert-panel",
    topic="How should we authenticate ancient DNA from permafrost?",
    participants='[
        {"name":"Paleogenomicist","backend":"local","model":"qwen2.5:32b",
         "base_url":"http://gpunode:11434/v1",
         "soul":{
           "system_prompt":"You are a senior paleogenomicist with 15+ years experience...",
           "realm":"agent:paleogenomicist",
           "tools":["recall","remember","web_search","smart_context"],
           "max_tool_turns":2,
           "challenge_bias":0.7,
           "response_format":"### Key Points\\n### Tools & Thresholds\\n### Caveats"
         }},
        {"name":"Bioinformatician","backend":"local","model":"phi4:14b",
         "base_url":"http://gpunode:11434/v1",
         "soul":{
           "system_prompt":"You are a computational biologist specializing in pipelines...",
           "realm":"agent:bioinformatician",
           "tools":["recall","remember","smart_context"],
           "challenge_bias":0.4
         }}
    ]'
)

# Challenge mode: between rounds, a moderator extracts claims and
# forces participants to disagree, provide evidence, and refine
room_run(room_id="expert-panel", rounds=2, challenge=true)
room_synthesize(room_id="expert-panel")

Soul Features

Feature	Description
`system_prompt`	Agent identity, expertise, personality
`realm`	Chitta memory namespace — per-agent persistent memory
`tools`	Available tools (see Agent Tools below)
`max_tool_turns`	Max tool-use iterations per response (default 3)
`max_rounds`	Max discussion rounds, 0 = unlimited
`challenge_bias`	0 = agreeable, 1 = devil's advocate
`response_format`	Structured output template

Challenge Rounds

When challenge=true, a moderator automatically:

Extracts substantive claims from the previous round
Injects a challenge prompt requiring each participant to disagree with at least one claim
Forces evidence-based refinement instead of polite agreement

GPU Contention Handling

When multiple local models share the same GPU endpoint, rooms automatically run participants sequentially to avoid model-swap thrashing. Different endpoints run in parallel.

Room Tools

Tool	Description
`room_create`	Create a discussion room with named participants and optional souls
`room_add_participant`	Add a participant to an existing room
`room_run`	Run N rounds with optional challenge mode
`room_read`	Read the full transcript
`room_synthesize`	Distill the transcript — consensus, disagreements, best answer, open questions

Agent Tools

Tools available to soul-powered room participants via mediated XML tool calling. Assign a subset per agent via the tools field.

Memory (core)

Tool	Description
`recall`	Semantic vector search over agent's memory realm
`remember`	Store an insight or fact in agent's memory realm
`smart_context`	Task-aware context assembly (memories + code symbols + graph)

Memory (extended)

Tool	Description
`recall_keyword`	BM25 keyword search — best when exact terms are known
`recall_temporal`	Search memories from a specific time range (since/until)
`hybrid_recall`	Combined vector + BM25 search — best general-purpose recall
`5w_search`	Structured who/what/when/where/why search
`forget`	Remove a memory by similarity match

Web

Tool	Description
`web_search`	DuckDuckGo search, returns titles + URLs + snippets
`web_fetch`	Fetch a URL as plain text (HTML stripped, max 8000 chars)

File operations

Tool	Description
`read_file`	Read file with line numbers (offset/limit, capped at 500 lines)
`write_file`	Create or overwrite a file (auto-creates parent dirs)
`edit_file`	Targeted string replacement with context display
`glob`	Find files by glob pattern, sorted by modification time
`grep`	Regex search over file contents with context lines

Shell

Tool	Description
`bash`	Execute a shell command (sandboxed, 60s timeout, dangerous commands blocked)

Code intelligence (via chitta)

Tool	Description
`read_function`	Read a function's source code by name
`read_symbol`	Look up any code symbol (class, function, variable)
`search_symbols`	Search for code symbols matching a query
`codebase_overview`	High-level overview of codebase structure

Task tracking

Tool	Description
`todo_add`	Add a task to the agent's personal todo list
`todo_list`	List current todo items
`todo_done`	Mark a todo item as complete

Synthesis

After running a room, distill the full discussion into a single answer. Any backend can act as synthesizer — Claude (default), local GPU model, OpenCode, or Codex.

room_synthesize(room_id="my-room")

# Use a local model as synthesizer
room_synthesize(
    room_id="my-room",
    synthesizer='{"name":"Qwen3","backend":"local","model":"qwen3:30b-a3b","base_url":"http://gpunode:11434/v1"}'
)

Local Models (GPU Nodes)

Chat with local LLMs (Ollama / vLLM) running on GPU nodes — via Slurm auto-discovery or direct hostname.

# 1. Start Ollama on a Slurm GPU node (writes URL to /tmp/ollama-server-<model>.url)
slurm-serve-ollama.sh llama3.3:70b

# 2. Discover available nodes and models
local_discover()

# 3. Start a session (auto-discovers endpoint if omitted)
local_start(session_id="llm1", model="llama3.3:70b")

# 4. Chat
local_discuss(message="Explain cache invalidation strategies")

# Or specify node explicitly
local_start(session_id="llm2", model="qwen3:30b-a3b", endpoint="http://gpunode01:11434/v1")

Discovery order

/tmp/ollama-server-*.url cache files (written by slurm-serve-ollama.sh)
Your running Slurm GPU jobs (squeue --me)
CHITTA_BRIDGE_GPU_NODES=node1,node2 environment variable
localhost:11434 fallback

Tool	Description
`local_discover`	Find GPU nodes with Ollama/vLLM running
`local_start`	Start a session (auto-discovers endpoint)
`local_discuss`	Chat with the local model
`local_models`	List models available at an endpoint
`local_sessions`	List active local sessions
`local_switch`	Switch active session
`local_end`	End a session
`local_history`	Show conversation history
`local_health`	Health check

Web Search

Search the web and fetch pages directly from Claude Code — no API key needed (DuckDuckGo).

# Search
web_search(query="ancient metagenomics DNA damage authentication")

# Fetch a page
web_fetch(url="https://example.com/article", max_chars=12000)

Tool	Description
`web_search`	Search via DuckDuckGo — returns titles, URLs, snippets
`web_fetch`	Fetch a web page as plain text (HTML stripped)

Soul Memory (chittad)

Bidirectional memory bridge to the cc-soul daemon with realm-scoped memory. Each room participant can have its own memory namespace, and room discussions automatically pull relevant memories as context.

# Check if soul is running
soul_status()

# Recall memories (global or realm-scoped)
soul_recall(query="cache invalidation strategies", limit=5)

# Store a memory
soul_remember(content="Room discussion concluded X is better than Y", kind="episode")

# Smart context (memories + code symbols + graph)
soul_context(task="refactor authentication middleware")

Tool	Description
`soul_recall`	Search memories by query (supports realm scoping)
`soul_remember`	Store a new memory (supports realm scoping)
`soul_context`	Smart context assembly (memories + symbols + graph)
`soul_status`	Check if chittad is available

Discussion rooms automatically:

Seed agent realms on first turn — identity and topic stored for future recall
Inject soul context at creation — participants see relevant memories (code symbols filtered)
Store contributions back — each agent's response stored in their realm
Store synthesis back — room conclusions become soul episodes
Hybrid recall — vector + BM25 keyword matching for better memory retrieval

Codex Backend

Session tools

Tool	Description
`codex_start`	Start a new Codex session
`codex_discuss`	Send a message to Codex
`codex_run`	Run a one-off task (stateless, returns session ID)
`codex_model`	Change session model
`codex_config`	Show Codex configuration
`codex_configure`	Set Codex defaults (persisted)
`codex_history`	Show conversation history
`codex_sessions`	List all Codex sessions
`codex_switch`	Switch to another session
`codex_end`	End current session
`codex_health`	Codex health check

Review (normal + adversarial)

Tool	Description
`codex_review`	Code review with `mode` (normal/adversarial), `focus`, `--base`, `effort`, `background`, `sandbox`

Adversarial mode challenges design decisions, architecture, and tradeoffs instead of just finding bugs:

codex_review(mode="adversarial", focus="race conditions and data loss", base="main")
codex_review(mode="adversarial", background=True)  # returns job ID

Rescue (background job delegation)

Tool	Description
`codex_rescue`	Delegate a task to Codex — supports `background`, `resume_from`, `effort`, `fresh`, `sandbox`
`codex_job_status`	Check progress of background rescue jobs
`codex_job_result`	Get final output + Codex session ID for `codex resume`
`codex_job_cancel`	Cancel a running background job

# Start a background rescue
codex_rescue(task="investigate why the tests started failing", background=True)

# Check progress
codex_job_status()

# Get result (includes session ID for native Codex resume)
codex_job_result()

# Resume a previous session
codex_rescue(task="apply the fix", resume_from="SESSION_ID")

# Full access (network + filesystem)
codex_rescue(task="fetch and apply the upstream patch", sandbox="danger-full-access")

Codex Plugin for Codex CLI

chitta-bridge ships as a proper Codex plugin with skills and MCP tools:

chitta-bridge-install codex       # install
chitta-bridge-uninstall codex     # uninstall

This installs to ~/.codex/plugins/cache/local/chitta-bridge/local/ and enables:

Skills: /review, /rescue, /room, /soul
Tools: All mcp__chitta_bridge__* tools (soul memory, rooms, web, jobs)

Available Models

OpenCode

Provider	Models
openai	gpt-5.2-codex, gpt-5.1-codex-max, gpt-5.1-codex-mini
github-copilot	claude-opus-4.5, claude-sonnet-4.5, gpt-5, gemini-2.5-pro
opencode	gpt-5-nano (free), glm-4.7-free, grok-code

Run opencode models to see all available models.

Codex

Model	Description
o3	Default, high capability
o4-mini	Faster, lower cost
gpt-4.1	Alternative option

Configuration

Environment variables

# OpenCode
export OPENCODE_MODEL="openai/gpt-5.2-codex"
export OPENCODE_AGENT="plan"
export OPENCODE_VARIANT="medium"

# Codex
export CODEX_MODEL="o3"
export CODEX_SANDBOX="workspace-write"

Config file

~/.chitta-bridge/config.json:

{
  "model": "openai/gpt-5.2-codex",
  "agent": "plan",
  "variant": "medium",
  "codex_model": "o3",
  "codex_sandbox": "workspace-write"
}

OpenCode Variants (reasoning effort)

minimal -> low -> medium -> high -> xhigh -> max

Higher variants use more reasoning tokens for complex tasks.

Codex Sandbox Modes

Mode	Description
`read-only`	Can only read files
`workspace-write`	Can write to workspace (default)
`danger-full-access`	Full filesystem access (use with caution)

The full_auto option (default: true) enables low-friction execution with workspace-write sandbox.

Requirements

Python 3.10+
Claude Code or Codex CLI (or both)
OpenCode CLI for opencode_* tools
Ollama or vLLM on a GPU node for local_* tools

License

MIT

Project details

These details have not been verified by PyPI

Project links

Repository

Release history Release notifications | RSS feed

0.11.9

Apr 17, 2026

0.11.8

Apr 16, 2026

0.11.7

Apr 16, 2026

0.11.6

Apr 15, 2026

This version

0.11.2

Apr 4, 2026

0.11.1

Apr 4, 2026

0.9.0

Apr 2, 2026

0.8.1

Apr 1, 2026

0.8.0

Apr 1, 2026

0.7.0

Mar 31, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chitta_bridge-0.11.2.tar.gz (75.3 kB view details)

Uploaded Apr 4, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

chitta_bridge-0.11.2-py3-none-any.whl (80.8 kB view details)

Uploaded Apr 4, 2026 Python 3

File details

Details for the file chitta_bridge-0.11.2.tar.gz.

File metadata

Download URL: chitta_bridge-0.11.2.tar.gz
Upload date: Apr 4, 2026
Size: 75.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for chitta_bridge-0.11.2.tar.gz
Algorithm	Hash digest
SHA256	`3b323c14b9a9c8670793ae61e0377c4fdd252847ba3fe8c017d25b9a14cebb5e`
MD5	`1e64a66373df94195eca3a33425f00fa`
BLAKE2b-256	`5a26c2a0f1d7da982780344f0a8b52660be1e6bf86cce94d07ef05b03e369b8c`

See more details on using hashes here.

File details

Details for the file chitta_bridge-0.11.2-py3-none-any.whl.

File metadata

Download URL: chitta_bridge-0.11.2-py3-none-any.whl
Upload date: Apr 4, 2026
Size: 80.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for chitta_bridge-0.11.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`275a54c05e8ecba8e6b6e5cdf614ee63a300d6668bede109130c8c111d962b7b`
MD5	`2c49c2929e570bce78b189917870a2a3`
BLAKE2b-256	`c90495c0f8370643ee4151c89d23a2458e6de9f111d32ddc2c743749ce797dd3`

See more details on using hashes here.

chitta-bridge 0.11.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Chitta Bridge

Quick Start

Features

Installation

With uv (recommended)

With pip

From source

Register

OpenCode Backend

Discussion Rooms

Basic Room

Soul-Powered Room

Soul Features

Challenge Rounds

GPU Contention Handling

Room Tools

Agent Tools

Synthesis

Local Models (GPU Nodes)

Discovery order

Web Search

Soul Memory (chittad)

Codex Backend

Session tools

Review (normal + adversarial)

Rescue (background job delegation)

Codex Plugin for Codex CLI

Available Models

OpenCode

Codex

Configuration

Environment variables

Config file

OpenCode Variants (reasoning effort)

Codex Sandbox Modes

Requirements

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes