Skip to main content

Terminal UI for managing multiple Claude sessions with tmux

Project description

Cerberus

Cerberus is an AI coding that allows you to quickly design good software. It allows you to iterate on specs for what you want to build, and then delegate work in parallel to agents building your software in an overseeable way..

The best coding experiences understand the tango of deference between human and AI thinking.

Humans excel at:

  • Articulating vision and requirements
  • Making architectural decisions
  • Providing context and judgment

AI agents excel at:

  • Implementing well-specified tasks
  • Exploring codebases
  • Handling repetitive transformations

Cerberus is a multi-agent development environment that lets you focus on the creative work of software design while coordinating swarms of AI agents to handle implementation. You provide the vision, Cerberus manages the execution.

How It Works

Designer and Executor Sessions

Main Session (Designer): Your primary workspace lives in your source code directory. The designer agent discusses with you, understands your requirements, and orchestrates the work.

Executor Sessions: When you define a task, the designer spawns executor agents in isolated git worktrees. Each executor works independently on their assigned task, reaching out to you only when clarification is needed.

The Orchestrator UI

Cerberus provides a unified interface to manage your agent swarm:

  • Session List: View all active designer and executor sessions
  • Diff View: See changes in real-time as agents work
  • Monitor Tab: Watch automated summaries of agent activities
  • Keyboard Controls:
    • s - Open spec editor for the selected session (designer gets notified on save)
    • t - Open terminal in the selected session's worktree
    • N - Create new session
    • D - Delete session
    • Arrow keys - Navigate sessions

Workflow

  1. Design: Discuss your ideas with the designer agent. Use the spec editor (s) to sketch out plans.

  2. Delegate: The designer spawns executor agents with clear task specifications.

  3. Monitor: Watch executors work through the UI. They'll reach out if they need guidance.

  4. Review: Jump into executor worktrees (t) to test changes, or view diffs before merging.

  5. Integrate: Merge completed work back into your main branch.

Executor Communication Examples

Executors are trained to proactively report issues back to the designer. Here are real-world scenarios showing when and how executors should communicate:

Example 1: Missing Dependencies

Scenario: Executor tries to run tests but pytest is not installed.

Executor thinks: "I need to run tests but pytest isn't available. I should report this immediately."

Executor sends: send_message_to_session(
  session_id="main",
  message="ERROR: Cannot run tests - 'pytest' is not installed. Should I install it with pip, or would you prefer a different testing approach?"
)

Designer responds: "Yes, go ahead and install pytest. We use it for all Python tests."

Example 2: Build Failures

Scenario: Executor makes changes but the build fails with existing TypeScript errors.

Executor thinks: "The build is failing, but it's due to pre-existing errors, not my changes. I need guidance."

Executor sends: send_message_to_session(
  session_id="main",
  message="ERROR: Build failed with TypeScript errors in 3 files (user.ts, auth.ts, api.ts). These errors existed before my changes. Should I fix them first or implement my task differently?"
)

Designer responds: "Those are known issues. Focus on your task and we'll fix those separately."

Example 3: Ambiguous Requirements

Scenario: Instructions mention a component that doesn't exist in the codebase.

Executor thinks: "The instructions reference 'UserService' but I can't find it. Better ask than guess."

Executor sends: send_message_to_session(
  session_id="main",
  message="QUESTION: Instructions say to extend 'UserService' but I cannot find this class in the codebase. Did you mean 'AuthService' or should I create a new UserService?"
)

Designer responds: "Sorry, my mistake - I meant AuthService. Please extend that."

Example 4: Successful Completion

Scenario: Executor completes the task successfully.

Executor sends: send_message_to_session(
  session_id="main",
  message="COMPLETE: Added rate limiting to all API endpoints. Implemented using Redis with 100 requests/minute limit. All 23 existing tests pass, added 8 new tests for rate limiting behavior. Ready for review."
)

Key Takeaway: Executors should err on the side of over-communication. It's better to ask a question that seems obvious than to waste time or implement incorrectly.

Architecture

Git Worktrees + Docker Isolation

Cerberus combines git worktrees with Docker containerization:

Worktrees (Visible on Host):

  • Each executor gets its own branch and working directory at ~/.kerberos/worktrees/
  • Changes are tracked independently
  • No conflicts between concurrent agent work
  • Easy to review, test, and merge completed tasks
  • Worktrees are visible and editable in your editor

Docker Containers (Isolated Execution):

  • Each session runs in its own Docker container
  • Agent commands execute in isolation with mounted worktree
  • Two modes:
    • Unpaired (default): Agent only accesses worktree
    • Paired (opt-in): Agent also accesses source project
  • Provides command isolation while keeping files accessible

See DOCKER.md for detailed Docker architecture.

Communication (MCP):

  • Designers spawn executors with detailed instructions
  • Executors can message back with questions or completion status
  • Automated monitoring tracks agent activity

Getting Started

# Launch the Cerberus orchestrator
cerb

# The UI will open with a main designer session
# Press 's' to open the spec editor and start planning
# The designer will spawn executors as needed

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cerb_code-0.1.17.tar.gz (44.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cerb_code-0.1.17-py3-none-any.whl (42.3 kB view details)

Uploaded Python 3

File details

Details for the file cerb_code-0.1.17.tar.gz.

File metadata

  • Download URL: cerb_code-0.1.17.tar.gz
  • Upload date:
  • Size: 44.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.0

File hashes

Hashes for cerb_code-0.1.17.tar.gz
Algorithm Hash digest
SHA256 8e246171af285608f838005fe5ac78c9f24f1a880dcc44f45db3b21289147dd6
MD5 7ac33bfce3f4c3ccb25178324a8ea672
BLAKE2b-256 1114cede8483462f5ccecf4aad2b1460a8c61f44af86864376402ae75bc91d6f

See more details on using hashes here.

File details

Details for the file cerb_code-0.1.17-py3-none-any.whl.

File metadata

  • Download URL: cerb_code-0.1.17-py3-none-any.whl
  • Upload date:
  • Size: 42.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.0

File hashes

Hashes for cerb_code-0.1.17-py3-none-any.whl
Algorithm Hash digest
SHA256 66a77468edbf088c96a020ab512002448020fb263318b7f8c658123917b5fd37
MD5 dd43a7654163d3a21d5c896ab5df5ab9
BLAKE2b-256 50b110a9fdcefbaf03a051f3acc5e892de82b30849a8f3929be1218fbdfc1c8e

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page