Skip to main content

Terminal UI for managing multiple Claude sessions with tmux

Project description

Cerberus

Cerberus is an AI coding that allows you to quickly design good software. It allows you to iterate on specs for what you want to build, and then delegate work in parallel to agents building your software in an overseeable way..

The best coding experiences understand the tango of deference between human and AI thinking.

Humans excel at:

  • Articulating vision and requirements
  • Making architectural decisions
  • Providing context and judgment

AI agents excel at:

  • Implementing well-specified tasks
  • Exploring codebases
  • Handling repetitive transformations

Cerberus is a multi-agent development environment that lets you focus on the creative work of software design while coordinating swarms of AI agents to handle implementation. You provide the vision, Cerberus manages the execution.

How It Works

Designer and Executor Sessions

Main Session (Designer): Your primary workspace lives in your source code directory. The designer agent discusses with you, understands your requirements, and orchestrates the work.

Executor Sessions: When you define a task, the designer spawns executor agents in isolated git worktrees. Each executor works independently on their assigned task, reaching out to you only when clarification is needed.

The Orchestrator UI

Cerberus provides a unified interface to manage your agent swarm:

  • Session List: View all active designer and executor sessions
  • Diff View: See changes in real-time as agents work
  • Monitor Tab: Watch automated summaries of agent activities
  • Keyboard Controls:
    • s - Open spec editor for the selected session (designer gets notified on save)
    • t - Open terminal in the selected session's worktree
    • N - Create new session
    • D - Delete session
    • Arrow keys - Navigate sessions

Workflow

  1. Design: Discuss your ideas with the designer agent. Use the spec editor (s) to sketch out plans.

  2. Delegate: The designer spawns executor agents with clear task specifications.

  3. Monitor: Watch executors work through the UI. They'll reach out if they need guidance.

  4. Review: Jump into executor worktrees (t) to test changes, or view diffs before merging.

  5. Integrate: Merge completed work back into your main branch.

Executor Communication Examples

Executors are trained to proactively report issues back to the designer. Here are real-world scenarios showing when and how executors should communicate:

Example 1: Missing Dependencies

Scenario: Executor tries to run tests but pytest is not installed.

Executor thinks: "I need to run tests but pytest isn't available. I should report this immediately."

Executor sends: send_message_to_session(
  session_id="main",
  message="ERROR: Cannot run tests - 'pytest' is not installed. Should I install it with pip, or would you prefer a different testing approach?"
)

Designer responds: "Yes, go ahead and install pytest. We use it for all Python tests."

Example 2: Build Failures

Scenario: Executor makes changes but the build fails with existing TypeScript errors.

Executor thinks: "The build is failing, but it's due to pre-existing errors, not my changes. I need guidance."

Executor sends: send_message_to_session(
  session_id="main",
  message="ERROR: Build failed with TypeScript errors in 3 files (user.ts, auth.ts, api.ts). These errors existed before my changes. Should I fix them first or implement my task differently?"
)

Designer responds: "Those are known issues. Focus on your task and we'll fix those separately."

Example 3: Ambiguous Requirements

Scenario: Instructions mention a component that doesn't exist in the codebase.

Executor thinks: "The instructions reference 'UserService' but I can't find it. Better ask than guess."

Executor sends: send_message_to_session(
  session_id="main",
  message="QUESTION: Instructions say to extend 'UserService' but I cannot find this class in the codebase. Did you mean 'AuthService' or should I create a new UserService?"
)

Designer responds: "Sorry, my mistake - I meant AuthService. Please extend that."

Example 4: Successful Completion

Scenario: Executor completes the task successfully.

Executor sends: send_message_to_session(
  session_id="main",
  message="COMPLETE: Added rate limiting to all API endpoints. Implemented using Redis with 100 requests/minute limit. All 23 existing tests pass, added 8 new tests for rate limiting behavior. Ready for review."
)

Key Takeaway: Executors should err on the side of over-communication. It's better to ask a question that seems obvious than to waste time or implement incorrectly.

Architecture

Git Worktrees + Docker Isolation

Cerberus combines git worktrees with Docker containerization:

Worktrees (Visible on Host):

  • Each executor gets its own branch and working directory at ~/.kerberos/worktrees/
  • Changes are tracked independently
  • No conflicts between concurrent agent work
  • Easy to review, test, and merge completed tasks
  • Worktrees are visible and editable in your editor

Docker Containers (Isolated Execution):

  • Each session runs in its own Docker container
  • Agent commands execute in isolation with mounted worktree
  • Two modes:
    • Unpaired (default): Agent only accesses worktree
    • Paired (opt-in): Agent also accesses source project
  • Provides command isolation while keeping files accessible

See DOCKER.md for detailed Docker architecture.

Communication (MCP):

  • Designers spawn executors with detailed instructions
  • Executors can message back with questions or completion status
  • Automated monitoring tracks agent activity

Getting Started

# Launch the Cerberus orchestrator
cerb

# The UI will open with a main designer session
# Press 's' to open the spec editor and start planning
# The designer will spawn executors as needed

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cerb_code-0.1.13.tar.gz (43.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cerb_code-0.1.13-py3-none-any.whl (40.6 kB view details)

Uploaded Python 3

File details

Details for the file cerb_code-0.1.13.tar.gz.

File metadata

  • Download URL: cerb_code-0.1.13.tar.gz
  • Upload date:
  • Size: 43.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.0

File hashes

Hashes for cerb_code-0.1.13.tar.gz
Algorithm Hash digest
SHA256 9eff2132a4d931533b4860b3ee4b44b42aba1f72b8b2e75053d89070ea3ee2c9
MD5 a24b7e25aea70373c0c8430eb393f7d7
BLAKE2b-256 7feee880168217a034ce29b4e521c1cbab2c1e0f74c264070ce67cfb2b3e3550

See more details on using hashes here.

File details

Details for the file cerb_code-0.1.13-py3-none-any.whl.

File metadata

  • Download URL: cerb_code-0.1.13-py3-none-any.whl
  • Upload date:
  • Size: 40.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.0

File hashes

Hashes for cerb_code-0.1.13-py3-none-any.whl
Algorithm Hash digest
SHA256 a58507371b3c62571a51b18eeeee8fd9ff4c3aa82fe9a28da07d255768e7e606
MD5 047959fe58e01adc964354a61235b988
BLAKE2b-256 5e1a5c51f53b8746dbef61c9978ba83413bdcb656021089bb6f5f1e5754ca462

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page