Skip to main content

Houmao runtime package for realm controller and CAO launcher workflows

Project description

Houmao

A framework and CLI toolkit for orchestrating teams of loosely-coupled AI agents.

Current Status

Houmao is under active development. The operator-facing workflow is stabilizing around the houmao-mgr + houmao-server pair, with local_interactive (tmux-backed) as the primary backend. Expect interface changes while the core runtime, gateway, and mailbox contracts continue to harden.

Project Introduction

Project docs: https://igamenovoer.github.io/houmao/

What It Is

Houmao is a framework and CLI toolkit designed to orchestrate teams of loosely-coupled, CLI-based AI agents.

Name Origin: Houmao (猴毛, "monkey hair") is inspired by the classic tale Journey to the West. Just as Sun Wukong (The Monkey King) plucks strands of his magical hair to create independent, capable clones of himself, this framework allows you to multiply your capabilities by spinning up numerous autonomous helpers.

Unlike traditional orchestration models where an "agent" is merely an in-process object graph, Houmao treats each agent as a first-class citizen. Every agent is a dedicated, real CLI process (such as codex, claude, or gemini) operating with its own isolated disk state, memory, and native user experience.

The Core Idea (What We Avoid)

The core idea is to avoid a hard-coded orchestration model.

Instead of shipping a fixed “agent graph” runtime (LangGraph / AutoGen-style orchestration), Houmao treats a team as a set of independently runnable CLI agents and provides lightweight primitives to construct, start, and manage them, while keeping “how the team coordinates” flexible and context-driven.

What The Framework Provides

  • Zero-setup adoption: wrap any running claude, codex, or gemini session with houmao-mgr agents join — no configuration, no restart. You keep your familiar coding-agent workflow and gain management, coordination, and team features on top.
  • Construction (when you need it): build agent runtimes from tool setups + skills + roles (and optional presets) for reproducible, declarative agent setups.
  • Management: start/resume/prompt/stop agents with houmao-mgr (typically tmux-backed so you can attach and interact).
  • Team communication: a shared gateway and mailbox plane for groups of agents (built on Houmao's own gateway service).

Why This Is Useful (Benefits)

  • Near-zero learning curve: agents join lets you start with what you already know — your familiar coding agent in a terminal — and add Houmao's management layer only when you need it.
  • Low barrier to composition: assemble new agent teams from human-like instruction packages (skills + roles) and tool profiles, without designing rigid contracts up front.
  • Flexible team contracts: coordination choices can change with context because the framework does not impose a fixed graph or flow.
  • Transparent per-agent UX: each agent is a real CLI process; you can attach to its tmux window/session to see what it’s doing and interact with its native TUI when needed.
  • Full tool surface area: the system operates the same terminal/TUI interface you do, so every native capability remains usable (and you can always take over manually if automation hits an unexpected prompt).

Typical Use Cases

  • Parallel specialist agents: run a "coder" agent and a "reviewer" agent side by side on the same repo — each with a different role and tool — so one writes while the other critiques.
  • Optimization loops: set up a coder agent that implements changes and a profiler agent that benchmarks them, iterating back and forth without manual handoff.
  • Team agent presets: give every team member the same pre-configured agent lineup (same models, skills, and roles) checked into the repo, without sharing anyone's API keys.
  • Swap the AI, keep the workflow: change which model or CLI tool an agent uses without touching its role prompt or the task it is working on.

How Agents Join Your Workflow

  • Adopt an existing session (recommended): start your CLI tool (claude, codex, or gemini) in a tmux session the way you normally would, then run houmao-mgr agents join --agent-name <name> from inside that session. Houmao wraps the running process with its management envelope — registry, gateway, prompt/interrupt, mailbox — without restarting the tool. Zero agent-definition setup required. This is the recommended starting point because there is nothing new to learn: you keep your familiar coding-agent workflow and layer Houmao management on top.
  • Managed launch (full control): for teams that need reproducible, declarative agent setups, construct from tool setups + skills + roles/presets, then start/resume/prompt/stop via houmao-mgr agents launch. This path builds an isolated runtime home with projected configs, skills, and credentials.
  • Bring-your-own process with launch options: you can also start the underlying CLI tool manually (for example via the generated launch_helper_path from build-brain) and then use agents join with --launch-args and --launch-env to record enough state for later agents relaunch.

Installation

uv tool install houmao

For development from source:

pixi install
pixi shell

tmux (required)

The primary backend (local_interactive) runs each agent CLI inside a tmux session. Ensure tmux is installed:

command -v tmux

Usage Guide

Recommended starting point: if you already use a coding agent (claude, codex, or gemini) in a terminal, jump to Section 1 — Quick Start: agents join. It takes about 30 seconds and requires no agent-definition setup.

CLI Entry Points

Entrypoint Purpose Status
houmao-mgr Primary operator CLI — build, launch, prompt, stop, server control Active
houmao-server Houmao-owned REST server for multi-agent coordination In development — not ready for use
houmao-passive-server Registry-driven stateless server for distributed agent coordination In development — not ready for use
houmao-cli Legacy build/start/prompt/stop entrypoint Deprecated — use houmao-mgr
houmao-cao-server Legacy CAO server launcher Deprecated — exits with migration guidance
houmao-mgr --help
houmao-server --help

1. Quick Start: Adopt an Existing Session (agents join)

The fastest way to bring an agent under Houmao management. No agent-definition directory, no brain build, no config projection — just wrap a running CLI tool with the full management envelope.

sequenceDiagram
    participant U as You (terminal)
    participant T as tmux session
    participant P as Provider CLI<br/>(claude / codex / gemini)
    participant H as houmao-mgr

    U->>T: tmux new-session -s my-agent
    U->>P: claude (or codex / gemini)
    Note over P: Provider TUI is running<br/>in window 0, pane 0
    U->>H: houmao-mgr agents join<br/>--agent-name my-agent
    H->>T: Detect provider from<br/>pane 0 process tree
    H-->>H: Create placeholder manifest,<br/>attach gateway,<br/>publish to registry
    H-->>U: ✓ Joined as "my-agent"
    Note over U,H: Full management now available
    U->>H: houmao-mgr agents prompt<br/>--agent-name my-agent<br/>--prompt "explain this file"
    U->>H: houmao-mgr agents state<br/>--agent-name my-agent
    U->>H: houmao-mgr agents stop<br/>--agent-name my-agent

Step-by-step:

# 1. Create a tmux session and start your CLI tool normally
tmux new-session -s my-agent
claude                          # or: codex, gemini

# 2. From a second terminal pane (inside the SAME tmux session), join
houmao-mgr agents join --agent-name my-agent

# 3. Now you can use the full management surface:
houmao-mgr agents state   --agent-name my-agent   # transport-neutral summary state
houmao-mgr agents prompt  --agent-name my-agent --prompt "explain this repo"
houmao-mgr agents stop    --agent-name my-agent   # graceful shutdown

Tip: agents join auto-detects the provider (claude_code, codex, or gemini_cli) from the process tree in window 0 / pane 0. If detection fails, pass --provider <name> explicitly.

What You Get After Joining

Once agents join completes, the adopted session has the same management capabilities as a fully managed agents launch session:

Capability Command
Query transport-neutral summary state houmao-mgr agents state --agent-name <name>
Inspect raw gateway-owned TUI tracking (when attached) houmao-mgr agents gateway tui state --agent-name <name>
Send a semantic prompt houmao-mgr agents prompt --agent-name <name> --prompt "…"
Interrupt a running turn houmao-mgr agents interrupt --agent-name <name>
Attach to a gateway houmao-mgr agents gateway attach --agent-name <name>
Send / receive mailbox messages houmao-mgr agents mail send --agent-name <name>
Stop the agent houmao-mgr agents stop --agent-name <name>

The only difference: a joined agent has a placeholder brain manifest (no skills/configs were projected), and relaunch support depends on whether you provided --launch-args at join time.

2. Easy Specialists (project easy)

For a reusable, named agent without learning the full agent-definition-directory layout, use the easy specialist workflow. This is the natural next step after agents join.

# Initialize a project overlay (one-time)
houmao-mgr project init

# Create a specialist (bundles role + tool + auth into one named definition)
houmao-mgr project easy specialist create \
  --name my-coder --tool claude \
  --api-key sk-ant-... \
  --system-prompt "You are a Python backend developer."

# Launch an instance
houmao-mgr project easy instance launch \
  --specialist my-coder --name my-coder

# Use the full management surface
houmao-mgr agents prompt --agent-name my-coder --prompt "explain main.py"
houmao-mgr project easy instance get --name my-coder
houmao-mgr project easy instance stop --name my-coder

Specialists persist under .houmao/ and survive across sessions. See the Easy Specialists guide for the full model (specialist vs preset, lifecycle, storage layout, and management commands).

3. Full Preset Launch

For teams that need full control over roles, skills, presets, and tool configurations, use the preset-backed launch path. See Agent Definitions for the complete agent-definition-directory layout.

houmao-mgr agents launch --agents gpu-kernel-coder --provider claude_code
houmao-mgr agents prompt --agent-name gpu-kernel-coder --prompt "Review the latest commit"
houmao-mgr agents stop --agent-name gpu-kernel-coder

For a runnable end-to-end example, see scripts/demo/minimal-agent-launch/.

Runnable Demos

The repository ships two maintained runnable demos under scripts/demo/:

  • minimal-agent-launch/ — Preset-backed headless launch with Claude or Codex. Shows the full build → launch → prompt → stop cycle and records reproducible outputs.

    scripts/demo/minimal-agent-launch/scripts/run_demo.sh --provider claude_code
    
  • single-agent-mail-wakeup/ — Easy specialist + gateway + mailbox-notifier wake-up. Creates a specialist, attaches a gateway, enables mail-notifier polling, and verifies the agent wakes up on incoming mail. See the demo README for stepwise commands.

    scripts/demo/single-agent-mail-wakeup/run_demo.sh auto --tool claude
    

Subsystems at a Glance

Subsystem Description Docs
Gateway Per-agent FastAPI sidecar for session control, request queue, and mail facade Gateway Reference
Mailbox Unified async message transport — filesystem and Stalwart JMAP backends Mailbox Reference
TUI Tracking State machine, detectors, and replay engine for tracking agent TUI state TUI Tracking Reference

System Skills: Agent Self-Management

Houmao installs packaged skills into agent tool homes so that agents themselves can drive management tasks through their native skill interface — without the operator manually invoking houmao-mgr. This means an agent can create specialists, manage credentials, inspect definitions, and control other agent instances autonomously.

Skill What it enables
houmao-manage-specialist Create, list, inspect, and remove project-local specialist definitions
houmao-manage-credentials Add, update, inspect, and remove project-local tool auth bundles
houmao-manage-agent-definition List, inspect, initialize, update, and remove roles and presets
houmao-manage-agent-instance Launch, list, inspect, stop, and clean up managed agent instances

agents join and agents launch auto-install these skills by default. To install them into an external tool home manually:

houmao-mgr system-skills install --tool claude --home ~/.claude --default

See the System Skills reference for the full catalog, named sets, and install options.

Full Documentation

Complete reference, guides, and developer docs are published at igamenovoer.github.io/houmao.

Development

pixi run format              # ruff format
pixi run lint                # ruff check
pixi run typecheck           # mypy --strict
pixi run test-runtime        # runtime-focused test suites
pixi run docs-serve          # local docs site with live reload

Legacy note: Houmao was originally inspired by CAO (CLI Agent Orchestrator). Legacy houmao-cli, houmao-cao-server, and cao_rest backend paths are deprecated — use houmao-mgr, houmao-server, and local_interactive instead.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

houmao-0.3.1.tar.gz (910.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

houmao-0.3.1-py3-none-any.whl (1.1 MB view details)

Uploaded Python 3

File details

Details for the file houmao-0.3.1.tar.gz.

File metadata

  • Download URL: houmao-0.3.1.tar.gz
  • Upload date:
  • Size: 910.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for houmao-0.3.1.tar.gz
Algorithm Hash digest
SHA256 996bdbe09dfe12d95b70635496fd876532867d2648d9e1b395b819add20e6144
MD5 ee05d62982516451c09107e151125ff2
BLAKE2b-256 1f93a03bbcad337d651605d3e8d34e28ff7e769eed80c92b9e73dc13928c2aa6

See more details on using hashes here.

Provenance

The following attestation bundles were made for houmao-0.3.1.tar.gz:

Publisher: pypi-release.yml on igamenovoer/houmao

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file houmao-0.3.1-py3-none-any.whl.

File metadata

  • Download URL: houmao-0.3.1-py3-none-any.whl
  • Upload date:
  • Size: 1.1 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for houmao-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 ae9ec12aa30359c16c5d2784e3450e0964cb18313a17fc335ac141464e8593ee
MD5 4061415628b89362fb8d5a8bf754b91a
BLAKE2b-256 c59505a6c3179ba76553b29c5ecfd228e8d9947cbc6dbe2c31d9b0f6b6cd7702

See more details on using hashes here.

Provenance

The following attestation bundles were made for houmao-0.3.1-py3-none-any.whl:

Publisher: pypi-release.yml on igamenovoer/houmao

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page