A governed agent runtime for local-first, private-first, embeddable AI agents.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

Agentao (Agent + Tao)

   ___                      _
  / _ \ ___ _ ___  ___  ___| |_  ___  ___
 /  _  // _` / -_)| _ \/ _ \  _|/ _` / _ \
/_/ |_| \__, \___||_// \___/\__|\__,_\___/
        |___/        (The Way of Agents)

"Order in Chaos, Path in Intelligence."

Agentao is a Governed Agent Runtime for local-first, private-first, embeddable AI agents.

It helps teams build, embed, and extend AI agents with governance, permissions, protocols, memory, plugins, and multi-session control.

"Tao" (道) represents the underlying Laws, Methods, and Paths that govern all things. In Agentao, it is the invisible structure that keeps autonomous agents safe, connected, and observable. "Agent Harness" remains a useful explanatory term for this runtime skeleton, but it is not the product's primary positioning.

Built with Python, Agentao gives developers and teams a governed runtime they can run locally, deploy privately, and embed into their own tools and workflows.

Embed in 30 lines

Embedding agentao in a Python host (FastAPI route, pytest fixture, Jupyter kernel, batch job) is the primary use case. The snippet below constructs an agent with no env-discovery, no implicit network, no global state — the host owns every input.

pip install agentao

from pathlib import Path
from agentao import Agentao
from agentao.llm import LLMClient
from agentao.transport import NullTransport

agent = Agentao(
    working_directory=Path("/tmp/agent-run-1"),
    llm_client=LLMClient(
        api_key="sk-...",
        base_url="https://api.openai.com/v1",
        model="gpt-5.4",
    ),
    transport=NullTransport(),
)
reply = agent.chat("Summarize today's logs.")
print(reply)
agent.close()

For env-driven discovery (.env, ~/.agentao/, <wd>/.agentao/*) use agentao.embedding.build_from_environment(working_directory=...) instead — it is what the CLI uses internally. See docs/EMBEDDING.md for the full embedding contract and docs/api/harness.md for the public API surface (Agentao.events(), active_permissions(), capability injection).

CLI Quickstart

If you want to drive Agentao interactively from a terminal instead of embedding it, the CLI gets you running in about 3 minutes:

Install the package with the CLI extras (rich, prompt-toolkit, readchar, pygments). Plain pip install agentao installs the embedding-only core in 0.4.0+; the [cli] extra adds the terminal UI:

pip install 'agentao[cli]'

Upgrading from 0.3.x? pip install agentao previously bundled the CLI deps automatically. From 0.4.0 they live in [cli]. The fastest "no-change" upgrade line is pip install 'agentao[full]'. See docs/migration/0.3.x-to-0.4.0.md.

Create a local .env file. Agentao requires all three provider variables at startup — missing OPENAI_BASE_URL or OPENAI_MODEL raises ValueError immediately:

printf "OPENAI_API_KEY=sk-your-key-here\nOPENAI_BASE_URL=https://api.openai.com/v1\nOPENAI_MODEL=gpt-5.4\n" > .env

Verify the CLI works:

agentao -p "Reply with the single word: OK"

Expected output:

OK

Start the interactive session:

agentao

If you hit a startup error, jump directly to Troubleshooting common startup failures or Troubleshooting.

Start Here

Choose the path that matches what you want to do:

Embed in your Python project: Embed in 30 lines → docs/EMBEDDING.md → docs/api/harness.md
New to Agentao (CLI): CLI Quickstart → Minimum Viable Configuration → Usage
Need the minimum setup only: Installation → Required environment variable → Minimal runnable example
Want to switch models or providers: Using with Different Providers
Want MCP tools: MCP Server Configuration → MCP (Model Context Protocol) Support
Want plugins, hooks, or skills: Plugin System → Hooks System → Dynamic Skills System
Want to embed Agentao in code: Headless / SDK Use → ACP (Agent Client Protocol) Mode
Want to contribute: For contributors (source install) → Development → Testing

First Commands

Once the CLI starts, these are the commands most new users need first:

/help       Show available commands
/status     Show provider, model, token usage, and task summary
/model      List or switch models on the current provider
/provider   List or switch configured providers
/todos      Show the current task checklist
/memory     Inspect or manage memory
/mcp list   Check MCP server status

Why Agentao?

Most agent frameworks give you raw capability. Agentao gives you a governed runtime.

The name itself encodes the design: Agent (capability) + Tao (governance). Every feature is built around three pillars of a governed runtime:

Pillar	What it means	How Agentao implements it
Constraint (约束)	Agents must not act without consent	Tool Confirmation — shell, web, and destructive ops pause for human approval
Connectivity (连接)	Agents must reach the world beyond their training	MCP Protocol — seamlessly connects to any external service via stdio or SSE
Observability (可观测性)	Agents must show their work	Live Thinking display + Complete Logging — every reasoning step and tool call is visible

Feature Overview

If you're evaluating Agentao, start here. If you're trying to get unblocked quickly, skip ahead to Installation and Usage.

Area	What you get	Where to go next
Governance	Tool confirmation, permission modes, read-before-assert behavior, visible reasoning	Permission Modes
Context	Long-session token tracking, compression, overflow recovery	Core Capabilities
Memory	SQLite-backed persistent memory and recall	Core Capabilities
Execution UX	Rich terminal output, structured tool display, task checklist	Usage
Extensibility	MCP servers, plugins, hooks, dynamic skills	Configuration
Runtime Surfaces	CLI, non-interactive mode, SDK embedding, ACP mode, sub-agents	Usage

Common Workflows

If you're not sure how to approach Agentao yet, follow one of these paths:

Goal	What to read	What to try
Get the first successful run	CLI Quickstart → Minimum Viable Configuration	`agentao -p "Reply with the single word: OK"`
Start using it in a real repo	Starting the Agent → Project Instructions (AGENTAO.md)	`agentao` then `/status`
Use another provider or model	Using with Different Providers → Commands	`/provider` then `/model`
Add external tools	MCP Server Configuration → MCP (Model Context Protocol) Support	create `.agentao/mcp.json` then `/mcp list`
Extend the agent	Plugin System → Hooks System → Dynamic Skills System	`agentao skill list`
Contribute code	For contributors (source install) → Testing → Development	`uv sync` then run tests

Documentation Map

Use the README for the main path, and jump to the docs below when you need depth:

Topic	Document
Quickstart	docs/QUICKSTART.md
Command cheat sheet	docs/QUICK_REFERENCE.md
Configuration reference (all `.agentao/*` files, `.env`, `AGENTAO.md`)	docs/CONFIGURATION.md
ACP server mode	docs/ACP.md
Logging details	docs/LOGGING.md
Skills guide	docs/SKILLS_GUIDE.md
Memory details	docs/features/memory-management.md
ACP client details	docs/features/acp-client.md
ACP embedding API	docs/features/acp-embedding.md
Headless runtime contract	docs/features/headless-runtime.md
Session replay	docs/features/session-replay.md
macOS sandbox-exec	docs/features/macos-sandbox-exec.md
Developer guide	developer-guide/ (run `cd developer-guide && npx vitepress dev` to browse locally)
Integration examples	examples/ — five runnable blueprints (SaaS API, IDE plugin, ticket triage, data workbench, batch job)

One-liner demo — try it right after install:

# Ask Agentao to analyze the current directory
agentao -p "List all Python files here and summarize what each one does"

Core Capabilities

This section is the detailed feature reference. If you're brand new, you can skip to Installation, Minimum Viable Configuration, and Usage first, then come back here later.

🏛️ Autonomous Governance (自治治理)

A disciplined agent that acts deliberately, not impulsively:

Multi-turn conversations with persistent context
Function calling for structured tool usage
Smart tool selection and execution
Tool-call resilience — lightweight repair for malformed JSON arguments and near-miss tool names, plus outbound text sanitization before messages are sent back to strict LLM APIs
Tool confirmation — user approval required for Shell, Web, and destructive Memory operations; domain-based tiered permissions for web_fetch (allowlist/blocklist/ask)
Reliability principles — system prompt enforces read-before-assert, discrepancy reporting, and fact/inference distinction on every turn
Operational guidelines — tone & style rules, shell command efficiency patterns, tool parallelism, non-interactive flags, and explain-before-act security rules
Auto-loading of project instructions from AGENTAO.md at startup
Current date context — injected as <system-reminder> into each user message rather than the system prompt, keeping the system prompt stable across turns for prompt cache efficiency
Live thinking display — shows LLM reasoning and tool calls in real time with Rule separators
Streaming shell output — shell command stdout displayed in real-time as it executes
Complete logging of all LLM interactions to agentao.log
Multi-line paste support — paste multi-line text as one unit (prompt_toolkit native; Alt+Enter for manual newline, Enter to submit)
Slash command Tab completion — type / and press Tab for an autocomplete menu

🧠 Elastic Context Engine (弹性上下文引擎)

Agentao keeps long sessions usable without forcing users to manually prune context.

The important user-facing pieces are:

token usage is visible in /status and /context
old history is compressed instead of silently dropped
recent turns stay verbatim for continuity
oversized tool output is truncated before it can blow up the prompt
overflow recovery retries automatically before surfacing an error

Default context limit is 200K tokens and can be changed with AGENTAO_CONTEXT_TOKENS.

💾 SQLite Memory (持久记忆)

A SQLite-backed memory system automatically resurfaces relevant context without requiring a vector database.

At the README level, the key ideas are:

two stores exist by default: project memory and user memory
persistent memories survive until deleted
session summaries help continuity across restarts
recall is dynamic per turn, so relevant memory comes back when needed
Chinese retrieval quality is improved with jieba segmentation and a user dictionary

Useful next steps:

quick usage: docs/features/memory-quickstart.md
implementation and behavior details: docs/features/memory-management.md

Save a memory:

❯ Remember that this project uses uv for package management
❯ Save my preferred language as Python

Skill Crystallization: /crystallize suggest collects structured evidence from the current session (tool calls, file paths, workflow steps, outcome signals) and drafts a SKILL.md. Iterate with /crystallize feedback <text> (or /crystallize revise for interactive input) to steer the rewrite, polish authoring style with /crystallize refine, then /crystallize create [name] writes it to skills/ (global or project scope) and reloads skills immediately. Recommended flow: suggest → feedback (repeatable) → refine → create.

💡 Semantic Display Engine

The terminal UI is designed to stay readable during real work, not just demos.

In practice this means:

tool calls render with semantic headers instead of raw noise
long output is buffered and truncated around the useful tail
diffs and errors are surfaced clearly
warnings are consolidated instead of flooding the screen
sub-agent execution and reasoning stay visually distinct

If you need command-level operational shortcuts, use docs/QUICK_REFERENCE.md. Logging detail lives in docs/LOGGING.md.

✅ Session Task Tracking

For multi-step tasks, Agentao maintains a live task checklist that the LLM updates as it works:

/todos

Task List (2/4 completed):

  ✓ Read existing code           completed
  ✓ Design new module structure  completed
  ◉ Write new module             in_progress
  ○ Run tests                    pending

LLM-managed — the agent calls todo_write at the start of complex tasks and updates statuses as each step completes (pending → in_progress → completed)
Always visible — current task list is injected into the system prompt so the LLM always knows its own progress
Session-scoped — cleared automatically on /clear or /new; not persisted to disk (unlike memory)
/status summary — shows Task list: 2/4 completed when tasks are active

🤖 SubAgent System

Agentao can delegate tasks to independent sub-agents, each running its own LLM loop with scoped tools and turn limits. Inspired by Gemini CLI's "agent as tool" pattern.

Built-in agents:

codebase-investigator — read-only codebase exploration (find files, search patterns, analyze structure)
generalist — general-purpose agent with access to all tools for complex multi-step tasks

Built-in agents are disabled by default to keep the default tool schema compact. Enable them per project in .agentao/settings.json:

{
  "agents": {
    "enable_builtin": true
  }
}

Embedded hosts can also pass enable_builtin_agents=True to Agentao(...) or build_from_environment(...).

Two trigger paths:

LLM-driven — the parent LLM decides to delegate via agent_codebase_investigator / agent_generalist tools; supports optional run_in_background=true for async fire-and-forget
User-driven — use /agent <name> <task> to run a sub-agent directly, /agent bg <name> <task> for background, /agents to view the live dashboard

Visual framing — foreground sub-agents are wrapped with cyan rule separators so their output is clearly distinct from the main agent:

──────────── ▶ [generalist]: task description ────────────
  ⚙ [generalist 1/20] read_file (src/main.py)
  ⚙ [generalist 2/20] run_shell_command (pytest)
──────── ◀ [generalist] 3 turns · 8 tool calls · ~4,200 tokens · 12s ────

Confirmation isolation:

Foreground sub-agents: confirmation dialog shows [agent_name] tool_name so you know which sub-agent is requesting permission
Background sub-agents: all tools auto-approved (no interactive prompts from background threads, which would corrupt the terminal)

Cancellation propagation — pressing Ctrl+C cleanly stops the current agent and any foreground sub-agent in progress (they share the same CancellationToken). Background agents are unaffected — they run to completion independently.

Background completion push — when a background agent finishes, the parent LLM is automatically notified at the start of the next turn via a <system-reminder> message, without needing to poll check_background_agent.

Parent context injection — sub-agents receive the last 10 parent messages as context so they understand the broader task.

Custom agents: create .agentao/agents/my-agent.md with YAML frontmatter (name, description, tools, max_turns) — auto-discovered at startup.

🔌 MCP (Model Context Protocol) Support

Connect to external MCP tool servers to dynamically extend the agent's capabilities. Agentao acts as the central hub connecting your LLM brain to the outside world:

graph LR
  User((User)) -- CLI --> Agentao[Agentao Harness]
  Agentao -- MCP --> Filesystem[Filesystem Server]
  Agentao -- MCP --> GitHub[GitHub Server]
  Agentao -- MCP --> Custom[Your Custom Server]
  Agentao -- LLM API --> Brain[OpenAI / Gemini / DeepSeek]

Stdio transport — spawn a local subprocess (e.g. npx @modelcontextprotocol/server-filesystem)
SSE transport — connect to remote HTTP/SSE endpoints
Auto-discovery — tools are discovered on startup and registered as mcp_{server}_{tool}
Confirmation — MCP tools require user confirmation unless the server is marked "trust": true
Env var expansion — $VAR and ${VAR} syntax in config values
Two-level config — project .agentao/mcp.json overrides global <home>/.agentao/mcp.json

🧩 Plugin System

Agentao supports a Claude Code-compatible plugin system for packaging extensions behind a plugin.json manifest.

At a high level, a plugin can contribute:

Skills and commands
Sub-agent definitions
MCP server definitions
Lifecycle hooks

Plugin sources are loaded with precedence from global → project → inline --plugin-dir.

Most users only need these commands:

agentao plugin list
agentao plugin list --json
agentao skill list
agentao skill install owner/repo:path/to/skill        # monorepo subdirectory
agentao skill install owner/repo:path/to/skill@main   # pin to a branch / tag / commit
agentao skill update --all

Use this section as the overview. For skill-centric workflows, jump to docs/SKILLS_GUIDE.md. The plugin internals stay in the contributor docs and implementation notes.

🪝 Hooks System

Hooks let plugins react to lifecycle events before or after prompts and tool calls.

The important part for most readers:

Agentao supports a practical subset of the Claude Code hooks model
command hooks can run external commands
prompt hooks can inject additional context
hook payloads use Claude Code tool aliases for compatibility

If you are evaluating whether hooks exist, this section answers that. If you need the full event matrix or payload contract, move that detail into dedicated docs rather than the README front page.

🎯 Dynamic Skills System

Skills are auto-discovered from skills/, activated on demand, and can be created without changing Python code.

Typical ways to work with skills:

Add a local skills/<name>/SKILL.md
Generate one from a session with /crystallize suggest
Iterate with /crystallize feedback <text> or /crystallize refine
Write it with /crystallize create [name]
Install managed skills from GitHub

Common commands:

agentao skill list
agentao skill install anthropics/skills:skills/pdf      # owner/repo:path
agentao skill install anthropics/skills:skills/pdf@main # pin a ref
agentao skill update my-skill
agentao skill remove my-skill

For a fuller walkthrough, use docs/SKILLS_GUIDE.md.

🛠️ Comprehensive Tools

Agentao ships with a broad tool surface, but most users only need the categories:

File operations: read, write, edit, list
Search and discovery: glob, grep-like content search
Shell and web access
Task tracking and memory tools
Sub-agent and skill activation tools
Dynamically discovered MCP tools

Use First Commands for the beginner subset, Commands for the full slash-command reference, and docs/QUICK_REFERENCE.md for a faster operator cheat sheet.

📼 Session Replay

Agentao can record the full runtime timeline of a session — turns, tool calls, permission decisions, streaming chunks, errors — as an append-only JSONL file under .agentao/replays/. This is separate from /sessions (which restores a conversation you can keep talking to); replay records what the agent did for debugging, audit, and protocol replay.

Disabled by default. Toggle with /replay on or set replay.enabled: true in .agentao/settings.json.
One file per session instance. A new instance_id is minted on each session birth (/clear, /new, ACP session/load all start a fresh file).
Inspect & prune — /replay list lists, /replay show <id> renders, /replay tail <id> [n] shows the last N events, /replay prune enforces the replay.max_instances cap (default 20).
Schema is exported and CI-enforced. The JSON Schema lives in agentao/replay/schema.py and drift between code and the on-disk format fails CI.

For the full event reference and field-level redaction options, see docs/features/session-replay.md.

🛡️ macOS Sandbox (defense-in-depth)

On macOS, Agentao can wrap each run_shell_command subprocess in sandbox-exec (Apple Seatbelt) so even tool calls the user approves cannot escape the workspace. This is orthogonal to permission modes — it limits what an allowed command can actually do, not whether it's allowed.

Opt-in. Off by default to avoid breaking network-dependent workflows like npm install or git clone.
Per-session toggle. /sandbox on / /sandbox off for the running session; persist by editing .agentao/sandbox.json.
Switchable profiles. /sandbox profile <name> picks among the built-in .sb templates under agentao/sandbox/profiles/.
Status & inspection. /sandbox status shows the active profile and any config errors; /sandbox profiles lists what's available.
Scope. Only run_shell_command is wrapped — other tools run inside the agent process and continue to rely on the permission engine. Linux / Windows are no-ops.

For the full profile schema and threat model, see docs/features/macos-sandbox-exec.md.

🛰️ Headless Runtime

agentao.acp_client.ACPManager is the stable, semver-guaranteed surface for embedding hosts (workflow runtimes, daemons, schedulers) that need to drive project-local ACP servers without scraping the CLI.

Public entry points. prompt_once(name, prompt, ...) for one-shot fire-and-forget turns; send_prompt(...) for long-lived sessions. Both fail-fast with AcpClientError(code=SERVER_BUSY) rather than block when a turn is already in flight on the same server.
One active turn per server. Concurrency is enforced via a per-server lock; cancellation, timeouts, and errors all clean up the slot.
Non-interactive mode. Set interactive=False so the manager auto-rejects session/request_permission and _agentao.cn/ask_user instead of blocking on WAITING_FOR_USER — the right default for daemons.
Stable import root. Only from agentao.acp_client import ... is semver-stable; submodule internals can change between releases.

A runnable smoke consumer lives at examples/headless_worker.py. For the full contract — error codes, timeout vs. lock-wait semantics, latched-interaction behavior — see docs/features/headless-runtime.md and docs/features/acp-embedding.md.

Design Principles

Agentao is built around three foundational principles:

Minimalism (极简) — Zero friction to start. pip install agentao and you're running. No databases, no complex config, no cloud dependencies.
Transparency (透明) — No black boxes. The agent's reasoning chain is displayed in real time. Every LLM request, tool call, and token count is logged to agentao.log. You always know what the agent is doing and why.
Integrity (完整) — Context is never silently lost. Conversation history is compressed with LLM summarization (not truncated blindly), and memory recall ensures relevant context resurfaces automatically. The agent maintains a coherent world-model across sessions.

Installation

If your goal is simply "get Agentao running", read this section together with Minimum Viable Configuration. If you're contributing to the codebase, you can jump to For contributors (source install).

Prerequisites

Python 3.10 or higher
An API key (OpenAI, Anthropic, Gemini, DeepSeek, or any OpenAI-compatible provider)

Install

Pick the install line that matches your usage. From 0.4.0 the package ships a small embedding-only core; CLI / web fetch / Chinese tokenization are opt-in extras.

# Embedding host (Python `from agentao import Agentao`) — smallest closure
pip install agentao

# CLI user (`agentao` console script) — adds rich/prompt-toolkit/readchar/pygments
pip install 'agentao[cli]'

# Upgrading from 0.3.x and want zero behaviour change — exact 0.3.x closure
pip install 'agentao[full]'

See docs/migration/0.3.x-to-0.4.0.md for the full migration guide and dependency map.

Then create a .env file. Agentao requires all three provider variables at startup — OPENAI_API_KEY, OPENAI_BASE_URL, and OPENAI_MODEL:

printf "OPENAI_API_KEY=your-api-key-here\nOPENAI_BASE_URL=https://api.openai.com/v1\nOPENAI_MODEL=gpt-5.4\n" > .env

For contributors (source install)

git clone https://github.com/jin-bo/agentao
cd agentao
uv sync
cp .env.example .env

Minimum Viable Configuration

Everything you need to get Agentao running from scratch.

Recommended first-run order:

Confirm your Python version.
Set OPENAI_API_KEY, OPENAI_BASE_URL, and OPENAI_MODEL in .env (all three are mandatory).
Run the minimal example.
If it fails, check the startup troubleshooting table below.

Supported Python versions

Version	Status
3.10	✅ supported
3.11	✅ supported
3.12	✅ supported
< 3.10	❌ not supported

Verify before installing:

python --version   # must be 3.10 or higher

Required environment variable

Three variables are mandatory to start Agentao:

Variable	Required	Example
`OPENAI_API_KEY`	Yes	`sk-...`
`OPENAI_BASE_URL`	Yes	`https://api.openai.com/v1`
`OPENAI_MODEL`	Yes	`gpt-5.4`

All three must be set — Agentao raises ValueError immediately at startup if any is missing. The minimum .env:

OPENAI_API_KEY=sk-your-key-here
OPENAI_BASE_URL=https://api.openai.com/v1
OPENAI_MODEL=gpt-5.4

Create it in the directory where you run agentao:

printf "OPENAI_API_KEY=sk-your-key-here\nOPENAI_BASE_URL=https://api.openai.com/v1\nOPENAI_MODEL=gpt-5.4\n" > .env

Note: Agentao loads .env from the current working directory, then falls back to ~/.env. No system-level setup is needed.

Default provider behavior

Setting	Default	Override with
Provider	`OPENAI`	`LLM_PROVIDER=ANTHROPIC`
API key	(none — must be set)	`OPENAI_API_KEY=sk-...`
Base URL	(none — must be set)	`OPENAI_BASE_URL=https://api.openai.com/v1`
Model	(none — must be set)	`OPENAI_MODEL=gpt-5.4`
Temperature	`0.2`	`LLM_TEMPERATURE=0.7`

Each provider reads its own <NAME>_API_KEY, <NAME>_BASE_URL, and <NAME>_MODEL:

# Use Anthropic Claude instead of OpenAI
LLM_PROVIDER=ANTHROPIC
ANTHROPIC_API_KEY=sk-ant-...
ANTHROPIC_MODEL=claude-sonnet-4-6
ANTHROPIC_BASE_URL=https://api.anthropic.com/v1

Minimal runnable example

pip install 'agentao[cli]'
printf "OPENAI_API_KEY=sk-your-key-here\nOPENAI_BASE_URL=https://api.openai.com/v1\nOPENAI_MODEL=gpt-5.4\n" > .env

# Verify it works without a UI (exits after one response)
agentao -p "Reply with the single word: OK"

Expected output:

OK

If that works, start the interactive session:

agentao

Troubleshooting common startup failures

Symptom	Likely cause	Fix
`ValueError: Missing OPENAI_API_KEY`	API key not set	Add `OPENAI_API_KEY=sk-...` to `.env`
`ValueError: Missing OPENAI_BASE_URL`	Base URL not set	Add `OPENAI_BASE_URL=https://api.openai.com/v1` to `.env`
`ValueError: Missing OPENAI_MODEL`	Model not set	Add `OPENAI_MODEL=gpt-5.4` (or your target model) to `.env`
`AuthenticationError`	Invalid API key	Verify the key value in `.env`
`NotFoundError: model not found`	Model name doesn't match provider	Confirm the model is available for your provider
`APIConnectionError`	Network / firewall / proxy issue	Check your internet connection; set `OPENAI_BASE_URL` if behind a proxy
`command not found: agentao`	CLI not on PATH or `[cli]` extra missing	Run `pip install 'agentao[cli]'`; add `~/.local/bin` (Linux/Mac) or `Scripts\` (Windows) to `$PATH`
`agentao CLI requires extra packages (missing: rich)`	Bare core install (post-0.4.0)	Run `pip install 'agentao[cli]'` — see `docs/migration/0.3.x-to-0.4.0.md`
Starts but gives wrong-provider errors	`LLM_PROVIDER` mismatch	Make sure `LLM_PROVIDER` matches the key you provided (e.g. `LLM_PROVIDER=OPENAI` with `OPENAI_API_KEY`)
`ModuleNotFoundError` on startup	Incomplete install	Re-run `pip install 'agentao[cli]'` for CLI use, `pip install 'agentao[full]'` for the 0.3.x-equivalent closure; check Python version ≥ 3.10
`.env` not loaded	File in wrong directory	Run `agentao` from the directory containing `.env`, or place it in `~/.env`

Configuration

Use this section after the first successful run. If you're new, you usually only need the .env example below plus the provider notes in Using with Different Providers.

Edit .env with your settings:

# Required: Your API key
OPENAI_API_KEY=your-api-key-here

# Required: Base URL for the API endpoint
OPENAI_BASE_URL=https://api.openai.com/v1

# Required: Model name (no default — must be set explicitly)
OPENAI_MODEL=gpt-5.4

# Optional: Context window limit in tokens (default: 200000)
# AGENTAO_CONTEXT_TOKENS=200000

# Optional: Maximum tokens the LLM may generate per response (default: 65536)
# LLM_MAX_TOKENS=65536

# Optional: LLM sampling temperature (default: 0.2)
# LLM_TEMPERATURE=0.2

MCP Server Configuration

Create .agentao/mcp.json in your project (or <home>/.agentao/mcp.json for global servers):

{
  "mcpServers": {
    "filesystem": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-filesystem", "/path/to/dir"],
      "trust": true
    },
    "github": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-github"],
      "env": { "GITHUB_TOKEN": "$GITHUB_TOKEN" }
    },
    "remote-api": {
      "url": "https://api.example.com/sse",
      "headers": { "Authorization": "Bearer $API_KEY" },
      "timeout": 30
    }
  }
}

Field	Description
`command`	Executable for stdio transport
`args`	Command-line arguments
`env`	Extra environment variables (supports `$VAR` / `${VAR}` expansion)
`cwd`	Working directory for subprocess
`url`	SSE endpoint URL
`headers`	HTTP headers for SSE transport
`timeout`	Connection timeout in seconds (default: 60)
`trust`	Skip confirmation for this server's tools (default: false)

MCP servers connect automatically on startup. Use /mcp list to check status.

Using with Different Providers

Agentao supports switching between providers at runtime with /provider. Add credentials for each provider to your .env (or ~/.env) using the naming convention <NAME>_API_KEY, <NAME>_BASE_URL, and <NAME>_MODEL. All three must be set — a provider missing any of them is not shown in the /provider list and cannot be switched to.

# OpenAI (default)
OPENAI_API_KEY=sk-...
OPENAI_BASE_URL=https://api.openai.com/v1
OPENAI_MODEL=gpt-5.4

# Gemini
GEMINI_API_KEY=...
GEMINI_BASE_URL=https://generativelanguage.googleapis.com/v1beta/openai/
GEMINI_MODEL=gemini-2.0-flash

# DeepSeek
DEEPSEEK_API_KEY=...
DEEPSEEK_BASE_URL=https://api.deepseek.com/v1
DEEPSEEK_MODEL=deepseek-chat

Then switch at runtime:

/provider           # list providers that have both API key and model configured
/provider GEMINI    # switch to Gemini
/model              # see available models on the new endpoint

Usage

This section is organized by usage mode:

Interactive CLI: Starting the Agent and Commands
One-shot scripting: Non-Interactive (Print) Mode
Python embedding: Headless / SDK Use
Editor or external client integration: ACP (Agent Client Protocol) Mode
Project-local ACP servers: ACP Client — Project-Local Server Management

Starting the Agent

agentao

If this is your first interactive run, try /help, /status, and /model first.

Non-Interactive (Print) Mode

Use -p / --print to send a single prompt, get a plain-text response on stdout, and exit — no UI, no confirmations. Useful for scripting and pipes.

# Basic usage
agentao -p "What is 2+2?"

# Read from stdin
echo "Summarize this: hello world" | agentao -p

# Combine -p argument with stdin (both are joined and sent as one prompt)
echo "Some context" | agentao -p "Summarize the stdin"

# Pipe output to a file
agentao -p "List 3 prime numbers" > output.txt

# Use in a pipeline
agentao -p "Translate to French: Good morning" | pbcopy

In print mode all tools are auto-confirmed (no interactive prompts). The exit code is 0 on success and 1 on error.

Headless / SDK Use

Embed Agentao in your own Python code with no terminal UI:

from pathlib import Path
from agentao.embedding import build_from_environment
from agentao.transport import SdkTransport

events = []
transport = SdkTransport(
    on_event=events.append,           # receive typed AgentEvents
    confirm_tool=lambda n, d, a: True,  # auto-approve all tools
)
# Factory reads .env / .agentao/ for credentials and config.
# For pure-injection construction use Agentao(working_directory=, api_key=, base_url=, model=, transport=...).
agent = build_from_environment(working_directory=Path.cwd(), transport=transport)
response = agent.chat("Summarize the current directory")

SdkTransport accepts four optional callbacks: on_event, confirm_tool, ask_user, on_max_iterations. Omit any you don't need — unset ones fall back to safe defaults (auto-approve, sentinel for ask_user, stop on max iterations).

For fully silent headless use with no callbacks, call build_from_environment(working_directory=...) with no transport — it uses NullTransport automatically. Embedded hosts that want pure-injection construction (no .env / .agentao/ reads) can call Agentao(working_directory=..., api_key=..., base_url=..., model=..., transport=...) directly; since 0.3.0 working_directory= is required (TypeError without it), and the three credential fields are required when no llm_client= is supplied.

For end-to-end embedding patterns — capability injection (FileSystem / ShellExecutor / MemoryStore / MCPRegistry), async usage (agent.arun(...)), opt-in replay / sandbox / bg_store, and the 0.2.15 → 0.3.0 migration guide — see docs/EMBEDDING.md.

The host-stable observation surface lives in agentao.harness: agent.events() returns an async iterator over ToolLifecycleEvent / SubagentLifecycleEvent / PermissionDecisionEvent, and agent.active_permissions() returns a JSON-safe ActivePermissions snapshot (with loaded_sources provenance). Internal AgentEvent / Transport.emit() are richer but not version-stable — production hosts should target the harness contract. Full reference: docs/api/harness.md.

ACP (Agent Client Protocol) Mode

Launch Agentao as an ACP stdio JSON-RPC server so ACP-compatible clients (e.g. Zed) can drive Agentao as their agent runtime:

agentao --acp --stdio
# or, when the console script isn't on PATH:
python -m agentao --acp --stdio

The server reads newline-delimited JSON-RPC 2.0 messages on stdin, writes responses and session/update notifications on stdout, and routes logs (and any stray print) to stderr. Press Ctrl-D or close stdin to shut down cleanly.

Supported methods: initialize, session/new, session/prompt, session/cancel, session/load. Tool confirmations are surfaced via server→client session/request_permission requests with allow_once / allow_always / reject_once / reject_always options. Per-session cwd and mcpServers injection are supported; multi-session isolation (cancel/permission/messages) is enforced.

v1 limits: stdio transport only; text and resource_link content blocks only (image/audio/embedded resource are rejected with INVALID_PARAMS); MCP servers limited to stdio + sse (http capability is false); ACP-level fs/* and terminal/* host capabilities are not proxied — files and shell commands run locally in the session's cwd.

See docs/ACP.md for the full launch flow, supported method table, capability advertisement, annotated NDJSON transcript, event mapping reference, troubleshooting, and contributor notes.

ACP Client — Project-Local Server Management

In addition to acting as an ACP server, Agentao can also act as an ACP client — connecting to and managing project-local ACP servers. These are external agent processes that communicate over stdio using JSON-RPC 2.0 with NDJSON framing.

Configuration: Create .agentao/acp.json in your project root:

{
  "servers": {
    "planner": {
      "command": "node",
      "args": ["./agents/planner/index.js"],
      "env": { "LOG_LEVEL": "info" },
      "cwd": ".",
      "description": "Planning agent",
      "autoStart": true
    },
    "reviewer": {
      "command": "python",
      "args": ["-m", "review_agent"],
      "cwd": "./agents/reviewer",
      "description": "Code review agent",
      "autoStart": false,
      "requestTimeoutMs": 120000
    }
  }
}

Server lifecycle:

configured → starting → initializing → ready ↔ busy → stopping → stopped
                                         ↕
                                   waiting_for_user

CLI commands:

Command	Description
`/acp`	Overview of all servers
`/acp start <name>`	Start a server
`/acp stop <name>`	Stop a server
`/acp restart <name>`	Restart a server
`/acp send <name> <msg>`	Send a prompt (auto-connects)
`/acp cancel <name>`	Cancel active turn
`/acp status <name>`	Detailed status
`/acp logs <name> [n]`	View stderr output (last n lines)
`/acp approve <name> <id>`	Approve a permission request
`/acp reject <name> <id>`	Reject a permission request
`/acp reply <name> <id> <text>`	Reply to an input request

Interaction bridge: When an ACP server needs user input (permission confirmation or free-form text), it sends a notification that becomes a pending interaction. These appear in the inbox and in /acp status <name>.

Extension method: Agentao advertises a private _agentao.cn/ask_user extension for requesting free-form text input from the user, enabling richer server-to-user interaction beyond simple permission grants.

Key design decisions:

Project-only config — no global <home>/.agentao/acp.json; ACP servers are project-scoped
No auto-send — messages are never automatically routed to ACP servers; use /acp send explicitly
Separate inbox — server output appears in the ACP inbox, not in the main conversation context
Lazy initialization — the ACP manager is created on first /acp command, not at startup

See docs/features/acp-client.md for the full configuration reference, lifecycle details, interaction bridge protocol, diagnostics, and troubleshooting guide.

Commands

All commands start with /. Type / and press Tab for autocomplete.

If you only need the small beginner subset, start with First Commands. The table below is the full command reference.

Command	Description
`/help`	Show help message
`/clear`	Save current session, clear conversation history and all memories, start fresh
`/new`	Alias for `/clear`
`/status`	Show message count, model, active skills, memory count, context usage
`/model`	Fetch and list available models from the configured API endpoint
`/model <name>`	Switch to specified model (e.g., `/model gpt-5.4`)
`/provider`	List available providers (detected from `*_API_KEY` env vars)
`/provider <NAME>`	Switch to a different provider (e.g., `/provider GEMINI`)
`/skills`	List available and active skills
`/memory`	List all saved memories
`/memory user`	Show user-scope memories (/.agentao/memory.db)
`/memory project`	Show project-scope memories (.agentao/memory.db)
`/memory session`	Show current session summary (from session_summaries table)
`/memory status`	Show memory counts, session size, and recall hit count
`/memory search <query>`	Search memories (searches keys, tags, and values)
`/memory tag <tag>`	Filter memories by tag
`/memory delete <key>`	Delete a specific memory
`/memory clear`	Clear all memories (with confirmation)
`/crystallize suggest`	Analyze the session (evidence + transcript) and draft a reusable skill
`/crystallize feedback <text>`	Add feedback and rewrite the current skill draft
`/crystallize revise`	Interactively enter feedback and rewrite the draft
`/crystallize refine`	Improve the current draft with skill-creator guidance
`/crystallize status`	Show pending draft status (feedback + evidence counts)
`/crystallize clear`	Clear the pending draft
`/crystallize create [name]`	Write the skill draft to skills/ (prompts for name and scope)
`/mcp`	List MCP servers with status and tool counts
`/mcp add <name> <cmd\|url>`	Add an MCP server to project config
`/mcp remove <name>`	Remove an MCP server from project config
`/context`	Show current context window usage (tokens and %)
`/context limit <n>`	Set context window limit (e.g., `/context limit 100000`)
`/agent`	List available sub-agents
`/agent list`	Same as `/agent`
`/agent <name> <task>`	Run a sub-agent in foreground (with ▶/◀ visual boundary)
`/agent bg <name> <task>`	Run a sub-agent in background (returns agent ID immediately)
`/agent dashboard`	Live auto-refreshing dashboard of all background agents
`/agent status`	Show all background agent tasks (status, elapsed, stats)
`/agent status <id>`	Show full result or error for a specific background agent
`/agents`	Shorthand for `/agent dashboard`
`/mode`	Show current permission mode
`/mode read-only`	Block all write and shell tools
`/mode workspace-write`	Allow file writes and safe read-only shell; ask for web (default)
`/mode full-access`	Allow all tools without prompting
`/plan`	Enter plan mode (LLM researches and drafts a plan; no mutations allowed)
`/plan show`	Display the saved plan file
`/plan implement`	Exit plan mode, restore prior permissions, display saved plan
`/plan clear`	Archive and clear the current plan; exit plan mode
`/plan history`	List recently archived plans
`/copy`	Copy last agent response to clipboard (Markdown)
`/sessions`	List saved sessions
`/sessions resume <id>`	Resume a saved session
`/sessions delete <id>`	Delete a specific session
`/sessions delete all`	Delete all saved sessions (with confirmation)
`/todos`	Show the current session task list with status icons
`/tools`	List all registered tools with descriptions
`/tools <name>`	Show parameter schema for a specific tool
`/exit` or `/quit`	Exit the program

Permission Modes (Safety Feature)

Agentao controls which tools execute automatically versus which require user confirmation via three named permission modes. Switch with /mode — the choice is persisted to .agentao/settings.json and restored on the next launch.

Mode	Behavior
`workspace-write`	Default. File writes (`write_file`, `replace`) and safe read-only shell commands (`git status/log/diff`, `ls`, `grep`, `cat`, etc.) execute automatically. Web access (`web_fetch`, `web_search`) asks. Unknown shell commands ask. Dangerous patterns (`rm -rf`, `sudo`) are blocked.
`read-only`	All write and shell tools are blocked. Only read-only tools (`read_file`, `glob`, `grep`, etc.) are permitted.
`full-access`	All tools execute without prompting. Useful for trusted, fully automated workflows.

/mode                   (show current mode)
/mode workspace-write   (default — file writes + safe shell allowed)
/mode read-only         (block all writes and shell)
/mode full-access       (allow everything)

Tools that still ask in workspace-write mode:

web_fetch — network access (with domain-tiered exceptions: see below)
web_search — network access
run_shell_command — when the command doesn't match the safe-prefix allowlist
mcp_* — MCP server tools (unless server has "trust": true)

Domain-based permissions for web_fetch:

Category	Domains	Behavior
Allowlist	`.github.com`, `.docs.python.org`, `.wikipedia.org`, `r.jina.ai`, `.pypi.org`, `.readthedocs.io`	Auto-allow
Blocklist	`localhost`, `127.0.0.1`, `0.0.0.0`, `169.254.169.254`, `.internal`, `.local`, `::1`	Auto-deny (SSRF protection)
Default	Everything else	Ask for confirmation

Customize via .agentao/permissions.json:

{
  "rules": [
    {"tool": "web_fetch", "domain": {"allowlist": [".mycompany.com"]}, "action": "allow"},
    {"tool": "web_fetch", "domain": {"blocklist": [".sketchy.io"]}, "action": "deny"}
  ]
}

Domain patterns: leading dot (.github.com) = suffix match; no dot (r.jina.ai) = exact match.

During a confirmation prompt, if you press 2 (Yes to all) the session escalates to full-access mode in memory — no prompts for the rest of the session, but the saved mode is unchanged so the next launch uses whatever /mode you set last.

Plan Mode

Plan mode is a dedicated workflow for complex tasks where you want the LLM to research and draft a plan first, then execute only after you approve.

/plan                   (enter plan mode — prompt turns [plan])
"Plan how to refactor the logging module"
                        (agent reads files, calls plan_save → gets draft_id)
                        (agent calls plan_finalize(draft_id) when ready)
                        "Execute this plan? [y/N]"
y                       (exit plan mode, restore permissions, agent implements)

What plan mode enforces:

File writes (write_file, replace) are denied
Memory writes (save_memory, todo_write) are denied
Non-allowlisted shell commands are denied (no accidental side effects)
Safe read-only shell commands (git diff, ls, cat, grep, etc.) are allowed
Web access (web_fetch, web_search) asks as usual
Skill activation is allowed (skills only modify the system prompt)

Model tools — plan_save(content) and plan_finalize(draft_id) are available to the agent in plan mode. The agent calls plan_save to save a draft and receives a draft_id. It must pass that exact draft_id to plan_finalize to trigger the approval prompt — ensuring you approve the exact draft you reviewed.

Plan mode preset takes precedence over any custom permissions.json rules — a workspace allow for write_file cannot bypass plan mode restrictions.

Workflow:

/plan — enter plan mode; prompt indicator turns [plan] (magenta)
Ask the agent to plan something — it reads files and writes a structured plan
Agent calls plan_save to persist the draft; the approval prompt only appears after plan_finalize
Press y at the "Execute?" prompt to implement, or n to continue refining
/plan implement — manually exit plan mode and restore prior permissions
/plan clear — delete the plan file and exit plan mode

Notes:

Prior permission mode is saved and restored exactly on /plan implement
/mode is blocked while in plan mode (use /plan implement to exit first)
/clear resets plan mode automatically

Confirmation menu keys (no Enter needed):

1 — Yes, execute once
2 — Yes to all for this session (escalates to full-access in memory)
3 or Esc — Cancel

Example Interactions

Reading and analyzing files:

❯ Read the file main.py and explain what it does
❯ Search for all Python files in this directory
❯ Find all TODO comments in the codebase

Working with code:

❯ Create a new Python file called utils.py with helper functions
❯ Replace the old function in utils.py with an improved version
❯ Run the tests using pytest

Web and search:

❯ Fetch the content from https://example.com
❯ Search for Python best practices

Memory:

❯ Remember that I prefer tabs over spaces for indentation
❯ Save this API endpoint URL for future use
❯ What do you remember about my preferences?
/memory status              (see entry counts, session size, recall hits)
/memory user                (browse profile-scope memories)
/memory project             (browse project-scope memories)

Skill crystallization:

/crystallize suggest             (draft a skill from the current session)
/crystallize feedback <text>     (rewrite the draft with your feedback, repeatable)
/crystallize revise              (interactively enter feedback)
/crystallize refine              (polish with skill-creator guidance)
/crystallize status              (show feedback + evidence counts)
/crystallize create [name]       (write the skill to skills/ and reload)

Context management:

❯ /context                     (check current token usage)
❯ /context limit 100000        (set a lower context limit)
❯ /status                      (see memory count and context %)

Using agents:

❯ Analyze the project structure and find all API endpoints
     (LLM may auto-delegate to codebase-investigator)
/agent codebase-investigator find all TODO comments in this project
/agent generalist refactor the logging module to use structured output

/agent bg generalist run the full test suite and summarize failures
/agents                        (live dashboard — auto-refreshes while agents run)
/agent status a1b2c3d4         (get full result of a specific background agent)

Using MCP tools:

/mcp list                   (check connected servers and tools)
/mcp add fs npx -y @modelcontextprotocol/server-filesystem /tmp
❯ List all files in the project     (LLM may use MCP filesystem tools)

Task tracking:

❯ Refactor the logging module to use structured output
     (LLM creates a task list, updates statuses as it works)
/todos                          (view current task list at any time)
/status                         (shows "Task list: 3/5 completed")

Using skills:

❯ Activate the pdf skill to help me merge PDF files
❯ Use the xlsx skill to analyze this spreadsheet

Planning before implementing:

/plan
"Plan how to add a /foo command to the CLI"
                        (agent reads files, calls plan_save, then plan_finalize)
                        "Execute this plan? [y/N]" → y
                        (exits plan mode, agent implements)
/plan implement         (manual exit if you pressed n)
/plan show              (view saved plan at any time)
/plan clear             (discard plan and exit plan mode)

Copying output:

/copy                           (copy last response to clipboard as Markdown)

Inspecting tools:

/tools                          (list all registered tools)
/tools run_shell_command        (show parameter schema)
/tools web_fetch                (check what args it accepts)

Project Instructions (AGENTAO.md)

Use AGENTAO.md when you want project-specific rules, conventions, or workflow instructions to load automatically for every session in the current repository.

Agentao automatically loads project-specific instructions from AGENTAO.md if it exists in the current working directory. This is the most powerful customization feature — it injects your instructions at the top of the system prompt, making them higher-priority than any built-in agent guidelines.

Use AGENTAO.md to define:

Code style and conventions
Project structure and patterns
Development workflows and testing approaches
Common commands and best practices
Reliability rules (e.g. require the agent to cite file and line number when making factual claims)

If the file doesn't exist, the agent works normally with its default instructions. Think of it as a per-project .cursorrules or CLAUDE.md — a lightweight way to give the agent deep project context without touching any code.

Project Structure

This section is mainly for contributors or advanced users who want to understand how the codebase is organized.

agentao/
├── main.py                  # Entry point
├── pyproject.toml           # Project configuration
├── .env                     # Configuration (create from .env.example)
├── .env.example             # Configuration template
├── AGENTAO.md             # Project-specific agent instructions
├── README.md                # This file
├── tests/                   # Test files
│   └── test_*.py            # Feature tests
├── docs/                    # Documentation
│   ├── features/            # Feature documentation
│   └── implementation/      # Technical implementation details
└── agentao/
    ├── agent.py             # Core orchestration
    ├── session.py           # Save / load / list saved sessions
    ├── cancellation.py      # Cooperative CancellationToken (Ctrl+C propagation)
    ├── permissions.py       # PermissionEngine (modes, domain rules, allowlists)
    ├── display.py           # Rich theming + tool/event renderers
    ├── logging_utils.py     # File logger setup for agentao.log
    ├── tool_runner.py       # Thin facade re-exporting runtime/ phases
    ├── cli/                 # CLI interface (Rich) — split into subpackage
    │   ├── _globals.py      # Console, logger, theme
    │   ├── _utils.py        # Slash commands, completer
    │   ├── app.py           # AgentaoCLI class (init, REPL loop)
    │   ├── transport.py     # Transport protocol callbacks
    │   ├── session.py       # Session lifecycle hooks
    │   ├── commands.py      # Slash command handlers (incl. /sandbox, /replay)
    │   ├── commands_ext.py  # Heavy command handlers (memory, agent)
    │   ├── entrypoints.py   # Entry points, parser, init wizard
    │   └── subcommands.py   # Skill/plugin CLI subcommands
    ├── runtime/             # Per-turn pipeline (split out of ToolRunner)
    │   ├── chat_loop.py     # Outer chat() loop; iteration cap; replay end-of-turn
    │   ├── llm_call.py      # Single LLM round-trip + retry/recovery
    │   ├── turn.py          # PerTurnRuntime — wraps one LLM→tool cycle
    │   ├── tool_planning.py # Plan tool calls (resolve, dedupe, ordering)
    │   ├── tool_executor.py # Phase-3 execution (permissions, sandbox injection)
    │   └── tool_result_formatter.py  # Format tool results for next LLM turn
    ├── context_manager.py   # Context window management + Agentic RAG
    ├── transport/           # Transport protocol (decouple runtime from UI)
    │   ├── events.py        # AgentEvent + EventType
    │   ├── null.py          # NullTransport (headless / silent)
    │   ├── sdk.py           # SdkTransport + build_compat_transport
    │   └── base.py          # Transport Protocol definition
    ├── prompts/             # System-prompt assembly
    │   ├── builder.py       # _build_system_prompt entry point
    │   ├── sections.py      # Identity, reliability, tone & style blocks
    │   └── helpers.py       # Date injection, AGENTAO.md loader
    ├── llm/
    │   └── client.py        # OpenAI-compatible LLM client
    ├── agents/
    │   ├── manager.py       # AgentManager — loads definitions, creates wrappers
    │   ├── tools.py         # TaskComplete, CompleteTaskTool, AgentToolWrapper
    │   └── definitions/     # Built-in agent definitions (.md with YAML frontmatter)
    ├── plugins/             # Plugin system
    │   ├── manager.py       # Plugin discovery, loading, precedence
    │   ├── manifest.py      # plugin.json parser + path safety
    │   ├── hooks.py         # Hook dispatch, payload adapters, tool aliasing
    │   ├── models.py        # Plugin data models, supported events/types
    │   ├── skills.py        # Plugin skill resolution + collision detection
    │   ├── agents.py        # Plugin agent resolution
    │   ├── mcp.py           # Plugin MCP server merge
    │   └── diagnostics.py   # Plugin diagnostics + CLI reporting
    ├── plan/                # Plan mode (controller, prompt, session state)
    ├── replay/              # Session replay subsystem
    │   ├── recorder.py      # Append-only JSONL writer per session/instance
    │   ├── reader.py        # Stream/tail replay files
    │   ├── schema.py        # Exported JSON Schema (drift-checked in CI)
    │   ├── lifecycle.py     # start/end hooks tied to session lifecycle
    │   ├── retention.py     # max_instances pruning
    │   ├── redact.py        # Field-level redaction
    │   └── sanitize.py      # Tool-arg / event sanitization
    ├── sandbox/             # macOS sandbox-exec wrapper (opt-in)
    │   ├── policy.py        # SandboxPolicy.resolve(tool_name, args)
    │   └── profiles/        # Built-in .sb seatbelt profiles
    ├── acp/                 # ACP server (run Agentao as agent for Zed/etc.)
    │   ├── server.py        # JSON-RPC stdio loop
    │   ├── session_manager.py  # Multi-session isolation
    │   ├── session_new.py / session_prompt.py / session_cancel.py / session_load.py
    │   ├── transport.py     # NDJSON framing
    │   └── mcp_translate.py # Per-session MCP injection
    ├── acp_client/          # ACP client manager (Agentao drives external servers)
    │   ├── manager/         # ACPManager — public API: prompt_once, send_prompt
    │   ├── client.py        # Single-server JSON-RPC client
    │   ├── process.py       # Subprocess lifecycle
    │   ├── inbox.py         # Server-output inbox
    │   ├── interaction.py   # Permission / ask_user bridge
    │   └── router.py        # Notification dispatch
    ├── mcp/
    │   ├── config.py        # Config loading + env var expansion
    │   ├── client.py        # McpClient + McpClientManager
    │   └── tool.py          # McpTool wrapper for Tool interface
    ├── memory/
    │   ├── manager.py       # SQLite memory manager
    │   ├── models.py        # MemoryEntry, IndexEntry dataclasses
    │   ├── retriever.py     # Index-based dynamic recall
    │   └── crystallizer.py  # Skill Crystallization
    ├── tooling/             # Cross-cutting tool helpers (registry adapters)
    ├── tools/
    │   ├── base.py          # Tool base class + registry
    │   ├── file_ops.py      # Read, write, edit, list
    │   ├── search.py        # Glob, grep
    │   ├── shell.py         # Shell execution (sandbox-aware)
    │   ├── web.py           # Fetch, search
    │   ├── memory.py        # Persistent memory tools
    │   ├── skill.py         # Skill activation
    │   ├── ask_user.py      # Mid-task user clarification
    │   └── todo.py          # Session task checklist
    └── skills/
        ├── manager.py       # Skill loading and management
        ├── registry.py      # Skill registry (JSON-backed)
        ├── installer.py     # Skill install/update from remote
        └── sources.py       # GitHub skill source

Testing

Run these checks before opening a PR or after making behavior changes.

# Run all tests (requires source checkout)
python -m pytest tests/ -v

# Run specific test files
python -m pytest tests/test_context_manager.py -v
python -m pytest tests/test_memory_management.py -v

Tests use unittest.mock.Mock for the LLM client — no real API calls required.

Logging

Use this section when you need to inspect what the agent, model, or tools actually did during a session.

All LLM interactions are logged to agentao.log:

tail -f agentao.log    # Real-time monitoring
grep "ERROR" agentao.log

Logged data includes: full message content, tool calls with arguments, tool results, token usage, and timestamps.

Development

Contributor reading path:

For contributors (source install)
Project Structure
Testing
The extension guide you need: Adding a Tool, Adding an Agent, or Adding a Skill

This section is contributor-oriented. If you're only using Agentao as a CLI, you can skip it.

Adding a Tool

Create a tool class in agentao/tools/:

from .base import Tool

class MyTool(Tool):
    @property
    def name(self) -> str:
        return "my_tool"

    @property
    def description(self) -> str:
        return "Description for LLM"

    @property
    def parameters(self) -> Dict[str, Any]:
        return {
            "type": "object",
            "properties": {
                "param": {"type": "string", "description": "..."}
            },
            "required": ["param"],
        }

    @property
    def requires_confirmation(self) -> bool:
        return False  # Set True for dangerous operations

    def execute(self, param: str) -> str:
        return f"Result: {param}"

tools_to_register.append(MyTool())

Adding an Agent

Create a Markdown file with YAML frontmatter. Built-in agents live in agentao/agents/definitions/ and are opt-in via .agentao/settings.json; project-level agents go in .agentao/agents/ and are discovered by default.

---
name: my-agent
description: "When to use this agent (shown to LLM for delegation decisions)"
tools:                    # optional — omit for all tools
  - read_file
  - search_file_content
  - run_shell_command
max_turns: 10             # optional, default 15
---
You are a specialized agent. Instructions for the sub-agent go here.
When finished, call complete_task to return your result.

Restart Agentao — agents are auto-discovered and registered as agent_my_agent tools.

Adding a Skill

Option A: manually

Create skills/my-skill/SKILL.md:

---
name: my-skill
description: Use when... (trigger conditions for LLM)
---

# My Skill

Documentation here...

Restart Agentao — skills are auto-discovered.

Option B: crystallize from a session

/crystallize suggest             (LLM drafts a skill from the session's tool calls + transcript)
/crystallize feedback <text>     (optional, repeatable — rewrite the draft to address your feedback)
/crystallize refine              (optional — polish authoring style with skill-creator guidance)
/crystallize create [name]       (prompts for scope, writes SKILL.md, reloads immediately)

Each suggest captures structured evidence — tool calls, file paths, workflow steps, outcome signals — alongside the transcript, so the draft reflects what the session actually did, not just what was chatted about. feedback / revise update the pending draft in place; status shows the current draft's evidence + feedback counts; clear discards it.

Skills created with /crystallize create are written to .agentao/skills/ (project scope) or <home>/.agentao/skills/ (global scope) and are available immediately without restarting.

Troubleshooting

Use this section for issues beyond first-run setup, including runtime behavior, tools, and integration problems.

If you're debugging a first-run problem, start with Troubleshooting common startup failures instead of this section.

Model List Not Loading: /model queries the live API endpoint. If it fails (invalid key, unreachable endpoint, no models endpoint), a clear error is shown. Verify your OPENAI_API_KEY and OPENAI_BASE_URL settings.

Provider List Empty: /provider shows only providers that have all three of *_API_KEY, *_BASE_URL, and *_MODEL set. Make sure all three are in ~/.env or exported into the shell — a local .env in the project directory is not required.

API Key Issues: Verify .env exists and contains a valid key with correct permissions.

Context Too Long Errors: Agentao handles these automatically with three-tier recovery (compress → minimal history → error). Common causes: very large tool results (e.g. reading huge files) or extremely long conversations. If errors persist, lower the limit with /context limit <n> or AGENTAO_CONTEXT_TOKENS.

Memory Not Appearing in Responses: Check /memory status — verify entries exist and recall hit count is incrementing. The retriever scores entries against your query using keyword overlap and recency; if your query doesn't share tokens with any entry's title, tags, keywords, or content, nothing will be recalled. Try rephrasing or use /memory user / /memory project to inspect entries directly. Note that the stable block always includes user-scope entries and structural project types (decision, constraint, workflow, preference, profile) regardless of the query — only project_fact and note entries depend on the per-turn recall scoring (with the 3 most-recently-updated also surfaced unconditionally).

MCP Server Not Connecting: Run /mcp list to see status and error messages. Verify the command exists and is executable, or that the SSE URL is reachable. Check agentao.log for detailed connection errors.

Tool Execution Errors: Check file permissions, path correctness, and that shell commands are valid for your OS.

Etymology

Agentao = Agent + Tao (道)

道 (Tao/Dào) is a foundational concept in Chinese philosophy, representing the natural order that underlies all things. It carries three intertwined meanings:

Laws (法则) — the rules that constrain and shape behavior
Methods (方法) — the paths and techniques for accomplishing goals
Paths (路径) — the routes through which things flow and connect

In the context of this project, Tao captures what an Agent Harness should be: not just a raw capability engine, but a disciplined path through which intelligent agents can act safely, transparently, and purposefully. An agent without Tao is powerful but unpredictable. Agentao is the structure that makes that power trustworthy.

License

This project is open source. Feel free to use and modify as needed.

Acknowledgments

Built with OpenAI Python SDK
CLI interface powered by Rich
Input handling powered by prompt_toolkit
Optional enhanced web fetching via Crawl4AI
MCP support via Model Context Protocol SDK
MCP architecture inspired by Gemini CLI
Inspired by Claude Code

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

jin-bo

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.4.4

May 6, 2026

0.4.3

May 5, 2026

0.4.2

May 1, 2026

This version

0.4.0

May 1, 2026

0.3.4

May 1, 2026

0.3.3

May 1, 2026

0.3.1

May 1, 2026

0.3.0

Apr 29, 2026

0.2.16

Apr 28, 2026

0.2.15

Apr 27, 2026

0.2.14

Apr 26, 2026

0.2.13

Apr 24, 2026

0.2.13rc1 pre-release

Apr 24, 2026

0.2.12

Apr 22, 2026

0.2.11

Apr 19, 2026

0.2.10

Apr 15, 2026

0.2.9

Apr 12, 2026

0.2.8

Apr 11, 2026

0.2.8rc1 pre-release

Apr 11, 2026

0.2.7

Apr 10, 2026

0.2.7rc1 pre-release

Apr 10, 2026

0.2.6

Apr 9, 2026

0.2.5

Apr 7, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentao-0.4.0.tar.gz (893.7 kB view details)

Uploaded May 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentao-0.4.0-py3-none-any.whl (615.1 kB view details)

Uploaded May 1, 2026 Python 3

File details

Details for the file agentao-0.4.0.tar.gz.

File metadata

Download URL: agentao-0.4.0.tar.gz
Upload date: May 1, 2026
Size: 893.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agentao-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`809c93f3bc48279d094ed7b27c0592a53bb3a0a438069785c2fecb85b7570daa`
MD5	`68dbb25dbda6edb026814310852e0c25`
BLAKE2b-256	`1e720a0301f365e29505e4ffb2409e360c539cf0291ee2f86c5eda0d84c3321f`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agentao-0.4.0.tar.gz:

Publisher: publish.yml on jin-bo/agentao

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agentao-0.4.0.tar.gz
- Subject digest: 809c93f3bc48279d094ed7b27c0592a53bb3a0a438069785c2fecb85b7570daa
- Sigstore transparency entry: 1417659019
- Sigstore integration time: May 1, 2026
Source repository:
- Permalink: jin-bo/agentao@ad07cd10f3a4ddcbab6055baf8a181679d163c10
- Branch / Tag: refs/heads/main
- Owner: https://github.com/jin-bo
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ad07cd10f3a4ddcbab6055baf8a181679d163c10
- Trigger Event: workflow_dispatch

File details

Details for the file agentao-0.4.0-py3-none-any.whl.

File metadata

Download URL: agentao-0.4.0-py3-none-any.whl
Upload date: May 1, 2026
Size: 615.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agentao-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`244294055c582545ad78f925154930320b1ac8cfdb2e401eb82b8c706c21cca3`
MD5	`d35a3b7565feb66b2eeaf9d74d22d820`
BLAKE2b-256	`1f6a748b443f921ddcba24b2369a6ba3032c3e3bd5e93bc7a60ba5700f7c31de`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agentao-0.4.0-py3-none-any.whl:

Publisher: publish.yml on jin-bo/agentao

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agentao-0.4.0-py3-none-any.whl
- Subject digest: 244294055c582545ad78f925154930320b1ac8cfdb2e401eb82b8c706c21cca3
- Sigstore transparency entry: 1417659031
- Sigstore integration time: May 1, 2026
Source repository:
- Permalink: jin-bo/agentao@ad07cd10f3a4ddcbab6055baf8a181679d163c10
- Branch / Tag: refs/heads/main
- Owner: https://github.com/jin-bo
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ad07cd10f3a4ddcbab6055baf8a181679d163c10
- Trigger Event: workflow_dispatch

agentao 0.4.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Agentao (Agent + Tao)

Embed in 30 lines

CLI Quickstart

Start Here

Table of Contents

User Guide

Contributor Guide

Detailed Reference

First Commands

Why Agentao?

Feature Overview

Common Workflows

Documentation Map

Core Capabilities

🏛️ Autonomous Governance (自治治理)

🧠 Elastic Context Engine (弹性上下文引擎)

💾 SQLite Memory (持久记忆)

💡 Semantic Display Engine

✅ Session Task Tracking

🤖 SubAgent System

🔌 MCP (Model Context Protocol) Support

🧩 Plugin System

🪝 Hooks System

🎯 Dynamic Skills System

🛠️ Comprehensive Tools

📼 Session Replay

🛡️ macOS Sandbox (defense-in-depth)

🛰️ Headless Runtime

Design Principles

Installation

Prerequisites

Install

For contributors (source install)

Minimum Viable Configuration

Supported Python versions

Required environment variable

Default provider behavior

Minimal runnable example

Troubleshooting common startup failures

Configuration

MCP Server Configuration

Using with Different Providers

Usage

Starting the Agent

Non-Interactive (Print) Mode

Headless / SDK Use

ACP (Agent Client Protocol) Mode

ACP Client — Project-Local Server Management

Commands

Permission Modes (Safety Feature)

Plan Mode

Example Interactions

Project Instructions (AGENTAO.md)

Project Structure

Testing

Logging

Development

Adding a Tool

Adding an Agent

Adding a Skill

Troubleshooting

Etymology

License

Acknowledgments

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details