A powerful console-based AI agent

These details have not been verified by PyPI

Project description

Captain Claw

An open-source AI agent that runs locally, supports multiple LLM providers, and gets work done — coding, research, automation, document processing, orchestration — through persistent sessions with built-in safety guards.

Feature Snapshot

Capability	What it does
Multi-model routing	Mix GPT, Claude, Gemini, and Ollama in one CLI
Per-session model selection	Keep one session on Claude, another on GPT, another on Ollama
Persistent multi-session workflows	Resume any session exactly where you left off
Built-in safety guards	Input, output, and script/tool checks before anything runs
22 built-in tools	Shell, files, web fetch/get/search, docs, email, TTS, Google Drive/Calendar/Gmail, todo, contacts, scripts, APIs, datastore, deep memory
Skills system	OpenClaw-compatible skills with auto-discovery and GitHub install
Orchestrator / DAG mode	Decompose complex tasks into parallel multi-session execution
Memory / RAG	Hybrid vector + text retrieval across workspace and sessions
Web UI	Chat, monitor pane, instruction editor, command palette, datastore browser, deep memory dashboard
Remote integrations	Telegram, Slack, Discord with secure pairing
Cross-session to-do memory	Persistent task list shared across sessions with auto-capture
Cross-session script memory	Persistent script/file tracking with auto-capture from write tool
Cross-session API memory	Persistent API endpoint tracking with auto-capture from web_fetch
Datastore	SQLite-backed relational tables managed by the agent, with protection rules, import/export, and web dashboard
Cron scheduling	Interval, daily, and weekly tasks inside the runtime
OpenAI-compatible API	`POST /v1/chat/completions` proxy with agent pool

Quick Start

1. Install

Requires Python 3.11 or higher.

python -m venv venv
source venv/bin/activate
pip install captain-claw

2. Set an API key

export OPENAI_API_KEY="your-openai-key"
export ANTHROPIC_API_KEY="your-anthropic-key"
export GOOGLE_API_KEY="your-google-key"
export GEMINI_API_KEY="your-gemini-key"

Use only the keys you need. For Ollama, no key is required — just set provider: ollama in config.yaml.

3. Launch

captain-claw-web   # Web UI (default: http://127.0.0.1:23080)
captain-claw       # Interactive terminal
captain-claw --tui # Terminal UI

First run starts interactive onboarding automatically. The web UI redirects to /onboarding on first launch too. To re-run it later: captain-claw --onboarding.

4. Try it

> Investigate failing integration tests and propose a fix.

/models                          # see available models
/session model claude-sonnet     # switch this session to Claude

/new release-notes               # create a second session
/session model chatgpt-fast      # use GPT for this one

> Draft release notes from the previous session updates.

/session run #1 summarize current blockers   # run a prompt in session #1

Each session keeps its own model, context, and history.

5. Open the Web UI

Run captain-claw-web to start the web UI at http://127.0.0.1:23080. Use Ctrl+K for the command palette, Ctrl+B for the sidebar, and edit instruction files live in the Instructions tab.

For the terminal, use captain-claw (interactive) or captain-claw --tui (TUI mode).

How It Works

Sessions

Sessions are first-class. Create named sessions for separate projects, switch instantly, and persist everything.

/new incident-hotfix
/session model claude-sonnet
/session protect on                            # prevent accidental /clear
/session procreate #1 #2 "merged context"      # merge two sessions
/session run #2 summarize current blockers     # run prompt in another session
/session export all                            # export chat + monitor history

Tools

Captain Claw ships with 22 built-in tools. The agent picks the right tool for each task automatically.

Tool	What it does
`shell`	Execute terminal commands
`read` / `write` / `glob`	File operations and pattern matching
`web_fetch`	Fetch and extract readable text from web pages (always text mode)
`web_get`	Fetch raw HTML source for scraping and DOM inspection
`web_search`	Search the web via Brave Search API
`pdf_extract`	Extract PDF content to markdown
`docx_extract`	Extract Word documents to markdown
`xlsx_extract`	Extract Excel sheets to markdown tables
`pptx_extract`	Extract PowerPoint slides to markdown
`pocket_tts`	Generate speech audio (MP3) locally
`send_mail`	Send email via SMTP, Mailgun, or SendGrid
`google_drive`	List, search, read, upload, and manage Google Drive files
`google_calendar`	Create, list, update, and delete Google Calendar events
`google_mail`	Read-only Gmail access — search, read messages and threads
`todo`	Persistent cross-session to-do list with auto-capture
`contacts`	Persistent cross-session address book with auto-capture
`scripts`	Persistent cross-session script/file memory with auto-capture
`apis`	Persistent cross-session API memory with auto-capture
`datastore`	Manage relational data tables with CRUD, import/export, raw SQL, and protection rules
`typesense`	Index, search, and manage documents in deep memory (Typesense)

See USAGE.md for full parameters and configuration.

Guards

Three built-in guard types protect against risky operations:

guards:
  input:
    enabled: true
    level: "ask_for_approval"     # or "stop_suspicious"
  output:
    enabled: true
    level: "stop_suspicious"
  script_tool:
    enabled: true
    level: "ask_for_approval"

Guards run before LLM requests (input), after model responses (output), and before any command or tool execution (script_tool). See USAGE.md for details.

Configuration at a Glance

Captain Claw is YAML-driven with environment variable overrides.

model:
  provider: "openai"
  model: "gpt-4o-mini"
  allowed:
    - id: "claude-sonnet"
      provider: "anthropic"
      model: "claude-sonnet-4-20250514"

tools:
  enabled: ["shell", "read", "write", "glob", "web_fetch", "web_search",
            "pdf_extract", "docx_extract", "xlsx_extract", "pptx_extract",
            "pocket_tts", "send_mail", "google_drive", "google_calendar",
            "google_mail", "todo", "contacts", "scripts", "apis",
            "datastore"]

web:
  enabled: true
  port: 23080

Load precedence: ./config.yaml > ~/.captain-claw/config.yaml > environment variables > .env file > defaults.

For the full configuration reference (23 sections, every field), see USAGE.md.

Advanced Features

Each of these is documented in detail in USAGE.md.

Orchestrator / DAG mode — /orchestrate decomposes a complex request into a task DAG and runs tasks in parallel across separate sessions with real-time progress monitoring. Also available headless via captain-claw-orchestrate.
Skills system — OpenClaw-compatible SKILL.md files. Auto-discovered from workspace, managed, and plugin directories. Install from GitHub with /skill install <url>.
Memory / RAG — Hybrid vector + text retrieval. Indexes workspace files and session messages. Configurable embedding providers (OpenAI, Ollama, local hash fallback).
Cross-session to-do memory — Persistent task list shared across sessions. Auto-capture from natural language, context injection to nudge the agent, and full /todo command support across CLI, Web UI, Telegram, Slack, and Discord.
Cross-session address book — Persistent contact memory that tracks people across sessions. Auto-captures from conversation and email, auto-computes importance from mention frequency, and injects relevant contact context on demand.
Cross-session script memory — Persistent tracking of scripts and files the agent creates. Auto-captures from the write tool when executable extensions are detected. Stores path + metadata (no file content in DB). On-demand context injection when script names appear in conversation.
Cross-session API memory — Persistent tracking of external APIs the agent interacts with. Auto-captures from web_fetch and web_get when API-like URLs are detected. Stores credentials, endpoints, and accumulated context. On-demand context injection when API names or URLs appear in conversation.
Cron scheduling — Pseudo-cron within the runtime. Schedule prompts, scripts, or tools at intervals, daily, or weekly. Guards remain active for every cron execution.
Execution queue — Five queue modes (steer, followup, collect, interrupt, queue) control how follow-up messages are handled during agent execution.
Remote integrations — Connect Telegram, Slack, or Discord bots. Unknown users get a pairing token; the operator approves locally with /approve user.
OpenAI-compatible API — POST /v1/chat/completions endpoint proxied through the Captain Claw agent pool. Streaming supported.
Google Drive + OAuth — Connect your Google account for Drive file operations (list, search, read, upload, create, update) and Vertex AI model access.
Google Calendar — Create, list, update, and delete calendar events. Uses the same Google OAuth connection as Drive.
Google Mail (Gmail) — Read-only Gmail access — search, read messages, browse threads and labels. No send/modify/delete scope required.
Datastore — SQLite-backed relational data tables managed entirely by the agent. 19 tool actions cover schema management, CRUD operations, raw SELECT queries, CSV/XLSX import and export, and a four-level protection system (table, column, row, cell). Includes a web dashboard for browsing tables, editing rows, running SQL, and uploading files.
Deep Memory (Typesense) — Long-term searchable archive backed by Typesense. Indexes processed items from the scale loop, web fetches, and manual input. Hybrid keyword + vector search. Separate from the SQLite-backed semantic memory. Includes a web dashboard for browsing, searching, and managing indexed documents.
Send mail — SMTP, Mailgun, or SendGrid. Supports attachments up to 25 MB.
Document extraction — PDF, DOCX, XLSX, PPTX converted to markdown for agent consumption.
Context compaction — Auto-compacts long sessions at configurable thresholds. Manual compaction with /compact.
Session export — Export chat, monitor, pipeline trace, or pipeline summary to files.

Development

pip install -e ".[dev]"
pytest
ruff check captain_claw/

Architecture

Path	Role
`captain_claw/agent.py`	Main orchestration logic
`captain_claw/llm/`	Provider abstraction (OpenAI, Anthropic, Gemini, Ollama)
`captain_claw/tools/`	Tool registry and 22 tool implementations
`captain_claw/session/`	SQLite-backed session persistence
`captain_claw/skills.py`	Skill discovery, loading, and invocation
`captain_claw/session_orchestrator.py`	Parallel multi-session DAG orchestrator
`captain_claw/semantic_memory.py`	Hybrid vector + text retrieval (RAG)
`captain_claw/datastore.py`	SQLite-backed relational datastore
`captain_claw/deep_memory.py`	Typesense-backed long-term archive
`captain_claw/google_oauth_manager.py`	Google OAuth token management
`captain_claw/cli.py`	Terminal UI
`captain_claw/web/`	Web server (WebSocket + REST + static)
`captain_claw/orchestrator_cli.py`	Headless orchestrator CLI
`captain_claw/config.py`	Configuration and env overrides
`instructions/`	Externalized prompt and instruction templates

FAQ

Is Captain Claw only for coding? No. It handles coding, ops automation, web research, document processing, email, and multi-session orchestration.

Can I use local models only? Yes. Set provider to ollama and run fully local.

Can I run different models at the same time? Yes. Model selection is per session. Different sessions can use different providers and models simultaneously.

Do I need guards enabled? No. Guards are off by default. Enable them when you want stricter safety behavior.

Is there a web interface? Yes — it's the default. Run captain-claw and open http://127.0.0.1:23080. Same agent, sessions, tools, and guardrails as the terminal. Use --tui for the terminal UI.

Where is the full reference? See USAGE.md for comprehensive documentation of every command, tool, config option, and feature.

Get Started

python -m venv venv && source venv/bin/activate
pip install captain-claw
captain-claw-web

If Captain Claw is useful to you, give the repo a star to help others find it.

Found a bug or have a feature idea? Open an issue. Contributions welcome.

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.4.24

May 6, 2026

0.4.22

Apr 28, 2026

0.4.21

Apr 9, 2026

0.4.20

Apr 8, 2026

0.4.19

Apr 7, 2026

0.4.18

Apr 7, 2026

0.4.17

Apr 6, 2026

0.4.16

Apr 5, 2026

0.4.15

Apr 5, 2026

0.4.14

Apr 4, 2026

0.4.13

Apr 4, 2026

0.4.12.5

Apr 3, 2026

0.4.12

Apr 3, 2026

0.4.11

Apr 3, 2026

0.4.10

Apr 3, 2026

0.4.9.3

Apr 2, 2026

0.4.9.2

Apr 2, 2026

0.4.9.1

Apr 2, 2026

0.4.9

Apr 2, 2026

0.4.8.1

Apr 2, 2026

0.4.8

Mar 31, 2026

0.4.7

Mar 29, 2026

0.4.6

Mar 27, 2026

0.4.5

Mar 22, 2026

0.4.4

Mar 21, 2026

0.4.3

Mar 19, 2026

0.4.2

Mar 16, 2026

0.4.1

Mar 16, 2026

0.4.0.1

Mar 15, 2026

0.4.0

Mar 15, 2026

0.3.5

Mar 13, 2026

0.3.4.1

Mar 12, 2026

0.3.4

Mar 12, 2026

0.3.3.7

Mar 8, 2026

0.3.3.6

Mar 8, 2026

0.3.3.5

Mar 7, 2026

0.3.3.4

Mar 6, 2026

0.3.3.3

Mar 6, 2026

0.3.3.2

Mar 5, 2026

0.3.3.1

Mar 5, 2026

0.3.3

Mar 4, 2026

0.3.2

Mar 3, 2026

0.3.1.9

Mar 3, 2026

0.3.1.8

Mar 3, 2026

0.3.1.7

Mar 3, 2026

0.3.1.6

Mar 3, 2026

0.3.1.5

Mar 3, 2026

0.3.1.4

Mar 3, 2026

0.3.1.2

Mar 2, 2026

0.3.1.1

Mar 2, 2026

0.3.1

Mar 2, 2026

0.3.0.4

Mar 2, 2026

0.3.0.3

Mar 2, 2026

0.3.0.2

Mar 2, 2026

0.3.0.1

Mar 2, 2026

0.3.0

Mar 1, 2026

0.2.7.1

Mar 1, 2026

0.2.7

Mar 1, 2026

0.2.6.3.6

Mar 1, 2026

0.2.6.3.5

Mar 1, 2026

0.2.6.3.4

Mar 1, 2026

0.2.6.3.3

Mar 1, 2026

0.2.6.3.2

Mar 1, 2026

0.2.6.3.1

Mar 1, 2026

0.2.6.3

Feb 28, 2026

0.2.6.2

Feb 28, 2026

0.2.6.1

Feb 28, 2026

0.2.6

Feb 28, 2026

This version

0.2.5

Feb 28, 2026

0.2.1

Feb 28, 2026

0.2.0

Feb 28, 2026

0.1.0

Feb 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

captain_claw-0.2.5.tar.gz (598.6 kB view details)

Uploaded Feb 28, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

captain_claw-0.2.5-py3-none-any.whl (659.9 kB view details)

Uploaded Feb 28, 2026 Python 3

File details

Details for the file captain_claw-0.2.5.tar.gz.

File metadata

Download URL: captain_claw-0.2.5.tar.gz
Upload date: Feb 28, 2026
Size: 598.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for captain_claw-0.2.5.tar.gz
Algorithm	Hash digest
SHA256	`114ab787cfb143cd2010c6819f43f84f3d6aed365a89fb102760bdb6f04fa7cd`
MD5	`694d7218f81cd42d68fbe8a308944870`
BLAKE2b-256	`68ce41419c2ba2a2d5be43b56d5de4b2efe6b4cc9322170ceeafab64663cf3f4`

See more details on using hashes here.

File details

Details for the file captain_claw-0.2.5-py3-none-any.whl.

File metadata

Download URL: captain_claw-0.2.5-py3-none-any.whl
Upload date: Feb 28, 2026
Size: 659.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for captain_claw-0.2.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`bfcd3b035654692e4c766e78866762319e89118150dbad6085c869403a06c73e`
MD5	`89c89bcc788782a83536630666af0fdf`
BLAKE2b-256	`5c46983e762a279dc049c35c1ff20f7de5a3c06f8b2b78110e6870918406c8ec`

See more details on using hashes here.

captain-claw 0.2.5

Navigation

Verified details

Maintainers

Meta

Unverified details

Meta

Classifiers

Project description

Captain Claw

Feature Snapshot

Quick Start

1. Install

2. Set an API key

3. Launch

4. Try it

5. Open the Web UI

How It Works

Sessions

Tools

Guards

Configuration at a Glance

Advanced Features

Development

Architecture

FAQ

Get Started

License

Project details

Verified details

Maintainers

Meta

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes