Open-source AI agent runtime for any LLM — production-grade coding agent with multi-layer memory, multi-agent orchestration, and defense-in-depth security

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

djfeu

These details have not been verified by PyPI

Project description

llmcode

Open-source AI agent runtime for any LLM
Production-grade coding agent with Claude Code-level architecture — your model, your hardware, zero vendor lock-in

Quick Start · Why llmcode · Features · Marketplace · Configuration · Architecture · Contributing

Python 3.11+ Tests MIT License

Why llmcode?

Most AI coding tools lock you into a single provider. llmcode doesn't.

Run the same agent experience with a free local model on your own GPU, or with any cloud API. Switch between them with one config change. No API key required for local models.

 ██╗      ██╗      ███╗   ███╗
 ██║      ██║      ████╗ ████║
 ██║      ██║      ██╔████╔██║
 ██║      ██║      ██║╚██╔╝██║
 ███████╗ ███████╗ ██║ ╚═╝ ██║
 ╚══════╝ ╚══════╝ ╚═╝     ╚═╝
  ██████╗  ██████╗  ██████╗  ███████╗
 ██╔════╝ ██╔═══██╗ ██╔══██╗ ██╔════╝
 ██║      ██║   ██║ ██║  ██║ █████╗
 ██║      ██║   ██║ ██║  ██║ ██╔══╝
 ╚██████╗ ╚██████╔╝ ██████╔╝ ███████╗
  ╚═════╝  ╚═════╝  ╚═════╝  ╚══════╝

Not just a CLI tool — a complete AI Agent Runtime with:

ReAct engine with 5-stage turn loop and streaming tool execution
7-layer error recovery that self-heals instead of crashing
5-layer memory system with governance, working, project, task, and summary memory
Multi-agent orchestration with coordinator pattern and inter-agent messaging
Defense-in-depth security with 21-point bash checks and sensitive file protection

Quick Start

pip install llmcode-cli

With a local model (zero cost):

mkdir -p ~/.llmcode
cat > ~/.llmcode/config.json << 'EOF'
{
  "model": "qwen3.5",
  "provider": {
    "base_url": "http://localhost:8000/v1"
  }
}
EOF

llmcode

With a cloud API:

cat > ~/.llmcode/config.json << 'EOF'
{
  "model": "gpt-4o",
  "provider": {
    "base_url": "https://api.openai.com/v1",
    "api_key_env": "OPENAI_API_KEY"
  }
}
EOF

llmcode

Modes

llmcode                       # Default: Fullscreen TUI (Python Textual)
llmcode --provider ollama     # Auto-detect Ollama + interactive model selector
llmcode --serve --port 8765   # Remote WebSocket server
llmcode --connect host:8765   # Connect to remote agent
llmcode --ssh user@host       # SSH tunnel + auto-connect
llmcode --replay <file>       # Replay a recorded session
llmcode --resume              # Resume from checkpoint

Optional Features

pip install llmcode-cli[voice]          # Voice input via STT
pip install llmcode-cli[computer-use]   # GUI automation
pip install llmcode-cli[ide]            # IDE integration
pip install llmcode-cli[telemetry]      # OpenTelemetry tracing

Features

Model Freedom

Provider	Examples	Cost
Local (vLLM)	Qwen 3.5, Llama, Mistral, DeepSeek	Free
Local (Ollama)	Any GGUF model	Free
Local (LM Studio)	Any supported model	Free
OpenAI	GPT-4o, GPT-4o-mini, o3	Pay-per-use
Anthropic	Claude Opus, Sonnet, Haiku	Pay-per-use
Google	Gemini 2.5 Pro, Gemini 2.5 Flash	Pay-per-use
xAI	Grok	Pay-per-use
DeepSeek	DeepSeek V3, R1	Pay-per-use

Model aliases — qwen, gpt, opus, sonnet resolve to full model paths
Model routing — different models for sub-agents, compaction, and fallback
Local models get unlimited token output — no artificial cap on localhost

Agent Runtime Engine

The core loop follows a 5-stage ReAct (Reason + Act) pattern:

Context preparation — compress history, load relevant memory, apply HIDA filtering
Streaming model call — send conversation + tools, stream response in real-time
Tool execution — read-only tools run concurrently during streaming; writes wait
Attachment collection — gather file changes, task state, memory updates
Continue or stop — loop back if tools were called, stop if model is done

Resilience features:

7-layer error recovery — API retry with exponential backoff, 529 overload handling (30/60/120s), native-to-XML tool fallback, reactive context compression, token limit auto-upgrade, context drain, model fallback after 3 consecutive failures
Speculative execution — writes pre-execute in a tmpdir overlay before user confirms; confirm copies back, deny discards
4-level context compression — snip (truncate tool results), microcompact (deduplicate reads), autocompact (AI summary), reactive (emergency on 413)
Cache-aware compression — preferentially removes non-API-cached messages to preserve cache hits
3-tier prompt cache — global/project/session scope boundaries for optimal API cache utilization
HIDA dynamic loading — classifies input into 10 task types, loads only relevant tools/memory/governance rules

Tools

Built-in tools with smart permission classification:

Category	Tools
File I/O	read_file, write_file, edit_file (with fuzzy quote matching + mtime conflict detection)
Search	glob_search, grep_search, tool_search (deferred tool discovery)
Execution	bash (21-point security), agent (sub-agents)
Git	git_status, git_diff, git_log, git_commit, git_push, git_stash, git_branch
Notebook	notebook_read, notebook_edit (Jupyter .ipynb)
Computer Use	screenshot, mouse_click, keyboard_type, key_press, scroll, mouse_drag
Task Lifecycle	task_plan, task_verify, task_close
Scheduling	cron_create, cron_list, cron_delete
IDE	ide_open, ide_diagnostics, ide_selection
Swarm	swarm_create, swarm_list, swarm_message, swarm_delete, coordinate
Memory	LSP, memory tools

When tool count exceeds 20, non-core tools are deferred and discoverable via tool_search.

Multi-Agent Collaboration

/swarm create coder "Implement the login API"
/swarm create tester "Write tests for the login API"
/swarm create reviewer "Review the login implementation"
/swarm coordinate "Build a complete user auth system"

Coordinator auto-decomposes complex tasks into subtasks and dispatches to workers
tmux backend — each agent in its own terminal pane (subprocess fallback for non-tmux)
Mailbox — file-based JSONL message passing between agents
Shared memory — all agents access the same project memory with file locking
Built-in roles — coder, reviewer, researcher, tester, or define custom roles

Security

21-point Bash security:

Injection detection, newline attack prevention, pipe chain limits, interpreter REPL blacklist, environment variable leak protection, network access control, file permission change detection, system package operation alerts, redirect overwrite detection, credential path protection, background execution detection, recursive operation warnings, multi-command chain limits, and Zsh dangerous builtin blocking.

File protection: Sensitive files (.env, SSH keys, credentials.*, *.pem) are blocked on write and warned on read.

Sandbox detection: Auto-detects Docker/container environments and restricts paths.

Permission system: 5 modes (read_only → auto_accept) with allow/deny lists, shadowed rule detection, and input-aware classification (ls auto-approved, rm -rf needs confirmation).

Memory System

Layer	Scope	Lifetime	Purpose
L0 Governance	Project	Permanent	Rules from CLAUDE.md + .llmcode/rules/ — always loaded
L1 Working	Session	Ephemeral	In-memory scratch space for current task
L2 Project	Project	Long-term	DreamTask-consolidated knowledge with tag-based queries
L3 Task	Cross-session	Until done	PLAN/DO/VERIFY/CLOSE state machine persisted as JSON
L4 Summary	Per-session	Long-term	Conversation summaries for future reference

DreamTask: On session exit, automatically consolidates conversation into structured long-term memory — files modified, decisions made, patterns learned.

Checkpoint recovery: Auto-saves every 60 seconds. Resume with --resume or /checkpoint resume.

Task Lifecycle

PLAN --> DO --> VERIFY --> CLOSE --> DONE
                  |
           [auto checks]
            pass --> CLOSE
            fail --> diagnostics
                     |-- continue (minor fix)
                     |-- replan (redo PLAN)
                     |-- escalate (ask user)

VERIFY runs automated checks: pytest, ruff, file existence — then LLM judges
Cross-session: incomplete tasks persist and resume in the next session
CLOSE writes summaries to L3 task memory and L2 project memory

Terminal UI

Fullscreen TUI (default) — Python Textual, no Node.js required, Claude Code-style UI
- Welcome banner, markdown rendering, syntax-highlighted code blocks
- Slash command autocomplete dropdown with Tab/arrow navigation
- Inline [image] markers with Cmd+V paste support
- Interactive marketplace browser for skills, plugins, and MCP servers
- Tabbed /help modal (general / commands / custom-commands)
- ToolBlock diff view with colored +/- lines and line numbers
- Spinner with orange→red color transition on long operations
- Permission prompts with single-key y/n/a
- Cursor movement (←→, Home/End) in input bar
Vim mode — full motions (hjkl, w/b/e, 0/$, gg/G, f/F/t/T), operators (d/c/y), text objects (iw, i", i()
Diff visualization — colored inline diffs on every file change
Search — /search or Ctrl+F with match highlighting
OSC8 hyperlinks — clickable URLs in supporting terminals
Voice input — hold-to-talk STT (Whisper, Google, Anthropic backends)
Extended thinking — collapsible thinking panel with adaptive/enabled/disabled modes

Hook System

6 event categories, 24 events, glob pattern matching:

Category	Events
tool	pre_tool_use, post_tool_use, tool_error, tool_denied
command	pre_command, post_command, command_error
prompt	prompt_submit, prompt_compile, prompt_cache_hit, prompt_cache_miss
agent	agent_spawn, agent_complete, agent_error, agent_message
session	session_start, session_end, session_save, session_compact, session_dream
http	http_request, http_response, http_error, http_retry, http_fallback

{
  "hooks": [
    {"event": "post_tool_use", "tool_pattern": "write_file|edit_file", "command": "ruff format {path}"},
    {"event": "session.*", "command": "echo $HOOK_EVENT >> ~/agent.log", "on_error": "ignore"}
  ]
}

IDE Integration

llmcode runs a WebSocket JSON-RPC server that any IDE can connect to:

Open files at specific lines in your editor
Read diagnostics (lint errors, type errors) from the IDE
Get selection — the agent can read your currently selected code
Auto-detection — scans for running VSCode, JetBrains, Neovim, Sublime

Observability

OpenTelemetry — spans for turns and tool executions with LLM semantic conventions
VCR recording — structured JSONL event streams for debugging and replay
Cost tracking — per-model pricing with cache-aware calculations and budget enforcement
Version check — notifies on startup if a newer release is available

Marketplace

Compatible with Claude Code's plugin ecosystem — skills, plugins, and MCP servers work out of the box.

Skills — `/skill`

 > brainstorming          (installed)
   test-driven-development (installed)
   code-review-fix         [ClawHub]
   security-check          [npm]

Sources: ClawHub.ai, npm, local plugins

Plugins — `/plugin`

/plugin install obra/superpowers

Sources: Official (Claude Code), ClawHub, npm, GitHub

MCP Servers — `/mcp`

{
  "mcpServers": {
    "github": {
      "command": "npx",
      "args": ["-y", "@modelcontextprotocol/server-github"],
      "env": {"GITHUB_TOKEN": "ghp_xxx"}
    }
  }
}

Supports stdio, HTTP, SSE, and WebSocket transports with health monitoring and auto-reconnection.

Configuration

Config Locations (precedence low -> high)

~/.llmcode/config.json — User global
.llmcode/config.json — Project
.llmcode/config.local.json — Local (gitignored)
CLI flags / env vars — Highest

Example Config

{
  "model": "qwen3.5-122b",
  "model_aliases": {
    "qwen": "/models/Qwen3.5-122B-A10B-int4-AutoRound",
    "fast": "qwen3.5-7b",
    "gpt": "gpt-4o"
  },
  "provider": {
    "base_url": "http://localhost:8000/v1",
    "api_key_env": "LLM_API_KEY",
    "timeout": 120
  },
  "permissions": {
    "mode": "prompt",
    "allow_tools": ["read_file", "glob_search", "grep_search"]
  },
  "model_routing": {
    "sub_agent": "qwen3.5-32b",
    "compaction": "qwen3.5-7b",
    "fallback": "qwen3.5-7b"
  },
  "max_budget_usd": 5.00,
  "thinking": { "mode": "adaptive", "budget_tokens": 10000 },
  "dream": { "enabled": true, "min_turns": 3 },
  "hida": { "enabled": true },
  "hooks": [
    {"event": "post_tool_use", "tool_pattern": "write_*|edit_*", "command": "ruff format {path}"}
  ],
  "mcpServers": {}
}

Commands

Command	Description
`/help`	Show all commands
`/model <name>`	Switch model
`/config`	View/set runtime configuration
`/session`	Session management
`/skill`	Browse & install skills
`/plugin`	Browse & install plugins
`/mcp`	Browse & install MCP servers
`/memory`	View project memory
`/memory consolidate`	Run DreamTask now
`/memory history`	View consolidation history
`/task`	Task lifecycle (new/verify/close)
`/swarm`	Multi-agent (create/coordinate/stop)
`/search <query>`	Search conversation history
`/thinking`	Toggle thinking mode
`/vim`	Toggle vim keybindings
`/voice`	Toggle voice input
`/image`	Paste/load an image
`/cron`	Scheduled tasks
`/vcr`	Session recording
`/checkpoint`	Session checkpoints
`/ide`	IDE connection status
`/lsp`	Language Server Protocol status
`/index`	Codebase indexing
`/hida`	HIDA classification info
`/cd <path>`	Change working directory
`/undo`	Undo last file change
`/cancel`	Cancel running operation
`/cost`	Token usage + cost
`/budget <n>`	Set token budget
`/clear`	Clear conversation
`/exit`, `/quit`	Quit

Architecture

llm_code/               21,000 lines Python
├── api/                Provider abstraction (OpenAI-compat + Anthropic)
├── cli/                CLI entry point + Textual TUI launcher
├── runtime/            ReAct engine, memory layers, compression, hooks,
│                       permissions, checkpoint, dream, VCR, speculative
│                       execution, telemetry, file protection, sandbox
├── tools/              30+ tools with deferred loading + security
├── task/               PLAN/DO/VERIFY/CLOSE state machine
├── hida/               Dynamic context loading (10-type classifier)
├── mcp/                MCP client (4 transports) + OAuth + health checks
├── marketplace/        Plugin system + ClawHub integration
├── lsp/                Language Server Protocol client
├── remote/             WebSocket server/client + SSH proxy
├── vim/                Vim engine (motions, operators, text objects)
├── voice/              STT (Whisper, Google, Anthropic backends)
├── computer_use/       GUI automation (screenshot + input control)
├── cron/               Task scheduler (cron parser + async poller)
├── ide/                IDE bridge (WebSocket JSON-RPC server)
├── swarm/              Multi-agent (coordinator, tmux/subprocess, mailbox)
├── utils/              Notebook, diff, hyperlinks, search, text normalize
tests/                  2,861 tests across 170+ test files

Contributing

git clone https://github.com/DJFeu/llmcode
cd llmcode
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"
pytest                  # 2,861 tests
ruff check llm_code/    # lint

Requirements

Python 3.11+
An LLM server (vLLM, Ollama, LM Studio, or cloud API)

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

djfeu

These details have not been verified by PyPI

Release history Release notifications | RSS feed

2.0.0

Apr 13, 2026

1.23.1

Apr 11, 2026

1.23.0

Apr 11, 2026

1.22.1

Apr 11, 2026

1.22.0

Apr 11, 2026

1.21.0

Apr 11, 2026

1.20.0

Apr 11, 2026

1.19.0

Apr 11, 2026

1.18.2

Apr 10, 2026

1.18.1

Apr 10, 2026

1.18.0

Apr 10, 2026

1.17.0

Apr 10, 2026

1.16.1

Apr 10, 2026

1.16.0

Apr 10, 2026

1.15.1

Apr 10, 2026

1.15.0

Apr 9, 2026

1.14.0

Apr 9, 2026

1.12.0

Apr 8, 2026

1.11.0

Apr 8, 2026

1.10.0

Apr 8, 2026

1.9.0

Apr 7, 2026

1.8.0

Apr 7, 2026

1.7.0

Apr 7, 2026

1.5.0

Apr 7, 2026

1.4.0

Apr 7, 2026

1.3.0

Apr 7, 2026

1.2.0

Apr 7, 2026

1.1.1

Apr 6, 2026

1.1.0

Apr 6, 2026

1.0.5

Apr 6, 2026

1.0.3

Apr 6, 2026

1.0.2

Apr 6, 2026

This version

1.0.1

Apr 6, 2026

1.0.0

Apr 6, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llmcode_cli-1.0.1.tar.gz (447.9 kB view details)

Uploaded Apr 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llmcode_cli-1.0.1-py3-none-any.whl (332.8 kB view details)

Uploaded Apr 6, 2026 Python 3

File details

Details for the file llmcode_cli-1.0.1.tar.gz.

File metadata

Download URL: llmcode_cli-1.0.1.tar.gz
Upload date: Apr 6, 2026
Size: 447.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for llmcode_cli-1.0.1.tar.gz
Algorithm	Hash digest
SHA256	`c827cafb818774a69815281e886913b8f19676f71725c8d6577c3d2b5dcd4451`
MD5	`8b858abdffd21274f2a3953699f2f89c`
BLAKE2b-256	`4aa9178d14b3d46e2c621bbc696c4843ae3306c0804172fdb7ef340d5f1dde2c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llmcode_cli-1.0.1.tar.gz:

Publisher: publish.yml on DJFeu/llmcode

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llmcode_cli-1.0.1.tar.gz
- Subject digest: c827cafb818774a69815281e886913b8f19676f71725c8d6577c3d2b5dcd4451
- Sigstore transparency entry: 1242771820
- Sigstore integration time: Apr 6, 2026
Source repository:
- Permalink: DJFeu/llmcode@7a6c83a152e2362c5a7acd9a7e726967da3742f8
- Branch / Tag: refs/tags/v1.0.1
- Owner: https://github.com/DJFeu
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@7a6c83a152e2362c5a7acd9a7e726967da3742f8
- Trigger Event: release

File details

Details for the file llmcode_cli-1.0.1-py3-none-any.whl.

File metadata

Download URL: llmcode_cli-1.0.1-py3-none-any.whl
Upload date: Apr 6, 2026
Size: 332.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for llmcode_cli-1.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`4b0db1141bb4c3f823e54e0f59f3affc794f294c01c27d1dbcfd16076798e183`
MD5	`f0cb01445e033cbb666235205ad378d7`
BLAKE2b-256	`c87aaea7cbd127520169584fde962043b18e63905cebe311caeb2550e52a3bd4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for llmcode_cli-1.0.1-py3-none-any.whl:

Publisher: publish.yml on DJFeu/llmcode

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: llmcode_cli-1.0.1-py3-none-any.whl
- Subject digest: 4b0db1141bb4c3f823e54e0f59f3affc794f294c01c27d1dbcfd16076798e183
- Sigstore transparency entry: 1242771822
- Sigstore integration time: Apr 6, 2026
Source repository:
- Permalink: DJFeu/llmcode@7a6c83a152e2362c5a7acd9a7e726967da3742f8
- Branch / Tag: refs/tags/v1.0.1
- Owner: https://github.com/DJFeu
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@7a6c83a152e2362c5a7acd9a7e726967da3742f8
- Trigger Event: release

llmcode-cli 1.0.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

llmcode

Why llmcode?

Quick Start

Modes

Optional Features

Features

Model Freedom

Agent Runtime Engine

Tools

Multi-Agent Collaboration

Security

Memory System

Task Lifecycle

Terminal UI

Hook System

IDE Integration

Observability

Marketplace

Skills — /skill

Plugins — /plugin

MCP Servers — /mcp

Configuration

Config Locations (precedence low -> high)

Example Config

Commands

Architecture

Contributing

Requirements

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Skills — `/skill`

Plugins — `/plugin`

MCP Servers — `/mcp`