AI agent framework with Jupyter sandbox, data analysis, MCP tools, ACP protocol, multi-provider LLM, and standalone runtime packaging
Project description
Box Agent
A general-purpose AI agent with sandboxed code execution, sub-agent parallelism, and multi-provider LLM support.
English | 中文
Get started in 30 seconds:
uv tool install box-agent # or: pip install box-agent (Python 3.10+)
box-agent setup # interactive config wizard
box-agent # start chatting
Or run a one-shot task:
box-agent --task "Analyze sales.csv — show top 10 products by revenue with a bar chart"
Why Box Agent?
Most agent frameworks are either too simple (no sandbox, no tools) or too complex (massive dependencies, rigid architecture). Box Agent hits the sweet spot:
| Feature | Box Agent | Open Interpreter | Aider |
|---|---|---|---|
| Sandboxed code execution | Jupyter kernel in isolated venv | Runs in host Python | N/A |
| Sub-agent parallelism | Multiple sub-agents run concurrently | No | No |
| Multi-provider LLM | Anthropic, OpenAI, DeepSeek, SiliconFlow, any API | OpenAI + a few others | OpenAI + Anthropic |
| MCP tool integration | Native | No | No |
| ACP protocol (embed in apps) | Full support | No | No |
| Standalone binary | PyInstaller runtime, no Python needed | No | No |
| Context compression | 2-layer automatic (micro-compact + LLM summary) | Manual | Git-based |
Key Features
Sub-Agent Parallelism
Delegate tasks to isolated sub-agents that run concurrently. Each sub-agent has its own context — only the summary comes back. Perfect for multi-file analysis.
You: "Analyze data1.csv, data2.csv, and data3.csv separately, then give me a combined summary"
┌─ Sub-Agent 1 ──────┐ ┌─ Sub-Agent 2 ──────┐ ┌─ Sub-Agent 3 ──────┐
│ Read data1.csv │ │ Read data2.csv │ │ Read data3.csv │
│ Run statistics │ │ Run statistics │ │ Run statistics │
│ Generate charts │ │ Generate charts │ │ Generate charts │
│ → Summary: ... │ │ → Summary: ... │ │ → Summary: ... │
└─────────────────────┘ └─────────────────────┘ └─────────────────────┘
↓ parallel ↓
┌─ Parent Agent ──────────┐
│ Combines 3 summaries │
│ Produces final report │
└─────────────────────────┘
Sandboxed Code Execution
Python runs in an isolated Jupyter kernel with pre-installed data science packages (pandas, numpy, matplotlib, scikit-learn, openpyxl, xlrd). Generated files (charts, CSVs, PDFs) are automatically detected and surfaced as structured artifacts.
Multi-Provider LLM
One config, any provider:
# Anthropic
api_base: "https://api.anthropic.com"
provider: "anthropic"
model: "claude-sonnet-4-20250514"
# DeepSeek
api_base: "https://api.deepseek.com"
provider: "openai"
model: "deepseek-chat"
# Any OpenAI-compatible endpoint
api_base: "https://your-api.example.com/v1"
provider: "openai"
model: "your-model"
2-Layer Context Compression
- Layer 1 — Micro-compact: Every step, old tool results (3+ turns back) are replaced with short placeholders. Zero cost, no LLM call.
- Layer 2 — Auto-summary: When tokens exceed the threshold (default 80k), an LLM call summarizes the conversation. Original data is preserved in logs.
More
- MCP Tools: Connect to any MCP server — web search, knowledge graphs, databases
- Claude Skills: 11 built-in skills for documents (DOCX, PDF, PPTX, XLSX), canvas design, web app testing, and more
- ACP Protocol: Embed Box Agent in Electron apps, Zed Editor, or any ACP-compatible host via JSON-RPC over stdio
- Standalone Runtime: PyInstaller binary bundles Python + all dependencies. No external Python needed — download and run
- Cross-session Memory: Persistent memory lets the agent retain key information across conversations
- Safety Layer: Dangerous command detection, workspace scope control, auto-backup before file modifications
- Task Tracking: Built-in todo tool for multi-step task decomposition and progress tracking
Demos
Task Execution
The agent creates a webpage and opens it in the browser.
Claude Skill — PDF Generation
The agent uses a skill to create a professional document.
Web Search via MCP
The agent searches the web and summarizes results.
Installation
Requires Python 3.10+. If your system Python is older (e.g. 3.9), use
uv tool install— it manages Python automatically.
Quick Start (uv, recommended)
uv handles Python version management for you — no need to upgrade your system Python:
# Install uv (if not already)
curl -LsSf https://astral.sh/uv/install.sh | sh
# Install box-agent (auto-downloads Python 3.10+ if needed)
uv tool install box-agent
box-agent setup # interactive config wizard
box-agent # start chatting
# Upgrade later
uv tool upgrade box-agent
Quick Start (pip)
If you already have Python 3.10+:
pip install box-agent
box-agent setup
box-agent
From Source
git clone https://github.com/Raccoon-Office/Box-Agent.git
cd Box-Agent
uv sync
git submodule update --init --recursive # optional: load skills
uv run python -m box_agent.cli
Configuration
After running box-agent setup, your config lives at ~/.box-agent/config/config.yaml:
api_key: "your-api-key"
api_base: "https://api.anthropic.com"
model: "claude-sonnet-4-20250514"
provider: "anthropic" # "anthropic" or "openai"
max_steps: 50
box-agent config # show current config
box-agent config --edit # open in editor
box-agent doctor # check environment & API connectivity
CLI Usage
# Interactive mode
box-agent
box-agent --workspace /path/to/project
box-agent --sandbox # enable Jupyter sandbox
# Non-interactive (CI/CD, scripts)
box-agent --task "analyze data.csv and create a report"
# Subcommands
box-agent setup # config wizard
box-agent config # show/edit config
box-agent doctor # health check
box-agent log # open log directory
In-session commands: /help, /clear, /history, /stats, /log, /exit
ACP & Editor Integration
Box Agent supports the Agent Communication Protocol for embedding in editors and apps.
Zed Editor — add to settings.json:
{
"agent_servers": {
"box-agent": {
"command": "/path/to/box-agent-acp"
}
}
}
Standalone Runtime — for Electron apps and other hosts:
# Download pre-built binary
gh release download v0.6.7 --repo Raccoon-Office/Box-Agent --pattern "box-agent-runtime-*.tar.gz"
# Or build from source
uv run python scripts/build_runtime.py
The runtime communicates via JSON-RPC over stdio. stdout = protocol only, stderr = diagnostics.
Testing
pytest tests/ -v # all tests
pytest tests/test_core.py -v # core + context compression
pytest --cov # with coverage
Troubleshooting
SSL Certificate Error: pip install --upgrade certifi or set verify=False for testing.
Module Not Found: Make sure you're in the project directory: cd Box-Agent && uv run python -m box_agent.cli
Contributing
Issues and PRs welcome! See Contributing Guide.
License
Links
If this project helps you, give it a ⭐!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file box_agent-0.7.2.tar.gz.
File metadata
- Download URL: box_agent-0.7.2.tar.gz
- Upload date:
- Size: 3.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7a88c2d82115675f0a08ba180516fc842eab1a71c3a9766d6099fa44dec8a12c
|
|
| MD5 |
7462f7f3ff20420b492ca2c3ce5628b3
|
|
| BLAKE2b-256 |
51549a3e33d4063693287e801ef47df393410a17153f71fd59c063011c95ac05
|
File details
Details for the file box_agent-0.7.2-py3-none-any.whl.
File metadata
- Download URL: box_agent-0.7.2-py3-none-any.whl
- Upload date:
- Size: 3.5 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.20
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d3bb9cbcf86f64e989c559980ed0aa1653cc129583b2701aaa7365acf2447850
|
|
| MD5 |
49d46520bcab8e660b9987efcc786aa1
|
|
| BLAKE2b-256 |
2a99a1e195405acb8183b028acffcc495d474ded23d9330c33f4320d3511a576
|