Apple Silicon native AI agent sandbox — three-tier isolated code execution with MCP server

These details have not been verified by PyPI

Project links

Project description

SiliconSandbox

Archived — not actively maintained. This project is published as-is. It works, has 246 tests (94% pass rate), and ran in production for a month. PRs and issues are welcome but may not be addressed.

Apple Silicon native AI agent sandbox and orchestration platform. Runs entirely on macOS using native primitives — no Docker, no cloud dependency.

What It Does

Provides isolated execution environments for AI agents to run code safely, plus an orchestrator to break complex tasks into parallel subtasks routed across local and cloud LLMs.

Three isolation tiers — pick the right tradeoff between security and speed:

Tier	Technology	RAM	Boot	Use Case
A (Seatbelt)	macOS `sandbox-exec` + SBPL	~0 MB	~50ms	Default for code execution
B (MicroVM)	Apple Virtualization.framework + Alpine Linux	~256 MB	~260ms	Untrusted binaries, Linux envs, browser automation
C (Native)	`subprocess` + rlimit + setpgrp	~0 MB	~10ms	Trusted internal tools

Architecture

Web UI (:8095) ─── Orchestrator (:8094) ─── Sandbox Engine (:8093)
                         │                        │
                    Model Router              Three Tiers:
                    ├─ Local LLM (coder)      ├─ A: Seatbelt (sandbox-exec)
                    ├─ Local LLM (classifier) ├─ B: MicroVM (Virt.framework)
                    └─ Claude API (planner)   └─ C: Native (subprocess + rlimit)
                                                   │
MCP Server (:8100) ──────────────────────────── Engine API
  11 tools for Claude Code                         │
                                              Network Proxy (:8098)
                                              Domain allowlist, deny-all default

Requirements

macOS 14+ (Sonoma) on Apple Silicon (M1/M2/M3/M4)
Python 3.12+
Xcode Command Line Tools (for Swift vm-launcher build)
Optional: Anthropic API key for the orchestrator's planner

Quick Start

# Install dependencies, build vm-launcher, set up LaunchAgents
./scripts/install.sh

# Start all services
./scripts/launch.sh

# Verify
curl -s http://127.0.0.1:8093/health | python3 -m json.tool

# Stop all services
./scripts/stop.sh

Usage

Direct API

# Run a command in a Seatbelt sandbox
curl -X POST http://127.0.0.1:8093/sandbox \
  -H 'Content-Type: application/json' \
  -d '{"command": "python3 -c \"print(2+2)\"", "tier": "A", "timeout": 10}'

# Create a persistent session
curl -X POST http://127.0.0.1:8093/session \
  -H 'Content-Type: application/json' \
  -d '{"tier": "A", "ttl_seconds": 3600}'

# Execute in session
curl -X POST http://127.0.0.1:8093/session/{id}/exec \
  -H 'Content-Type: application/json' \
  -d '{"command": "python3 main.py"}'

Python SDK

from silicon_sandbox import Sandbox, Session

# One-shot execution
result = Sandbox.run("echo hello", tier="A")
print(result.stdout)

# Persistent session with file operations
with Session.create() as session:
    session.write_files({"main.py": "print('hello')"})
    result = session.exec("python3 main.py")
    print(result.stdout)

MCP Tools (Claude Code / AI Agents)

sandbox_run — one-shot sandboxed execution
sandbox_health — engine health check
session_create / session_exec / session_destroy — persistent sessions
session_write_files / session_read_file — file operations in sessions
session_pause / session_resume — SIGSTOP/SIGCONT process control
session_list — list active sessions

Desktop Automation (Tier B)

Tier B boots a full Alpine Linux VM with Xvfb, Openbox, and Chromium:

# Create a desktop session
curl -X POST http://127.0.0.1:8093/session \
  -H 'Content-Type: application/json' \
  -d '{"tier": "B", "image": "desktop"}'

# Take a screenshot
curl http://127.0.0.1:8093/session/{id}/screenshot --output screen.png

# Control the browser via CDP
curl -X POST http://127.0.0.1:8093/session/{id}/browser/control \
  -H 'Content-Type: application/json' \
  -d '{"method": "Page.navigate", "params": {"url": "https://example.com"}}'

Security

Deny-default Seatbelt profiles (SBPL v2): (deny default) base with selective allows
Blocked paths: ~/.ssh, ~/.gnupg, ~/Library/Keychains, ~/.config/git/credentials, ~/.netrc, ~/.aws
Process isolation: os.setpgrp() + resource.setrlimit() per sandbox
Network: deny-all by default, opt-in through domain allowlist proxy on :8098
Auth: optional Bearer token via SILICONSANDBOX_AUTH_TOKEN env var

Configuration

Copy and edit config/default.yaml:

sandbox:
  seatbelt:
    denied_paths: ["~/.ssh", "~/.gnupg", "~/Library/Keychains"]
    max_cpu_seconds: 120
    max_processes: 50
  microvm:
    default_cpus: 2
    default_memory_gb: 2
  network:
    proxy_port: 8098
    allowed_domains: ["pypi.org", "github.com"]
    deny_all_by_default: true

orchestrator:
  models:
    planner:
      provider: anthropic
      model: claude-sonnet-4-20250514
    coder:
      provider: openai_compatible
      endpoint: "http://127.0.0.1:8080/v1"
      model: "your-local-model"

The orchestrator's model router supports any OpenAI-compatible endpoint for local models and Anthropic API for planning/research roles. Set ANTHROPIC_API_KEY in your environment for the planner.

Tests

.venv/bin/python3 -m pytest tests/ -v
# 246 tests across 8 modules

Project Structure

silicon-sandbox/
├── sandbox-engine/          # Core engine (seatbelt, microvm, native, server)
│   ├── sandbox_engine/      # Python package
│   ├── guest-agent/         # MicroVM guest agent (init + shell scripts)
│   └── vm-launcher/         # Swift CLI (Virtualization.framework)
├── orchestrator/            # DAG engine, model router, planner, memory
├── tools/                   # MCP tool servers
│   ├── sandbox-mcp/         # Main MCP server (port 8100)
│   ├── code-interpreter/    # Python/Node/Bash execution
│   ├── file-manager/        # Scoped workspace operations
│   ├── web-research/        # DuckDuckGo + readability
│   └── browser-automation/  # CDP via MicroVM desktop
├── sdk/                     # Python SDK (silicon_sandbox package)
├── ui/                      # Web UI (Alpine.js, single HTML file)
├── config/                  # YAML config, SBPL profiles, VM image scripts
├── scripts/                 # install.sh, launch.sh, stop.sh
├── launchd/                 # LaunchAgent plist templates
└── tests/                   # 246 tests

Acknowledgments

Phase 8 hardening was informed by review of Alibaba's OpenSandbox — specifically the deny-default Seatbelt profile pattern, preexec_fn process isolation approach, persistent session concept, and SDK design. No code was copied; the implementation is independent.

License

MIT — see LICENSE.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.5.0

Mar 31, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

silicon_sandbox-0.5.0.tar.gz (56.9 kB view details)

Uploaded Mar 31, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

silicon_sandbox-0.5.0-py3-none-any.whl (40.4 kB view details)

Uploaded Mar 31, 2026 Python 3

File details

Details for the file silicon_sandbox-0.5.0.tar.gz.

File metadata

Download URL: silicon_sandbox-0.5.0.tar.gz
Upload date: Mar 31, 2026
Size: 56.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for silicon_sandbox-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`eaa620bd6a7b31d78c9227c28b071a01d28c3d73f81901b373b06d03968ec61a`
MD5	`66069c60f3bb7b2028b5c433dc293a6e`
BLAKE2b-256	`69899ec5b1caf9a08b19cb4cda5f92e6befbc546db59540492a4ddb62192fe67`

See more details on using hashes here.

File details

Details for the file silicon_sandbox-0.5.0-py3-none-any.whl.

File metadata

Download URL: silicon_sandbox-0.5.0-py3-none-any.whl
Upload date: Mar 31, 2026
Size: 40.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for silicon_sandbox-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e032e5c63ca43b61f80c5e140134973f9fb798c28a78a7706e94cf8f38eee70e`
MD5	`8380497cda13334b83d2b8c36080a951`
BLAKE2b-256	`29f86053aa6d104a042dfccc22944ab3a47459fd9f700f11d9b86dc1c92eb69f`

See more details on using hashes here.

silicon-sandbox 0.5.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

SiliconSandbox

What It Does

Architecture

Requirements

Quick Start

Usage

Direct API

Python SDK

MCP Tools (Claude Code / AI Agents)

Desktop Automation (Tier B)

Security

Configuration

Tests

Project Structure

Acknowledgments

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes