agentcage

Defense-in-depth proxy sandbox for AI agents.

Because "the agent would never do that" is not a security policy.

:warning: Warning: This is an experimental project. It has not been audited by security professionals. Use it at your own risk. See Security & Threat Model for details and known limitations.

Setting up OpenClaw? See the OpenClaw guide and openclaw/config.yaml.

What is it?

agentcage is a CLI that generates hardened, sandboxed environments for AI agents. In the default container mode, it produces systemd quadlet files that deploy three containers on a rootless Podman network -- no root privileges required. In Firecracker mode, the same container topology runs inside a dedicated microVM with its own Linux kernel, providing hardware-level isolation via KVM. In both modes, your agent runs on an internal-only network with no internet gateway; the only way out is through an inspecting mitmproxy that scans every HTTP request before forwarding it.

Features

:mag: Pluggable inspector chain -- domain filtering, secret detection, payload analysis, and custom Python inspectors
:key: Bidirectional secret injection -- agent gets placeholders, proxy injects outbound, redacts inbound
:detective: Regex-based secret scanning -- automatic provider-to-domain mapping, extensible via config
:bar_chart: Payload analysis -- Shannon entropy, content-type mismatch detection, base64 blob scanning, body-size limits
:globe_with_meridians: WebSocket frame inspection -- same inspector chain applied to every frame post-handshake
:satellite: DNS filtering -- dnsmasq sidecar, RFC 5737 placeholder IPs for non-allowlisted domains, query logging
:stopwatch: Per-host rate limiting -- token-bucket with configurable burst
:pencil: Structured audit logging -- JSON lines for all inspection decisions (block, flag, allow)
:lock: Container hardening -- read-only rootfs, all capabilities dropped, no-new-privileges
:package: Supply chain hardening -- pinned base image digests, lockfile integrity, SHA-256 patch verification
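
As an illustration of the per-host rate limiting listed above, here is a minimal token-bucket sketch. It is not agentcage's implementation: the class names, the `rate` and `burst` parameters, and the injectable `clock` are assumptions made for the example.

```python
import time
from typing import Callable, Dict


class TokenBucket:
    """One bucket: refills at `rate` tokens per second, holds at most `burst`."""

    def __init__(self, rate: float, burst: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.rate = rate
        self.burst = burst
        self.clock = clock
        self.tokens = float(burst)   # start full so an initial burst is allowed
        self.updated = clock()

    def allow(self) -> bool:
        now = self.clock()
        # Refill for the elapsed time, capped at the burst size.
        refill = (now - self.updated) * self.rate
        self.tokens = min(self.burst, self.tokens + refill)
        self.updated = now
        if self.tokens >= 1.0:
            self.tokens -= 1.0       # spend one token per request
            return True
        return False


class PerHostLimiter:
    """Hypothetical wrapper that keeps an independent bucket per host."""

    def __init__(self, rate: float, burst: float,
                 clock: Callable[[], float] = time.monotonic) -> None:
        self.rate = rate
        self.burst = burst
        self.clock = clock
        self.buckets: Dict[str, TokenBucket] = {}

    def allow(self, host: str) -> bool:
        bucket = self.buckets.get(host)
        if bucket is None:
            bucket = TokenBucket(self.rate, self.burst, self.clock)
            self.buckets[host] = bucket
        return bucket.allow()
```

With `rate=1.0` and `burst=2`, a host may send two requests back to back and then roughly one per second; other hosts are unaffected because each has its own bucket.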

Design Principles

:no_entry: Fail-closed -- if any component fails, traffic stops, not bypasses.
:shield: Secure by default -- all hardening is on out of the box; security is opt-out, not opt-in.
:mag: Inspect, don't just isolate -- every request, frame, and query is analyzed before forwarding.
:closed_lock_with_key: Agent never holds real secrets -- placeholders in, real values injected in transit only.
:scroll: Audit everything -- all decisions logged as structured JSON by default.
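
The "agent never holds real secrets" principle can be sketched roughly as follows. This is a simplified illustration, not the project's code: the `SECRETS` table, the `Blocked` exception, the function names, and the choice to redact inbound values back to their placeholder are all assumptions.

```python
import re

# Matches placeholders of the form {{NAME}}.
PLACEHOLDER = re.compile(r"\{\{([A-Z0-9_]+)\}\}")

# Hypothetical secret store: name -> (real value, domains allowed to receive it).
SECRETS = {
    "ANTHROPIC_API_KEY": ("sk-ant-real-value", {"api.anthropic.com"}),
}


class Blocked(Exception):
    """Raised when a placeholder is headed to an unauthorized domain."""


def inject_outbound(text: str, host: str) -> str:
    """Swap placeholders for real values, but only for authorized hosts."""
    def swap(match: "re.Match[str]") -> str:
        name = match.group(1)
        entry = SECRETS.get(name)
        if entry is None:
            return match.group(0)    # unknown placeholder: leave untouched
        value, domains = entry
        if host not in domains:
            raise Blocked(f"{name} is not allowed for {host}")
        return value
    return PLACEHOLDER.sub(swap, text)


def redact_inbound(text: str) -> str:
    """Replace any real secret value in a response with its placeholder."""
    for name, (value, _domains) in SECRETS.items():
        text = text.replace(value, "{{" + name + "}}")
    return text
```

The agent only ever sees `{{ANTHROPIC_API_KEY}}`; the real value exists in the request between the proxy and the authorized upstream, and is stripped again if the upstream echoes it back.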

Why is it needed?

Most AI agent deployments hand the agent a lethal trifecta:

:globe_with_meridians: Internet access -- the agent can reach any server on the internet.
:key: Secrets -- tokens and other secrets are passed as environment variables or mounted files.
:computer: Arbitrary code execution -- the agent runs code it writes itself, or code suggested by a model.

Any one of these alone is manageable. Combined, they create an exfiltration risk: if the agent is compromised, misaligned, or simply makes a mistake, it can send your secrets, source code, or private data to any endpoint on the internet. Most current setups have zero defense against this -- the agent has the same network access as any other process on the machine.

agentcage breaks the trifecta by placing the agent behind a defense-in-depth proxy sandbox: network isolation, domain filtering, secret injection, secret scanning, payload analysis, and container hardening -- all fail-closed. See Security & Threat Model for the full breakdown of each layer and known limitations.

How is it different?

Most agent sandboxes stop at network-level isolation: put the agent in a VM or container and control which hosts it can reach. agentcage adds a full inspection layer on top -- every HTTP request, WebSocket frame, and DNS query passes through a pluggable inspector chain before reaching the internet.

The agent never holds real secrets. Secret injection gives the agent placeholder tokens ({{ANTHROPIC_API_KEY}}); the proxy swaps in real values on outbound requests and redacts them from inbound responses. If a placeholder is sent to an unauthorized domain, the request is blocked. The secrets inspector provides a second line of defense with regex-based secret scanning that detects common key formats, each with automatic provider-to-domain mapping so legitimate API calls pass through without manual configuration.
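
A rough sketch of regex-based scanning with a provider-to-domain mapping is shown below. The two token formats (GitHub classic personal access tokens and AWS access key IDs) are publicly documented; the pattern table, the domain mapping, the subdomain-matching rule, and the function names are assumptions for illustration and not agentcage's actual rule set.

```python
import re
from typing import FrozenSet, List, NamedTuple


class SecretPattern(NamedTuple):
    provider: str
    regex: "re.Pattern[str]"
    domains: FrozenSet[str]   # hosts where this key format is expected


# Hypothetical pattern table; a real deployment would load this from config.
PATTERNS = [
    SecretPattern("github", re.compile(r"ghp_[A-Za-z0-9]{36}"),
                  frozenset({"api.github.com"})),
    SecretPattern("aws", re.compile(r"AKIA[0-9A-Z]{16}"),
                  frozenset({"amazonaws.com"})),
]


def host_matches(host: str, domains: FrozenSet[str]) -> bool:
    """True if host equals a listed domain or is a subdomain of one."""
    host = host.lower().rstrip(".")
    return any(host == d or host.endswith("." + d) for d in domains)


def find_leaks(body: str, host: str) -> List[str]:
    """Return providers whose key format appears in a request to a foreign host."""
    leaks = []
    for pattern in PATTERNS:
        if pattern.regex.search(body) and not host_matches(host, pattern.domains):
            leaks.append(pattern.provider)
    return leaks
```

A GitHub token sent to `api.github.com` yields no findings, while the same token sent to any other host is reported, which is the behavior that lets legitimate API calls through without per-call configuration.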

On top of domain filtering and secret detection, the inspector chain analyzes payloads for anomalies -- Shannon entropy (catching encrypted/compressed exfiltration), content-type mismatches, base64 blobs -- and inspects WebSocket frames with the same chain. All decisions are written as structured JSON audit logs.
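
To make the entropy check concrete, here is a small sketch of Shannon entropy over a request body, measured in bits per byte. The `looks_high_entropy` helper and its 7.5 threshold are illustrative assumptions, not the project's defaults.

```python
import math
from collections import Counter


def shannon_entropy(data: bytes) -> float:
    """Shannon entropy of `data` in bits per byte (0.0 to 8.0)."""
    if not data:
        return 0.0
    total = len(data)
    return -sum(
        (count / total) * math.log2(count / total)
        for count in Counter(data).values()
    )


def looks_high_entropy(body: bytes, threshold: float = 7.5) -> bool:
    """Flag bodies whose byte distribution is close to uniform.

    The threshold is a hypothetical value chosen for this example.
    """
    return shannon_entropy(body) >= threshold
```

Plain text and JSON usually sit well below the maximum of 8 bits per byte, while encrypted or compressed data approaches it, so an unusually high value on a supposedly textual body is a useful signal rather than proof of exfiltration.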

agentcage runs natively on headless Linux using rootless Podman -- fully self-hosted, a single CLI, open source.

How does it work?

A cage is three containers on an internal Podman network: your agent (no internet gateway), a dual-homed DNS sidecar, and a dual-homed mitmproxy that inspects and forwards all traffic.

  podman network: <name>-net (--internal, no internet gateway)
  ┌──────────────────────────────────────────────────────────────────┐
  │                                                                  │
  │  ┌──────────────┐    ┌───────────────┐    ┌──────────────────┐  │
  │  │ Agent         │    │ DNS sidecar   │    │ mitmproxy        │  │
  │  │               │    │ (dnsmasq)     │    │ + inspector chain│  │
  │  │ HTTP_PROXY=  ─┼────┼───────────────┼───►│                  │  │
  │  │  10.89.0.11  ─┼────┼───────────────┼───►│ scans + forwards─┼──┼─► Internet
  │  │               │    │               │    │                  │  │
  │  │ resolv.conf  ─┼───►│ resolves via  │    │                  │  │
  │  │               │    │ external net ─┼────┼──────────────────┼──┼─► Upstream DNS
  │  │               │    │               │    │                  │  │
  │  │ ONLY on       │    │ internal +    │    │ internal +       │  │
  │  │ internal net  │    │ external net  │    │ external net     │  │
  │  └──────────────┘    └───────────────┘    └──────────────────┘  │
  │                                                                  │
  └──────────────────────────────────────────────────────────────────┘

All HTTP traffic is routed via HTTP_PROXY / HTTPS_PROXY to the mitmproxy container. A pluggable inspector chain evaluates every request -- enforcing domain allowlists, scanning for secret leaks, analyzing payloads -- before forwarding or blocking with a 403. The chain short-circuits on the first hard block.
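
The short-circuiting chain described above can be sketched as follows. The `Request` and `Verdict` types, the inspector signature, and the treatment of inspector exceptions as blocks are assumptions for this example; agentcage's real inspector interface may differ.

```python
from dataclasses import dataclass
from typing import Callable, List, Optional, Set, Tuple


@dataclass(frozen=True)
class Request:
    host: str
    body: bytes = b""


@dataclass(frozen=True)
class Verdict:
    action: str       # "allow", "flag", or "block"
    reason: str = ""


Inspector = Callable[[Request], Verdict]


def domain_inspector(allowlist: Set[str]) -> Inspector:
    """Build an inspector that blocks hosts outside the allowlist."""
    def inspect(req: Request) -> Verdict:
        if req.host in allowlist:
            return Verdict("allow")
        return Verdict("block", f"domain {req.host} is not allowlisted")
    return inspect


def run_chain(inspectors: List[Inspector],
              req: Request) -> Tuple[Optional[int], List[Verdict]]:
    """Run inspectors in order; stop at the first hard block.

    Returns (403, verdicts) when blocked, or (None, verdicts) when the
    request may be forwarded.
    """
    verdicts: List[Verdict] = []
    for inspect in inspectors:
        try:
            verdict = inspect(req)
        except Exception as exc:
            # Fail closed: a crashing inspector blocks the request.
            verdict = Verdict("block", f"inspector error: {exc}")
        verdicts.append(verdict)
        if verdict.action == "block":
            return 403, verdicts
    return None, verdicts
```

Ordering cheap checks such as the domain allowlist first means that most rejected requests never reach the more expensive payload inspectors.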

See Architecture for the full inspector chain, startup order, and certificate sharing.
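
The DNS sidecar's handling of non-allowlisted names can be modeled in a few lines. In agentcage this job is done by dnsmasq; the Python below only illustrates the decision logic. The specific placeholder address, the subdomain-matching rule, and the `upstream` callback are assumptions.

```python
import ipaddress
from typing import Callable, Set

# 192.0.2.0/24 (TEST-NET-1) is reserved for documentation by RFC 5737.
PLACEHOLDER_NET = ipaddress.ip_network("192.0.2.0/24")
PLACEHOLDER_IP = "192.0.2.1"   # hypothetical choice within that block


def is_allowed(name: str, allowlist: Set[str]) -> bool:
    """True if name is an allowlisted domain or a subdomain of one."""
    name = name.lower().rstrip(".")
    return any(name == d or name.endswith("." + d) for d in allowlist)


def answer(name: str, allowlist: Set[str],
           upstream: Callable[[str], str]) -> str:
    """Resolve allowlisted names upstream; give everything else a placeholder."""
    if is_allowed(name, allowlist):
        return upstream(name)
    return PLACEHOLDER_IP
```

Because the placeholder address is non-routable by design, a lookup for a non-allowlisted name still returns an answer, but that answer never discloses a real address to the agent.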

Isolation Modes

agentcage supports two isolation modes. Both share the same three-container topology and inspector chain -- the difference is what provides the outer isolation boundary.

Aspect	Container mode (default)	Firecracker mode
Isolation	Linux namespaces (rootless Podman)	Hardware virtualization (KVM)
Kernel	Shared with host	Dedicated guest kernel per cage
Container escape risk	Mitigated by hardening, not eliminated	Contained -- an escape lands in the guest VM, not on the host
Root required	No	Yes (for TAP device and bridge setup)
macOS support	Yes (via Podman machine)	No (requires `/dev/kvm`)
Boot overhead	~1s	~7s
Best for	Development, CI, low-risk workloads	Production, untrusted agents, high-security

Set isolation: firecracker in your config to use Firecracker mode. See Firecracker MicroVM Isolation for setup and details.

Prerequisites

Podman (rootless)
Python 3.12+
uv (Python package manager)

Linux

Arch Linux:

sudo pacman -S podman python uv

Debian / Ubuntu (24.04+):

sudo apt install podman python3
curl -LsSf https://astral.sh/uv/install.sh | sh

Fedora:

sudo dnf install podman python3 uv

macOS

brew install podman python uv
podman machine init
podman machine start

Note: On macOS, Podman runs containers inside a Linux VM. podman machine init creates that VM, and podman machine start boots it.

Firecracker Mode (optional)

Firecracker mode requires Linux with /dev/kvm access. See Firecracker setup for full prerequisites. macOS is not supported for Firecracker mode.

Install

uv tool install agentcage            # from PyPI
uv tool install git+https://github.com/agentcage/agentcage.git  # from GitHub

Or for development:

git clone https://github.com/agentcage/agentcage.git
cd agentcage
uv run agentcage --help

Updating Dependencies

All dependencies are pinned (lock files, image digests, binary checksums). To check for updates:

./scripts/update-deps.py              # check all, report only
./scripts/update-deps.py --update     # check all, apply updates
./scripts/update-deps.py containers   # check a single category

Categories: python, containers, firecracker, kernel, node, pip.

Requires skopeo for container image checks (sudo pacman -S skopeo on Arch).
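
Checksum pinning of the kind mentioned in this section comes down to comparing a file's digest against a recorded value. The sketch below shows that check with the standard library; the function names are hypothetical and this is not the logic of update-deps.py.

```python
import hashlib
import hmac
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 65536) -> str:
    """Hex SHA-256 digest of a file, read in chunks to bound memory use."""
    digest = hashlib.sha256()
    with path.open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_pinned(path: Path, expected_hex: str) -> bool:
    """True only if the file's digest matches the pinned value exactly."""
    return hmac.compare_digest(sha256_of(path), expected_hex.strip().lower())
```

A caller that wants fail-closed behavior would refuse to use the artifact whenever `verify_pinned` returns False, rather than logging a warning and continuing.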

Usage

# 1. Write your config
cp examples/basic/config.yaml config.yaml
vim config.yaml

# 2. Store secrets (before creating the cage)
agentcage secret set myapp ANTHROPIC_API_KEY
agentcage secret set myapp GITHUB_TOKEN

# 3. Create the cage (builds images, generates quadlets, starts containers)
agentcage cage create -c config.yaml

# 4. Verify it's healthy
agentcage cage verify myapp

# 5. View logs
agentcage cage logs myapp             # agent logs
agentcage cage logs myapp -s proxy    # proxy inspection logs
agentcage cage logs myapp -s dns      # DNS query logs

# 6. Update after code/config changes
agentcage cage update myapp -c config.yaml

# 7. Rotate a secret (auto-reloads the cage)
agentcage secret set myapp ANTHROPIC_API_KEY

# 8. Restart without rebuild (config-only change)
agentcage cage reload myapp

# 9. Tear it all down
agentcage cage destroy myapp

CLI Overview

Group	Commands
`cage`	`create`, `update`, `list`, `logs`, `destroy`, `verify`, `reload`
`secret`	`set`, `list`, `rm`
`domain`	`list`, `add`, `rm`

See CLI Reference for full documentation of all commands and options.

Deployment State

agentcage tracks each cage in ~/.config/agentcage/deployments/<name>/config.yaml. This stored config copy allows commands like cage update (without -c) and cage reload to operate without requiring the original config file. The state is removed when a cage is destroyed.
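
The lookup this enables can be sketched as below: use an explicitly supplied config if there is one, otherwise fall back to the stored copy. The function names and the `home` parameter are hypothetical; only the directory layout comes from the description above.

```python
from pathlib import Path
from typing import Optional


def stored_config_path(name: str, home: Optional[Path] = None) -> Path:
    """Path of the stored config copy for the cage called `name`."""
    root = home if home is not None else Path.home()
    return root / ".config" / "agentcage" / "deployments" / name / "config.yaml"


def resolve_config(name: str, explicit: Optional[Path] = None,
                   home: Optional[Path] = None) -> Path:
    """Prefer an explicit config file; otherwise use the stored copy."""
    if explicit is not None:
        return explicit
    stored = stored_config_path(name, home)
    if not stored.is_file():
        raise FileNotFoundError(f"no stored config for cage {name!r}: {stored}")
    return stored
```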

Architecture

See Architecture for the full container topology, inspector chain, startup order, and certificate sharing.

Configuration

See the Configuration Reference for all settings, defaults, and examples. Example configs: basic/config.yaml | openclaw/config.yaml

Security

The agent has no internet gateway -- all traffic must pass through the proxy, which applies domain filtering, secret detection, payload inspection, and custom inspectors. For workloads requiring hardware-level isolation, Firecracker mode adds a dedicated guest kernel per cage, so a container escape lands inside the microVM rather than on the host. See Security & Threat Model for the full threat model, defense layers, and known limitations.

License

MIT
