Skip to main content

Runtime governance for AI agents — deterministic fail-closed enforcement. Wraps any agent tool and blocks dangerous calls before execution. Zero LLM calls, zero cloud dependencies, works offline.

Project description

ShadowAudit

Deterministic runtime authorization for AI agent tool calls.

PyPI version Python versions License: MIT Tests: passing

ShadowAudit sits between AI agents and their tools. It evaluates every tool call against deterministic policy before execution, then allows it, blocks it, or requires approval.

It is closer to IAM, Open Policy Agent, API gateways, and admission controllers than prompt guardrails or moderation.

Agent -> ShadowAudit -> Tool
          |
          +-- allow
          +-- deny
          +-- require approval

Why This Exists

Agents increasingly call tools that can move money, edit databases, run shell commands, read files, modify permissions, and operate SaaS admin APIs.

Prompt instructions are not an execution boundary. ShadowAudit gives those tool calls an explicit runtime authorization layer:

  • No LLM call in the gate path.
  • Policy-as-code decisions.
  • Default-deny behavior when policy does not match.
  • Fail-closed handling for policy and audit failures.
  • Local audit logs with hash-chain verification.
  • Observe mode for safe rollout before enforcement.

Install

pip install shadowaudit

60-Second Example

Create a policy:

deny:
  - capability: shell.execute
    contains: "rm -rf"

require_approval:
  - capability: payments.transfer
    amount_gt: 1000

allow:
  - capability: shell.execute
  - capability: payments.transfer
    amount_lte: 1000

Evaluate a tool call before execution:

from shadowaudit.core.gate import Gate

gate = Gate()

result = gate.evaluate(
    agent_id="ops-agent",
    task_context="shell",
    risk_category="shell_execution",
    capability="shell.execute",
    policy_path="policies/shell_agent.yaml",
    payload={"command": "rm -rf /var/lib/postgresql"},
)

if not result.passed:
    raise PermissionError(f"Blocked: {result.reason}")

The dangerous command never reaches the tool.

Framework Integrations

ShadowAudit can wrap tools in:

  • LangChain
  • LangGraph
  • CrewAI
  • OpenAI Agents SDK
  • MCP
  • PydanticAI
  • LlamaIndex
  • AutoGen
  • Direct Python runtimes

LangChain example:

from shadowaudit import ShadowAuditTool
from langchain.tools import ShellTool

safe_shell = ShadowAuditTool(
    tool=ShellTool(),
    agent_id="ops-agent",
    capability="shell.execute",
    policy_path="policies/shell_agent.yaml",
)

CLI Workflows

Scan for ungated tools:

shadowaudit check ./src --fail-on-ungated

Validate a policy:

shadowaudit check-policy policies/shell_agent.yaml

Replay a trace through policy:

shadowaudit simulate --trace-file traces/agent_trace.jsonl --compare

Verify audit-log integrity:

shadowaudit verify --audit-log audit.db

Rollout Path

  1. Start in observe mode and log decisions without blocking.
  2. Review audit logs and tune policies.
  3. Enforce policy for high-risk tools.
  4. Add CI checks so new ungated tools cannot quietly ship.
  5. Replay incidents and policy changes against saved traces.

What ShadowAudit Is Not

ShadowAudit is a runtime authorization and audit layer. It does not replace:

  • sandboxing for shell/code execution
  • least-privilege credentials
  • network egress controls
  • prompt-injection defenses
  • human review for genuinely ambiguous high-risk actions

Use it as the deterministic control point before tool execution.

Why Not Prompt Guardrails?

Prompt guardrails are useful, but they do not enforce execution. A prompt can be bypassed, misinterpreted, or ignored under pressure. ShadowAudit adds a deterministic enforcement boundary that blocks tool calls before they execute and records a tamper-evident audit trail.

Where It Sits In Your Stack

Agent framework -> ShadowAudit gate -> Tool/API/DB/Shell

ShadowAudit does not replace sandboxing, network egress controls, or least-privilege credentials. It complements them by enforcing runtime policy at the tool boundary.

Project Status

ShadowAudit is beta infrastructure for agent tool governance. The repo includes:

  • deterministic gate and policy engine
  • local audit log with hash-chain verification
  • approval workflows
  • observe mode and bypass audit trails
  • framework adapters
  • MCP gateway and adapter
  • static scanner and CI mode
  • OWASP Agentic Top 10 and EU AI Act evidence reports
  • typed Python package with ruff, mypy, and pytest coverage

Current local verification: run pytest -q.

Documentation

Useful starting points:

Contributing

git clone https://github.com/shadowauditlabs/shadowaudit-python.git
cd shadowaudit-python
pip install -e ".[dev]"
pytest tests/ -q

Policy recipes, framework adapters, taxonomies, docs, and focused bug reports are welcome.

License

MIT License

Built by ShadowAudit Labs

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

shadowaudit-0.6.2.tar.gz (165.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

shadowaudit-0.6.2-py3-none-any.whl (112.5 kB view details)

Uploaded Python 3

File details

Details for the file shadowaudit-0.6.2.tar.gz.

File metadata

  • Download URL: shadowaudit-0.6.2.tar.gz
  • Upload date:
  • Size: 165.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for shadowaudit-0.6.2.tar.gz
Algorithm Hash digest
SHA256 a753250159d3fdab572e8c9b52e009b63d8e931dda27ee428a89bf13c34068c2
MD5 49c71bc7ba7c95e00a70b76c83ea513d
BLAKE2b-256 d696e0ccfbf6e6e297da2f4e05a3d5ee7202d0ddf02f9ca2a9f4c825178f1bcd

See more details on using hashes here.

File details

Details for the file shadowaudit-0.6.2-py3-none-any.whl.

File metadata

  • Download URL: shadowaudit-0.6.2-py3-none-any.whl
  • Upload date:
  • Size: 112.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for shadowaudit-0.6.2-py3-none-any.whl
Algorithm Hash digest
SHA256 0fab0f29657f5fcaa6e2edce661ee906dade4152dcddbb2e26f05c08a76e4279
MD5 31733eb8cafd10c2b9b59c6406cd2c56
BLAKE2b-256 3b4788d1dd5e738ef26da2c0f8b6fffd4bf3f3cd1cc00e8b874547e1b0d0b3ae

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page