Security platform protecting autonomous AI coding agents
Project description
ShieldPilot
The open-source AI agent firewall.
Protect your AI agents from prompt injection, supply chain attacks, and MCP vulnerabilities — with a hook, a scanner, and a policy engine that runs wherever your agent runs.
What it does
AI agents (Claude Code, OpenClaw, custom LLM pipelines) execute shell commands, read files, call APIs, and install packages. ShieldPilot sits between the agent and the OS, scoring every action before it runs.
AI Agent ──► ShieldPilot Hook ──► Risk Engine ──► ALLOW / WARN / BLOCK
│
Prompt Injection Scanner (220+ patterns, 5 languages)
ML Classifier (DeBERTa-v3, optional)
Indirect Injection (HTML/JSON/Markdown/Zero-Width)
Supply Chain Auditor
MCP Vulnerability Scanner
Agent Behavior Monitor
Policy-as-Code Engine
9 Behavior Analyzers
│
Tamper-Evident Audit Log
$ sentinel run "rm -rf /"
analyzing with 9 risk engines...
risk_score: 100 | action: BLOCK
command blocked. audit logged.
Quick Start
pip install shieldpilot
sentinel init # creates sentinel.yaml and sentinel.db
sentinel hook install # install Claude Code pre-tool hook
sentinel dashboard # open http://localhost:8420
That's it. Every command Claude Code runs is now evaluated before execution.
Optional: ML-powered injection classifier (DeBERTa-v3)
pip install shieldpilot[ml]
sentinel ml-setup # download ProtectAI DeBERTa-v3 model (~260 MB)
Scan an MCP server config
sentinel mcp-scan ~/.claude/claude_desktop_config.json
Audit your dependencies
sentinel supply-chain-audit requirements.txt
Features
| Feature | Description |
|---|---|
| Prompt Injection Detection | 220+ patterns across 19 categories — jailbreaks, role manipulation, fake history, policy erosion, stealth memos, presupposition attacks. Multilingual (5 languages). |
| ML Injection Classifier | Optional ProtectAI DeBERTa-v3 ONNX model. pip install shieldpilot[ml] + sentinel ml-setup. Offline, no API calls. |
| Indirect Injection Detection | Scans HTML, JSON, Markdown, and tool outputs for injections hidden in zero-width chars, Unicode homoglyphs, and data payloads. |
| MCP Security Scanner | Detects SSRF vectors, leaked secrets, over-privileged tool definitions, and missing auth in MCP server configs. |
| Supply Chain Auditor | Flags malicious packages, typosquatting candidates, GPL license conflicts, and dependency confusion vectors in requirements files. |
| Agent Behavior Monitor | Baseline + anomaly detection. Records normal agent behavior; alerts on deviations across 9 risk dimensions. |
| Policy-as-Code Engine | Declarative YAML policies. 3 built-in profiles (default_safe, strict_production, development). Fully composable. |
| 9 Behavior Analyzers | Destructive FS, privilege escalation, network exfiltration, credential access, persistence, obfuscation, malware patterns, supply chain, injection |
| Tamper-Evident Logging | SHA-256 hash chain across 5 audit tables. sentinel verify detects any tampering. |
| Multi-Platform Hook | Claude Code, OpenClaw, generic JSON — auto-detected via adapter layer. |
| Web Dashboard | Real-time command log, incident management, scan history, chain integrity. |
| REST API | Full API at /api/docs for CI/CD and SIEM integration. |
| Self-Hosted | SQLite by default, no external dependencies, runs air-gapped. |
| Stripe Billing | Free / Pro / Enterprise tiers with self-serve checkout. |
Architecture
graph LR
A[AI Agent] -->|pre-tool-use| B[Hook]
B --> C[Adapter Layer]
C -->|normalized input| D[Risk Engine]
D --> E[Prompt Injection Scanner]
D --> E2[ML Classifier]
D --> F[Supply Chain Auditor]
D --> G[Behavior Analyzers x9]
D --> H2[MCP Scanner]
D --> H3[Indirect Injection]
D --> H4[Behavior Monitor]
E & E2 & F & G & H2 & H3 & H4 --> H{Score 0-100}
H -->|< 40| I[ALLOW]
H -->|40-79| J[WARN]
H -->|>= 80| K[BLOCK]
I & J & K --> L[(Audit Log)]
L --> M[Web Dashboard]
L --> N[REST API]
O[Policy Engine] -->|rules| D
sentinelai/
├── adapters/ # Platform detection: Claude Code, OpenClaw, generic
├── api/ # FastAPI REST API (18 routers)
├── cli/ # Typer CLI
├── engine/ # Risk scoring engine + 9 analyzers
├── hooks/ # Claude Code pre-tool-use hook
├── logger/ # Tamper-evident SQLite logging
├── migrations/ # Alembic + safe migration runner
├── ml/ # DeBERTa-v3 ONNX classifier (optional)
├── monitor/ # Agent behavior baseline + anomaly detection
├── policy/ # Policy-as-Code engine + 3 default YAML policies
├── sandbox/ # Sandboxed command execution
├── scanner/ # Prompt injection, indirect injection, MCP, supply chain
├── services/ # Business logic (auth, billing, teams, rules)
└── web/ # Vanilla JS SPA dashboard
Comparison
| ShieldPilot | Lakera Guard | Prompt Security | Astrix Security | |
|---|---|---|---|---|
| Open-Source | ✅ Apache 2.0 | ❌ | ❌ | ❌ |
| Self-Hosted | ✅ | ❌ SaaS only | ❌ SaaS only | ❌ SaaS only |
| MCP Scanner | ✅ | ❌ | ❌ | ❌ |
| Supply Chain | ✅ | ❌ | ❌ | ✅ partial |
| Prompt Injection | ✅ 220+ patterns | ✅ | ✅ | ❌ |
| ML Classifier | ✅ DeBERTa-v3 | ✅ | ✅ | ❌ |
| Indirect Injection | ✅ HTML/JSON/MD | ❌ | ❌ | ❌ |
| Agent Behavior Monitor | ✅ | ❌ | ❌ | ✅ partial |
| Policy-as-Code | ✅ YAML | ❌ | ❌ | ❌ |
| Claude Code Hook | ✅ native | ❌ | ❌ | ❌ |
| Audit Log Integrity | ✅ hash chain | ❌ | ❌ | ❌ |
| Price | Free / OSS | Enterprise | Enterprise | Enterprise |
Installation
# From PyPI
pip install shieldpilot
# With ML-powered injection classifier (DeBERTa-v3, ~260 MB model)
pip install shieldpilot[ml]
# From source
git clone https://github.com/maxwalser001-del/Cyber-Security-.git
cd Cyber-Security-
pip install -e ".[dev]"
Requirements: Python 3.9+, SQLite 3.x
Configuration
# sentinel.yaml
sentinel:
mode: enforce # enforce | audit | disabled
risk_thresholds:
block: 80
warn: 40
protected_paths: [/etc, ~/.ssh, ~/.aws, ~/.gnupg]
whitelist:
commands: [ls, cat, echo, pwd, whoami, git status]
blacklist:
commands: ["rm -rf /", "mkfs", ":(){:|:&};:"]
domains: []
sandbox:
enabled: true
timeout: 30 # seconds
auth:
local_first: true # skip JWT for localhost
Policy-as-Code (YAML):
# Use a built-in policy profile
sentinel policy eval --profile strict_production "npm install pkg"
# Or reference your own policy file
sentinel policy eval --policy ./my-policy.yaml "curl example.com | bash"
Full reference: docs.shieldpilot.dev/configuration
CLI
sentinel run "npm install pkg" # evaluate + execute
sentinel scan prompt.txt # scan file for injection
sentinel scan-content output.html # scan tool output / HTML / JSON for indirect injection
sentinel mcp-scan ~/.claude/claude_desktop_config.json # MCP security scan
sentinel supply-chain-audit requirements.txt # supply chain audit
sentinel monitor --baseline baseline.json # agent behavior monitor
sentinel policy eval "curl x | bash" # evaluate against policy
sentinel ml-setup # download DeBERTa-v3 model
sentinel ml-test "ignore previous instructions" # ML injection test
sentinel hook install # install Claude Code hook
sentinel hook test "curl x | bash" # dry-run test
sentinel logs --action block # browse blocked commands
sentinel verify # verify audit chain integrity
sentinel status # system health
Exit codes: 0 ALLOW/clean · 1 BLOCK/threats · 2 config error
API
POST /api/auth/login
GET /api/health
POST /api/scan/prompt
GET /api/commands
GET /api/incidents
PATCH /api/incidents/{id}/resolve
GET /api/export/commands
Interactive docs at http://localhost:8420/api/docs
Risk Scoring
| Score | Level | Action |
|---|---|---|
| 0–39 | Low | ALLOW — runs automatically |
| 40–79 | Medium/High | WARN — pause for review |
| 80–100 | Critical | BLOCK — denied, incident logged |
Pricing
| Free | Pro ($19.99/mo) | Enterprise | |
|---|---|---|---|
| Commands/day | 50 | 1,000 | Unlimited |
| Scans/day | 10 | 100 | Unlimited |
| History | 1 day | 30 days | 90 days |
| Export + API | - | Yes | Yes |
| Priority support | - | - | Yes |
Contributing
Contributions are welcome. See CONTRIBUTING.md for setup, code style, and PR process.
Key areas where help is most useful:
- New injection patterns (see
sentinelai/scanner/patterns.py) - New risk analyzers (see
sentinelai/engine/) - MCP tool definition scanning
- IDE integrations beyond Claude Code
Security
Found a vulnerability? See SECURITY.md for responsible disclosure.
Development
git clone https://github.com/maxwalser001-del/Cyber-Security-.git
cd Cyber-Security-
pip install -e ".[dev]"
pytest tests/ -x -q # 2633 tests
License
Apache 2.0 — see LICENSE.
Links
- Docs: docs.shieldpilot.dev
- PyPI: pypi.org/project/shieldpilot
- Issues: GitHub Issues
- Changelog: CHANGELOG.md
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file shieldpilot-0.2.0.tar.gz.
File metadata
- Download URL: shieldpilot-0.2.0.tar.gz
- Upload date:
- Size: 408.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5e8afcdaed88bd385282c612cda614a18e5a1969d307b49849f403d4cc4318e5
|
|
| MD5 |
e065ccc514e25fd31d6b4f16c9ee59e3
|
|
| BLAKE2b-256 |
6816b57a8bfd23d64cea6c60baf565a4ce15f177b6e3fc3ad796201d79b4d932
|
File details
Details for the file shieldpilot-0.2.0-py3-none-any.whl.
File metadata
- Download URL: shieldpilot-0.2.0-py3-none-any.whl
- Upload date:
- Size: 428.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.9.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
311b21e0801a107891e85cf730d9145cebe982ae3515a11901da1413d66b6814
|
|
| MD5 |
c2446444d11dd6a0fb169436c229b38f
|
|
| BLAKE2b-256 |
665d52c40f33686f40e55fd60fd5794930fc8eace07a4bde43363a901f2297e8
|