OWASP-inspired launch gate for AI agents โ scan MCP configs, test policies, simulate risk, and generate compliance evidence. Static-only, offline, no API keys.
Project description
๐ก๏ธ Pluto AgentGuard
Security launch gate for AI agents. Other tools scan configs โ AgentGuard tests your policy against attack scenarios, simulates risk impact, maps results to an OWASP-inspired control framework, and generates launch evidence.
What Makes This Different
MCP security scanners are multiplying fast (Snyk agent-scan, Invariant guardrails, AgentSeal). Most focus on config detection or runtime analysis. AgentGuard adds policy coverage testing, what-if simulation, drift detection, and launch evidence โ all offline, no LLM or vendor lock-in:
| Capability | Scanners | AgentGuard |
|---|---|---|
| Detect secrets & misconfigs statically (no server execution) | ๐ก Varies | โ
aguard scan |
| Policy coverage testing (22 attack scenarios) | โ | โ
aguard test |
| "What-if" risk impact before applying changes | โ | โ
aguard whatif |
| OWASP-inspired control coverage (20 controls) | โ | โ
aguard owasp |
| Launch readiness evidence packets | โ | โ
aguard evidence |
| Baseline drift detection | โ | โ
aguard baseline |
| Behavioral trace audit with approval model | โ | โ
aguard monitor |
๐บ Interactive demo โ see all 7 commands in action (clone repo, open in browser)
Quick Start (60 seconds)
pip install pluto-aguard
# Clone for examples
git clone https://github.com/arpitha-dhanapathi/pluto-aguard.git && cd pluto-aguard
# Scan a realistic insecure AI project โ finds 18 real issues
aguard scan ./examples/demo-agent-project/
# Test your policy against 22 attack scenarios
aguard test --policy ./examples/agent-policy.yaml --attack-pack all
# Generate OWASP-inspired control coverage report
aguard owasp ./examples/demo-agent-project/
# Simulate policy changes โ see risk drop before applying
aguard whatif --config ./examples/insecure-agent-config.yaml
# Generate launch readiness evidence packet
aguard evidence ./examples/ --config ./examples/insecure-agent-config.yaml \
--policy ./examples/agent-policy.yaml
# Save baseline, detect drift later
aguard baseline create ./examples/
aguard baseline compare ./examples/
No cloud accounts. No API keys. Runs entirely locally.
GitHub Action
- name: Agent Security Gate
uses: arpitha-dhanapathi/pluto-aguard@v0.9.2
with:
path: '.'
max-risk: '50'
fail-on: 'high'
policy: 'agent-policy.yaml'
attack-pack: 'all'
sarif-output: 'results.sarif'
- uses: github/codeql-action/upload-sarif@v3
with:
sarif_file: results.sarif
See docs/github-action-usage.md for full options.
Commands
| Command | What It Does | Maturity |
|---|---|---|
aguard scan |
Static analysis โ secrets, misconfigs, unsafe AI code patterns | โ Stable |
aguard test |
Policy coverage testing โ 22 attack scenarios across 6 packs | โ Stable |
aguard owasp |
OWASP-inspired control coverage report (20 controls) | โ Stable |
aguard whatif |
Policy impact simulation โ risk delta before applying changes | โ Stable |
aguard evidence |
Launch readiness packet with approval checklist | ๐ถ Beta |
aguard baseline |
Security snapshot + drift comparison over time | ๐ถ Beta |
aguard monitor |
Behavioral trace audit โ replays tool calls against policy | ๐ถ Beta |
aguard scan
Finds real issues in any AI project โ no MCP configs needed. Detects eval/exec on LLM output, hardcoded secrets (18+ patterns), Dockerfile misconfigs, unpinned AI deps, LangChain unsafe settings, system prompt leaks, and more.
$ aguard scan ./my-project/
๐ด CRITICAL: Unsafe execution of LLM output: eval() (MCP05:2025)
๐ HIGH: Hardcoded OpenAI Key detected (MCP01:2025)
๐ HIGH: .env file not in .gitignore (MCP01:2025)
๐ก MEDIUM: Unpinned AI dependencies (MCP04:2025)
๐ Risk Score: 100/100 โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
๐ Findings: 1 critical ยท 14 high ยท 3 medium
CI flags: --max-risk 50 / --fail-on high / --format sarif
aguard test
Tests 22 attack scenarios against your declared security policy. Reports what gets caught vs. what gets through. Pure policy coverage testing โ no LLM needed.
โ ๏ธ This tests whether your policy document would block each attack โ not whether your actual LLM agent would resist it. Real agent resistance requires runtime testing against a live agent (planned for v1.2).
6 attack packs: prompt-injection, data-exfiltration, permission-escalation, approval-bypass, tool-poisoning, context-manipulation.
$ aguard test --policy agent-policy.yaml --attack-pack all
โ
PASS PI-001 Direct instruction override execute Blocked
โ
PASS DE-001 File export of sensitive data file_write Blocked
โ FAIL DE-004 SQL data dump sql_query NOT caught
๐ Results: 21 blocked ยท 1 missed ยท 22 total
Recommended fixes:
โ Add sql_query to require_human_approval
CI flag: --fail-on-miss exits with code 1 if any attacks succeed.
aguard owasp
Evaluates 20 controls mapped to an OWASP-inspired control framework. Control IDs use a project-defined MCP01โMCP10 taxonomy that draws on OWASP LLM Top 10 and the emerging OWASP Agentic AI initiative, with MCP-specific extensions the existing standards don't yet cover.
$ aguard owasp ./my-project/
โ MCP01:2025 Token Mismanagement: 3 failed, 1 passed
โ AGC-MCP01-001: No hardcoded secrets
โ AGC-MCP01-002: No static long-lived tokens
โ
MCP07:2025 AuthN/AuthZ: 2 passed
โ AGC-MCP07-001: Remote servers have auth
โ AGC-MCP07-002: HTTPS transport
๐ Control Coverage: 9/10 risks
Controls: 8 passed ยท 6 failed ยท 6 not tested ยท 20 total
aguard whatif
Simulates policy changes and shows risk score impact before applying them.
$ aguard whatif --config agent-config.yaml
Current Risk Score: 100/100
โ
Restrict SQL to SELECT-only โ 68 (โ 17%)
โ
Add human-in-the-loop for file ops โ 54 (โ 34%)
โ
Add rate limits + timeout โ 48 (โ 41%)
๐ก Apply all 3 โ Risk drops to 38 (โ54%)
aguard evidence
Generates a launch readiness packet โ risk summary, findings, tool permissions, policy coverage, required mitigations, and sign-off checklist. See examples/sample-launch-readiness.md.
aguard baseline
Save a security snapshot, compare later to detect drift.
aguard baseline create . # Save current state
aguard baseline compare . # What changed?
aguard baseline compare . --fail-on-drift # CI: fail if new findings
aguard monitor
Replays agent action traces against a declared policy. Detects denied tool calls, unauthorized access, permission escalation, and missing/expired approvals.
aguard monitor --trace-file traces.jsonl --policy policy.yaml
Accepts OpenTelemetry JSONL or simple {"tool_name": "X", "tool_args": {}} format.
How It Fits
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
โ LAYER 1: Content Guardrails (existing) โ
โ Azure Content Safety ยท NeMo ยท Guardrails AI โ
โ โ Protects what LLMs SAY โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโค
โ LAYER 2: Agent Security (Pluto AgentGuard) โ
โ scan ยท test ยท owasp ยท whatif ยท evidence ยท baseline โ
โ โ Watches what agents DO โ
โโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโโ
Risk Scoring
See docs/risk-scoring.md for the full scoring methodology โ formula, weights, examples, CI threshold guidance, and limitations.
OWASP-Inspired Control Matrix
See docs/owasp-control-matrix.md for the complete mapping of 20 controls. Control IDs draw on OWASP LLM Top 10 (LLM01โLLM10) and introduce MCP-specific extensions (MCP01โMCP10) for risks the existing standards don't yet cover.
Roadmap
- v0.1โv0.5 โ Scanner, monitor, whatif, evidence, baseline, CI gates, SARIF, HTML reports
- v0.8 โ Policy coverage testing (17 scenarios, 5 attack packs)
- v0.9 โ OWASP-inspired control framework (20 controls, coverage reports)
- v0.9.1 โ Context manipulation pack (context stuffing, multi-turn confusion, indirect injection, RAG poisoning), supply-chain manifest poisoning scenario
- v1.0 โ Runtime proxy / tool-call firewall (observability on live tool calls without full red-team harness)
- v1.1 โ Multi-framework adapters (LangChain, CrewAI, AutoGen)
- v1.2 โ Live agent testing (send adversarial inputs to running agents)
Project Structure
pluto-aguard/
โโโ src/pluto_aguard/
โ โโโ cli.py # 7 CLI commands
โ โโโ models.py # Finding, RiskScore, ControlResult, etc.
โ โโโ scanners/ # MCP + AI config + permission scanners
โ โโโ testing/ # 22 attack scenarios across 6 packs
โ โโโ controls/ # 20 OWASP-aligned control definitions
โ โโโ evidence/ # Launch readiness packet generator
โ โโโ baseline/ # Snapshot + drift comparison
โ โโโ monitor/ # Behavioral trace audit
โ โโโ simulator/ # What-If policy simulation
โ โโโ reports/ # HTML + SARIF output
โโโ examples/ # Demo project + configs + traces
โโโ docs/ # Risk scoring, OWASP matrix, GitHub Action docs
โโโ tests/ # 95 tests
โโโ action.yml # GitHub Action
โโโ SECURITY.md
Contributing
See CONTRIBUTING.md for setup and guidelines.
License
Apache License 2.0 โ see LICENSE.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pluto_aguard-0.9.2.tar.gz.
File metadata
- Download URL: pluto_aguard-0.9.2.tar.gz
- Upload date:
- Size: 83.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
97bd85b2af76ade922c8552ae1597f8b6da5ff9a01e30fe912da39fc7441456d
|
|
| MD5 |
25ea706b90689eb276cb336970910c42
|
|
| BLAKE2b-256 |
3acdb599c31a65a26bca715a9a82479f5f091e01486b2ee79b74b9c6e84ca6de
|
File details
Details for the file pluto_aguard-0.9.2-py3-none-any.whl.
File metadata
- Download URL: pluto_aguard-0.9.2-py3-none-any.whl
- Upload date:
- Size: 63.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
481d7c9b9c360da6671eab34e094fee9f604012d9f3efd860352b6b614fbc374
|
|
| MD5 |
ae3dfeca6c66453a1e8bce635bf5c044
|
|
| BLAKE2b-256 |
0235d4ccea4f792013d882cd28d721f7966cf6ae0dcdfc0728fe8bb0ed258bf0
|