Skip to main content

OWASP-inspired launch gate for AI agents โ€” scan MCP configs, test policies, simulate risk, and generate compliance evidence. Static-only, offline, no API keys.

Project description

๐Ÿ›ก๏ธ Pluto AgentGuard

Security launch gate for AI agents. Other tools scan configs โ€” AgentGuard tests your policy against attack scenarios, simulates risk impact, maps results to an OWASP-inspired control framework, and generates launch evidence.

CI License: Apache-2.0 Python 3.10+ PyPI

What Makes This Different

MCP security scanners are multiplying fast (Snyk agent-scan, Invariant guardrails, AgentSeal). Most focus on config detection or runtime analysis. AgentGuard adds policy coverage testing, what-if simulation, drift detection, and launch evidence โ€” all offline, no LLM or vendor lock-in:

Capability Scanners AgentGuard
Detect secrets & misconfigs statically (no server execution) ๐ŸŸก Varies โœ… aguard scan
Policy coverage testing (22 attack scenarios) โŒ โœ… aguard test
"What-if" risk impact before applying changes โŒ โœ… aguard whatif
OWASP-inspired control coverage (20 controls) โŒ โœ… aguard owasp
Launch readiness evidence packets โŒ โœ… aguard evidence
Baseline drift detection โŒ โœ… aguard baseline
Behavioral trace audit with approval model โŒ โœ… aguard monitor

๐Ÿ“บ Interactive demo โ€” see all 7 commands in action (clone repo, open in browser)

Quick Start (60 seconds)

pip install pluto-aguard

# Clone for examples
git clone https://github.com/arpitha-dhanapathi/pluto-aguard.git && cd pluto-aguard

# Scan a realistic insecure AI project โ€” finds 18 real issues
aguard scan ./examples/demo-agent-project/

# Test your policy against 22 attack scenarios
aguard test --policy ./examples/agent-policy.yaml --attack-pack all

# Generate OWASP-inspired control coverage report
aguard owasp ./examples/demo-agent-project/

# Simulate policy changes โ€” see risk drop before applying
aguard whatif --config ./examples/insecure-agent-config.yaml

# Generate launch readiness evidence packet
aguard evidence ./examples/ --config ./examples/insecure-agent-config.yaml \
  --policy ./examples/agent-policy.yaml

# Save baseline, detect drift later
aguard baseline create ./examples/
aguard baseline compare ./examples/

No cloud accounts. No API keys. Runs entirely locally.

GitHub Action

- name: Agent Security Gate
  uses: arpitha-dhanapathi/pluto-aguard@v0.9.2
  with:
    path: '.'
    max-risk: '50'
    fail-on: 'high'
    policy: 'agent-policy.yaml'
    attack-pack: 'all'
    sarif-output: 'results.sarif'

- uses: github/codeql-action/upload-sarif@v3
  with:
    sarif_file: results.sarif

See docs/github-action-usage.md for full options.


Commands

Command What It Does Maturity
aguard scan Static analysis โ€” secrets, misconfigs, unsafe AI code patterns โœ… Stable
aguard test Policy coverage testing โ€” 22 attack scenarios across 6 packs โœ… Stable
aguard owasp OWASP-inspired control coverage report (20 controls) โœ… Stable
aguard whatif Policy impact simulation โ€” risk delta before applying changes โœ… Stable
aguard evidence Launch readiness packet with approval checklist ๐Ÿ”ถ Beta
aguard baseline Security snapshot + drift comparison over time ๐Ÿ”ถ Beta
aguard monitor Behavioral trace audit โ€” replays tool calls against policy ๐Ÿ”ถ Beta

aguard scan

Finds real issues in any AI project โ€” no MCP configs needed. Detects eval/exec on LLM output, hardcoded secrets (18+ patterns), Dockerfile misconfigs, unpinned AI deps, LangChain unsafe settings, system prompt leaks, and more.

$ aguard scan ./my-project/

  ๐Ÿ”ด CRITICAL: Unsafe execution of LLM output: eval() (MCP05:2025)
  ๐ŸŸ  HIGH: Hardcoded OpenAI Key detected (MCP01:2025)
  ๐ŸŸ  HIGH: .env file not in .gitignore (MCP01:2025)
  ๐ŸŸก MEDIUM: Unpinned AI dependencies (MCP04:2025)

  ๐Ÿ“Š Risk Score: 100/100 โ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆโ–ˆ
  ๐Ÿ“‹ Findings: 1 critical ยท 14 high ยท 3 medium

CI flags: --max-risk 50 / --fail-on high / --format sarif

aguard test

Tests 22 attack scenarios against your declared security policy. Reports what gets caught vs. what gets through. Pure policy coverage testing โ€” no LLM needed.

โš ๏ธ This tests whether your policy document would block each attack โ€” not whether your actual LLM agent would resist it. Real agent resistance requires runtime testing against a live agent (planned for v1.2).

6 attack packs: prompt-injection, data-exfiltration, permission-escalation, approval-bypass, tool-poisoning, context-manipulation.

$ aguard test --policy agent-policy.yaml --attack-pack all

  โœ… PASS  PI-001  Direct instruction override        execute       Blocked
  โœ… PASS  DE-001  File export of sensitive data      file_write    Blocked
  โŒ FAIL  DE-004  SQL data dump                      sql_query     NOT caught

  ๐Ÿ“Š Results: 21 blocked ยท 1 missed ยท 22 total

  Recommended fixes:
    โ†’ Add sql_query to require_human_approval

CI flag: --fail-on-miss exits with code 1 if any attacks succeed.

aguard owasp

Evaluates 20 controls mapped to an OWASP-inspired control framework. Control IDs use a project-defined MCP01โ€“MCP10 taxonomy that draws on OWASP LLM Top 10 and the emerging OWASP Agentic AI initiative, with MCP-specific extensions the existing standards don't yet cover.

$ aguard owasp ./my-project/

  โŒ MCP01:2025 Token Mismanagement: 3 failed, 1 passed
    โœ— AGC-MCP01-001: No hardcoded secrets
    โœ“ AGC-MCP01-002: No static long-lived tokens
  โœ… MCP07:2025 AuthN/AuthZ: 2 passed
    โœ“ AGC-MCP07-001: Remote servers have auth
    โœ“ AGC-MCP07-002: HTTPS transport

  ๐Ÿ“Š Control Coverage: 9/10 risks
     Controls: 8 passed ยท 6 failed ยท 6 not tested ยท 20 total

aguard whatif

Simulates policy changes and shows risk score impact before applying them.

$ aguard whatif --config agent-config.yaml

  Current Risk Score: 100/100

  โœ… Restrict SQL to SELECT-only              โ†’ 68  (โ†“ 17%)
  โœ… Add human-in-the-loop for file ops       โ†’ 54  (โ†“ 34%)
  โœ… Add rate limits + timeout                โ†’ 48  (โ†“ 41%)

  ๐Ÿ’ก Apply all 3 โ†’ Risk drops to 38 (โ†“54%)

aguard evidence

Generates a launch readiness packet โ€” risk summary, findings, tool permissions, policy coverage, required mitigations, and sign-off checklist. See examples/sample-launch-readiness.md.

aguard baseline

Save a security snapshot, compare later to detect drift.

aguard baseline create .               # Save current state
aguard baseline compare .              # What changed?
aguard baseline compare . --fail-on-drift  # CI: fail if new findings

aguard monitor

Replays agent action traces against a declared policy. Detects denied tool calls, unauthorized access, permission escalation, and missing/expired approvals.

aguard monitor --trace-file traces.jsonl --policy policy.yaml

Accepts OpenTelemetry JSONL or simple {"tool_name": "X", "tool_args": {}} format.


How It Fits

โ”Œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”
โ”‚  LAYER 1: Content Guardrails (existing)             โ”‚
โ”‚  Azure Content Safety ยท NeMo ยท Guardrails AI        โ”‚
โ”‚  โ†’ Protects what LLMs SAY                           โ”‚
โ”œโ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”ค
โ”‚  LAYER 2: Agent Security (Pluto AgentGuard)         โ”‚
โ”‚  scan ยท test ยท owasp ยท whatif ยท evidence ยท baseline  โ”‚
โ”‚  โ†’ Watches what agents DO                           โ”‚
โ””โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”€โ”˜

Risk Scoring

See docs/risk-scoring.md for the full scoring methodology โ€” formula, weights, examples, CI threshold guidance, and limitations.

OWASP-Inspired Control Matrix

See docs/owasp-control-matrix.md for the complete mapping of 20 controls. Control IDs draw on OWASP LLM Top 10 (LLM01โ€“LLM10) and introduce MCP-specific extensions (MCP01โ€“MCP10) for risks the existing standards don't yet cover.

Roadmap

  • v0.1โ€“v0.5 โ€” Scanner, monitor, whatif, evidence, baseline, CI gates, SARIF, HTML reports
  • v0.8 โ€” Policy coverage testing (17 scenarios, 5 attack packs)
  • v0.9 โ€” OWASP-inspired control framework (20 controls, coverage reports)
  • v0.9.1 โ€” Context manipulation pack (context stuffing, multi-turn confusion, indirect injection, RAG poisoning), supply-chain manifest poisoning scenario
  • v1.0 โ€” Runtime proxy / tool-call firewall (observability on live tool calls without full red-team harness)
  • v1.1 โ€” Multi-framework adapters (LangChain, CrewAI, AutoGen)
  • v1.2 โ€” Live agent testing (send adversarial inputs to running agents)

Project Structure

pluto-aguard/
โ”œโ”€โ”€ src/pluto_aguard/
โ”‚   โ”œโ”€โ”€ cli.py                  # 7 CLI commands
โ”‚   โ”œโ”€โ”€ models.py               # Finding, RiskScore, ControlResult, etc.
โ”‚   โ”œโ”€โ”€ scanners/               # MCP + AI config + permission scanners
โ”‚   โ”œโ”€โ”€ testing/                # 22 attack scenarios across 6 packs
โ”‚   โ”œโ”€โ”€ controls/               # 20 OWASP-aligned control definitions
โ”‚   โ”œโ”€โ”€ evidence/               # Launch readiness packet generator
โ”‚   โ”œโ”€โ”€ baseline/               # Snapshot + drift comparison
โ”‚   โ”œโ”€โ”€ monitor/                # Behavioral trace audit
โ”‚   โ”œโ”€โ”€ simulator/              # What-If policy simulation
โ”‚   โ””โ”€โ”€ reports/                # HTML + SARIF output
โ”œโ”€โ”€ examples/                   # Demo project + configs + traces
โ”œโ”€โ”€ docs/                       # Risk scoring, OWASP matrix, GitHub Action docs
โ”œโ”€โ”€ tests/                      # 95 tests
โ”œโ”€โ”€ action.yml                  # GitHub Action
โ””โ”€โ”€ SECURITY.md

Contributing

See CONTRIBUTING.md for setup and guidelines.

License

Apache License 2.0 โ€” see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pluto_aguard-0.9.2.tar.gz (83.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pluto_aguard-0.9.2-py3-none-any.whl (63.6 kB view details)

Uploaded Python 3

File details

Details for the file pluto_aguard-0.9.2.tar.gz.

File metadata

  • Download URL: pluto_aguard-0.9.2.tar.gz
  • Upload date:
  • Size: 83.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for pluto_aguard-0.9.2.tar.gz
Algorithm Hash digest
SHA256 97bd85b2af76ade922c8552ae1597f8b6da5ff9a01e30fe912da39fc7441456d
MD5 25ea706b90689eb276cb336970910c42
BLAKE2b-256 3acdb599c31a65a26bca715a9a82479f5f091e01486b2ee79b74b9c6e84ca6de

See more details on using hashes here.

File details

Details for the file pluto_aguard-0.9.2-py3-none-any.whl.

File metadata

  • Download URL: pluto_aguard-0.9.2-py3-none-any.whl
  • Upload date:
  • Size: 63.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.13

File hashes

Hashes for pluto_aguard-0.9.2-py3-none-any.whl
Algorithm Hash digest
SHA256 481d7c9b9c360da6671eab34e094fee9f604012d9f3efd860352b6b614fbc374
MD5 ae3dfeca6c66453a1e8bce635bf5c044
BLAKE2b-256 0235d4ccea4f792013d882cd28d721f7966cf6ae0dcdfc0728fe8bb0ed258bf0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page