Security toolkit for AI agents - machine scan for dangerous skills/MCP configs + prompt injection/extraction testing
Project description
Security scanner for AI agents
Docs · MCP Registry · Dashboard · Blog
pip install agentseal
agentseal guard
Scans your machine for dangerous skill files, MCP server configs, and toxic data flows across 17+ AI agents. No API key required.
Architecture
graph TD
U["User"] -->|prompt| A["AI Agent (LLM)"]
A -->|tool call| M1["MCP Server\n(filesystem)"]
A -->|tool call| M2["MCP Server\n(slack)"]
A -->|tool call| M3["MCP Server\n(database)"]
M1 -->|reads| FS["~/.ssh/\n~/.aws/\n~/Documents/"]
M2 -->|reads| SL["Messages\nChannels"]
M3 -->|queries| DB["Tables\nCredentials"]
SL -.->|"toxic flow"| M1
M1 -.->|"exfiltration"| EX["Attacker"]
style U fill:#1a1a2e,stroke:#58a6ff,color:#e6edf3
style A fill:#1a1a2e,stroke:#58a6ff,color:#e6edf3
style M1 fill:#3b1d0e,stroke:#f59e0b,color:#e6edf3
style M2 fill:#3b1d0e,stroke:#f59e0b,color:#e6edf3
style M3 fill:#3b1d0e,stroke:#f59e0b,color:#e6edf3
style EX fill:#3b0e0e,stroke:#ef4444,color:#e6edf3
style FS fill:#1a1a2e,stroke:#30363d,color:#8b949e
style SL fill:#1a1a2e,stroke:#30363d,color:#8b949e
style DB fill:#1a1a2e,stroke:#30363d,color:#8b949e
MCP servers give AI agents access to local files, databases, APIs, and credentials. Tool descriptions can contain hidden instructions that the agent follows but the user never sees. AgentSeal detects these threats across four attack surfaces.
Commands
| Command | Description | API key |
|---|---|---|
agentseal guard |
Scan skill files, MCP configs, toxic data flows, and supply chain changes | No |
agentseal shield |
Real-time file monitoring with desktop alerts and auto-quarantine | No |
agentseal scan |
Test system prompts against 225+ adversarial probes | Yes* |
agentseal scan-mcp |
Audit live MCP server tool descriptions for poisoning | No |
*Free with Ollama. Cloud providers require an API key.
Guard
Scans all AI agent configurations on your machine. Supports Claude Code, Cursor, Windsurf, VS Code, Gemini CLI, Codex, Cline, Copilot, and others.
agentseal guard
SKILLS
[XX] sketchy-rules MALWARE Credential access
Remove this skill immediately and rotate all credentials.
[OK] 4 more safe skills
MCP SERVERS
[XX] filesystem DANGER Access to SSH private keys
Restrict filesystem MCP server: remove .ssh from allowed paths.
TOXIC FLOWS
[HIGH] Data exfiltration path: filesystem + slack
Detection pipeline
graph LR
IN["Skill Files\nMCP Configs"] --> P["Pattern\nSignatures"]
P --> D["Deobfuscation\n(Unicode Tags,\nBase64, BiDi, ZWC)"]
D --> S["Semantic\nAnalysis\n(MiniLM-L6-v2)"]
S --> B["Baseline\nTracking\n(SHA-256)"]
B --> OUT["Report +\nSeverity"]
style IN fill:#1a1a2e,stroke:#58a6ff,color:#e6edf3
style P fill:#161b22,stroke:#30363d,color:#e6edf3
style D fill:#161b22,stroke:#30363d,color:#e6edf3
style S fill:#161b22,stroke:#30363d,color:#e6edf3
style B fill:#161b22,stroke:#30363d,color:#e6edf3
style OUT fill:#0d4429,stroke:#22c55e,color:#e6edf3
Scan
225 attack probes: 82 extraction techniques, 143 injection techniques, 8 adaptive mutation transforms. Deterministic n-gram and canary token scoring. No LLM judge.
OpenAI
agentseal scan --prompt "You are a helpful assistant..." --model gpt-4o
Ollama (free, local)
agentseal scan --prompt "You are a helpful assistant..." --model ollama/llama3.1:8b
HTTP endpoint
agentseal scan --url http://localhost:8080/chat
Scan-MCP
Connects to live MCP servers over stdio or SSE. Enumerates tools, analyzes descriptions through pattern matching, deobfuscation, semantic similarity, and optional LLM classification. Outputs a trust score per server.
agentseal scan-mcp --server npx @modelcontextprotocol/server-filesystem /tmp
Shield
Watches agent config paths in real time. Desktop notifications on threats. Quarantines files with detected payloads.
pip install agentseal[shield]
agentseal shield
Python API
from agentseal import AgentValidator
validator = AgentValidator.from_openai(
client=openai.AsyncOpenAI(),
model="gpt-4o",
system_prompt="You are a helpful assistant...",
)
report = await validator.run()
print(f"Trust score: {report.trust_score}/100")
Anthropic / HTTP / Custom
# Anthropic
validator = AgentValidator.from_anthropic(
client=client, model="claude-sonnet-4-5-20250929", system_prompt="..."
)
# HTTP endpoint
validator = AgentValidator.from_endpoint(url="http://localhost:8080/chat")
# Custom function
validator = AgentValidator(agent_fn=my_agent, ground_truth_prompt="...")
CI/CD
agentseal scan --file ./prompt.txt --model gpt-4o --min-score 75
Exit code 1 if trust score is below threshold. SARIF output supported via --output sarif.
Supported Providers
| Provider | Flag | API key |
|---|---|---|
| OpenAI | --model gpt-4o |
OPENAI_API_KEY |
| Anthropic | --model claude-sonnet-4-5-20250929 |
ANTHROPIC_API_KEY |
| Ollama | --model ollama/llama3.1:8b |
None |
| LiteLLM | --model any --litellm-url http://... |
Varies |
| HTTP | --url http://your-agent.com/chat |
None |
MCP Security Registry
2,200+ MCP servers scanned for security risks. Trust scores, tool analysis, and finding details for each server.
Pro
AgentSeal Pro extends the scanner with MCP tool poisoning probes (+45), RAG poisoning probes (+28), multimodal attack probes (+13), behavioral genome mapping, PDF reports, and a dashboard.
Contributing
If you find a detection gap or a false positive, please open an issue.
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agentseal-0.8.1.tar.gz.
File metadata
- Download URL: agentseal-0.8.1.tar.gz
- Upload date:
- Size: 318.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ab5db58ee381cf344a7d16b4a2b4ec5f300e5ce654c457317a54d120352126c2
|
|
| MD5 |
613b8a2bf0a8054db1095d225bb61a34
|
|
| BLAKE2b-256 |
67d88671e75c0bb777c163529d8c3e7ab074d8a5874aaf7cab7002f6a1df98a1
|
File details
Details for the file agentseal-0.8.1-py3-none-any.whl.
File metadata
- Download URL: agentseal-0.8.1-py3-none-any.whl
- Upload date:
- Size: 261.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.14.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b67b5ed174949473dc2b47d9a295e838fbd97fc308dc975ef62465a6b724e633
|
|
| MD5 |
0f4abca7fb4426c79d53d8297ca3083a
|
|
| BLAKE2b-256 |
3a1ef07ed22be4585ebc93ba23b4d182fe38706d55b560d3aaffcae828d73947
|