AI Agent Security Testing — discovers dangerous tool chain compositions via knowledge graph analysis
Project description
ZIRAN 🧘
AI Agent Security Testing
Find vulnerabilities in AI agents -- not just LLMs, but agents with tools, memory, and multi-step reasoning.
Install · Quick Start · Web UI · Examples · Docs
Benchmarks
565 attack vectors · 11 categories · 90% OWASP LLM Top 10 · 20 benchmarks analyzed
| Benchmark | Coverage |
|---|---|
| AgentHarm (ICLR 2025) | 100% harm categories |
| JailbreakBench (NeurIPS 2024) | 100% categories, 175 vectors |
| Agent Security Bench | 100% vectors (565/400) |
| HarmBench (ICML 2024) | 55.6% tactics, 175 jailbreak vectors |
| R-Judge | 100% risk types |
| ALERT | 100% micro categories (32/32) |
| MITRE ATLAS | 73.3% attack categories |
Full results: benchmarks/ · docs
Why ZIRAN?
Most security tools test prompts and tools in isolation. But agent vulnerabilities emerge from how tools interact -- an agent with read_file and http_request has a data exfiltration path, even though neither tool is dangerous alone. Testing each tool individually misses this entirely.
ZIRAN models your agent as a graph of capabilities and tests what happens when they combine.
| Capability | ZIRAN | Promptfoo | Invariant (Snyk) | Garak | PyRIT | Inspect AI |
|---|---|---|---|---|---|---|
| Tool chain discovery (graph-based) | Yes | -- | Policy-based | -- | -- | -- |
| Side-effect detection (execution-level) | Yes | -- | Trace-based | -- | -- | Sandbox |
| Multi-phase campaigns w/ graph feedback | Yes | Turn-level | Flow analysis | -- | Composable | Multi-turn |
| Autonomous pentesting agent | Yes | -- | -- | -- | -- | -- |
| Multi-agent coordination | Yes | -- | -- | -- | -- | -- |
| Knowledge graph tracking | Yes | -- | Policy lang. | -- | -- | -- |
| Agent-aware (tools + memory) | Yes | Partial | Yes | -- | -- | Partial |
| A2A protocol support | Yes | -- | -- | -- | -- | -- |
| MCP protocol support | Yes | Partial | Yes | -- | -- | -- |
| Encoding/obfuscation attacks | Yes (8) | Yes (12+) | -- | -- | -- | -- |
| Industry compliance plugins | -- | Yes (46) | -- | -- | -- | -- |
| Streaming (SSE/WebSocket) | Yes | -- | -- | -- | -- | -- |
| CI/CD quality gate | Yes | Yes | -- | -- | -- | -- |
| Open source | Apache-2.0 | MIT | Partial | Apache-2.0 | MIT | MIT |
What these capabilities catch:
- Tool Chain Discovery -- Individual tools pass security review, but their compositions create vulnerabilities. Graph-based analysis finds transitive attack paths (e.g.,
read_file->http_request= data exfiltration) that list-based testing misses. - Side-Effect Detection -- Agents can refuse a request in their text response while still executing the dangerous tool call. Without execution-level monitoring, these silent failures go undetected.
- Multi-Phase Campaigns -- Real attackers don't send a single malicious prompt. They build trust, map capabilities, then exploit. Multi-phase campaigns model this behavior to find vulnerabilities that single-turn tests miss.
- Knowledge Graph -- A live graph of discovered tools, permissions, and data flows grows as the scan progresses. Each phase uses it to plan the next, catching vulnerabilities that only appear after building enough context about the agent.
- Multi-Agent Coordination -- In multi-agent systems, an agent may trust messages from peers without validation. Testing cross-agent trust boundaries reveals lateral movement paths.
- A2A + MCP Protocols -- Tests Agent-to-Agent and MCP agents through their native protocols, exercising the actual attack surface rather than a simplified proxy.
- Framework Agnostic -- LangChain, CrewAI, Bedrock, MCP, browser UIs, remote HTTPS agents, or custom adapters.
What ZIRAN Is / What ZIRAN Is Not
ZIRAN is an agent security scanner that discovers dangerous tool compositions via graph analysis, detects execution-level side effects, and runs multi-phase campaigns that model real attacker behavior.
ZIRAN is not:
- An LLM safety/alignment tool -- for prompt injection breadth, jailbreak templates, and compliance testing, use Promptfoo or Garak
- A runtime guardrail -- for real-time input/output protection, use NeMo Guardrails, Lakera Guard, or LLM Guard
- A general-purpose eval framework -- for model evaluation and benchmarking, use Inspect AI or Deepeval
Works With
ZIRAN is complementary to other tools in the AI security ecosystem:
Pre-deploy testing:
- Promptfoo for attack breadth (encoding strategies, jailbreak templates, compliance plugins) + ZIRAN for agent depth (tool chains, side-effects, campaigns)
- Garak for LLM-layer vulnerability scanning + ZIRAN for agent-layer tool chain analysis
Runtime governance:
- NeMo Guardrails / Lakera for runtime input/output protection + ZIRAN for pre-deployment testing
- Invariant (Snyk) for runtime policy enforcement + ZIRAN for pre-deploy tool chain analysis
Observability:
- Langfuse for production trace analytics + ZIRAN
analyze-tracesfor security evaluation of production behavior - LangSmith for debugging and eval + ZIRAN for security-focused campaign testing
See the Agent Security Landscape for a full mapping of tools across pre-deploy, runtime, and observability layers.
Install
pip install ziran
# with framework adapters
pip install ziran[langchain] # LangChain support
pip install ziran[crewai] # CrewAI support
pip install ziran[a2a] # A2A protocol support
pip install ziran[streaming] # SSE/WebSocket streaming
pip install ziran[pentest] # autonomous pentesting agent
pip install ziran[otel] # OpenTelemetry tracing
pip install ziran[ui] # web dashboard
pip install ziran[all] # everything
Web UI
ZIRAN includes a built-in web dashboard for visual security analysis. Install the UI extra and start:
pip install ziran[ui]
ziran ui
# Dashboard: http://127.0.0.1:8484
Or with Docker:
docker compose up
# Dashboard: http://localhost:8484
Attack Library -- 565 vectors across 11 categories
Scan Configuration
Quick Start
CLI
# scan a LangChain agent (in-process)
ziran scan --framework langchain --agent-path my_agent.py
# scan a remote agent over HTTPS
ziran scan --target target.yaml
# adaptive campaign with LLM-driven strategy
ziran scan --target target.yaml --strategy llm-adaptive
# stream responses in real-time
ziran scan --target target.yaml --streaming
# scan with encoding bypass variants (Base64 + ROT13)
ziran scan --target target.yaml --encoding base64 --encoding rot13
# scan with OpenTelemetry tracing
ziran scan --target target.yaml --otel
# scan a multi-agent system
ziran multi-agent-scan --target target.yaml
# discover capabilities of a remote agent
ziran discover --target target.yaml
# autonomous pentesting agent
ziran pentest --target target.yaml
# interactive red-team mode
ziran pentest --target target.yaml --interactive
# view the interactive HTML report
open reports/campaign_*_report.html
Python API
import asyncio
from ziran.application.agent_scanner.scanner import AgentScanner
from ziran.application.attacks.library import AttackLibrary
from ziran.infrastructure.adapters.langchain_adapter import LangChainAdapter
adapter = LangChainAdapter(agent=your_agent)
scanner = AgentScanner(adapter=adapter, attack_library=AttackLibrary())
result = asyncio.run(scanner.run_campaign())
print(f"Vulnerabilities found: {result.total_vulnerabilities}")
print(f"Dangerous tool chains: {len(result.dangerous_tool_chains)}")
See examples/ for 22 runnable demos -- from static analysis to autonomous pentesting.
Remote Agent Scanning
ZIRAN can test any published agent over HTTPS -- no source code or in-process access required. Define your target in a YAML file:
# target.yaml
name: my-agent
url: https://agent.example.com
protocol: auto # auto | rest | openai | mcp | a2a
auth:
type: bearer
token_env: AGENT_API_KEY
tls:
verify: true
Supported protocols:
| Protocol | Use Case | Auto-detected via |
|---|---|---|
| REST | Generic HTTP endpoints | Fallback default |
| OpenAI-compatible | Chat completions API (/v1/chat/completions) |
Path probing |
| MCP | Model Context Protocol agents (JSON-RPC 2.0) | JSON-RPC response |
| A2A | Google Agent-to-Agent protocol | /.well-known/agent.json |
# auto-detect protocol and scan
ziran scan --target target.yaml
# force a specific protocol
ziran scan --target target.yaml --protocol openai
# A2A agent with Agent Card discovery
ziran scan --target a2a_target.yaml --protocol a2a
See examples/15-remote-agent-scan/ for ready-to-use target configurations.
What ZIRAN Finds
Prompt-level -- injection, system prompt extraction, memory poisoning, chain-of-thought manipulation.
Tool-level -- tool manipulation, privilege escalation, data exfiltration chains.
Tool chains -- automatic graph analysis of dangerous tool compositions:
+----------+---------------------+-----------------------------+--------------------------------------+
| Risk | Type | Tools | Description |
+----------+---------------------+-----------------------------+--------------------------------------+
| critical | data_exfiltration | read_file -> http_request | File contents sent to external server|
| critical | sql_to_rce | sql_query -> execute_code | SQL results executed as code |
| high | pii_leakage | get_user_info -> external_api| User PII sent to third-party API |
+----------+---------------------+-----------------------------+--------------------------------------+
How It Works
flowchart LR
subgraph agent["🤖 Your Agent"]
direction TB
T["🔧 Tools"]
M["🧠 Memory"]
P["🔑 Permissions"]
end
agent -->|"adapter layer"| D
subgraph ziran["⛩️ ZIRAN Pipeline"]
direction TB
D["🔍 DISCOVER\nProbe tools, permissions,\ndata access"]
MAP["🗺️ MAP\nBuild knowledge graph\n(NetworkX MultiDiGraph)"]
A["⚡ ANALYZE\nWalk graph for dangerous\nchains (30+ patterns)"]
ATK["🎯 ATTACK\nMulti-phase exploits\ninformed by the graph"]
R["📋 REPORT\nScored findings with\nremediation guidance"]
D --> MAP --> A --> ATK --> R
end
R --> HTML["📊 HTML\nInteractive graph"]
R --> MD["📝 Markdown\nCI/CD tables"]
R --> JSON["📦 JSON\nMachine-parseable"]
style agent fill:#1a1a2e,stroke:#e94560,color:#fff,stroke-width:2px
style ziran fill:#0f3460,stroke:#e94560,color:#fff,stroke-width:2px
style D fill:#16213e,stroke:#0ea5e9,color:#fff
style MAP fill:#16213e,stroke:#0ea5e9,color:#fff
style A fill:#16213e,stroke:#0ea5e9,color:#fff
style ATK fill:#16213e,stroke:#e94560,color:#fff
style R fill:#16213e,stroke:#10b981,color:#fff
style HTML fill:#1e293b,stroke:#10b981,color:#fff
style MD fill:#1e293b,stroke:#10b981,color:#fff
style JSON fill:#1e293b,stroke:#10b981,color:#fff
style T fill:#2d2d44,stroke:#e94560,color:#fff
style M fill:#2d2d44,stroke:#e94560,color:#fff
style P fill:#2d2d44,stroke:#e94560,color:#fff
Campaign Phases
Campaigns run 8 phases: reconnaissance, trust building, capability mapping, vulnerability discovery, exploitation setup, execution, persistence, and exfiltration. Each phase feeds its findings into a live knowledge graph, and the graph informs which phase to run next.
Phases are not linear -- the knowledge graph drives execution order. A discovery in the exploitation phase may trigger a return to reconnaissance. An agent that reveals new tools during trust building causes capability mapping to re-run with updated context.
flowchart TD
KG["🧠 Knowledge Graph\n(live state)"]
R["🔍 Reconnaissance"] --> KG
TB["🤝 Trust Building"] --> KG
CM["🗺️ Capability Mapping"] --> KG
VD["⚡ Vulnerability Discovery"] --> KG
ES["🎯 Exploitation Setup"] --> KG
EX["💥 Execution"] --> KG
PE["🔒 Persistence"] --> KG
EXF["📤 Exfiltration"] --> KG
KG -->|"decides next phase"| R
KG -->|"decides next phase"| TB
KG -->|"decides next phase"| CM
KG -->|"decides next phase"| VD
KG -->|"decides next phase"| ES
KG -->|"decides next phase"| EX
KG -->|"decides next phase"| PE
KG -->|"decides next phase"| EXF
style KG fill:#1a1a2e,stroke:#e94560,color:#fff,stroke-width:2px
style R fill:#16213e,stroke:#0ea5e9,color:#fff
style TB fill:#16213e,stroke:#0ea5e9,color:#fff
style CM fill:#16213e,stroke:#0ea5e9,color:#fff
style VD fill:#16213e,stroke:#0ea5e9,color:#fff
style ES fill:#16213e,stroke:#e94560,color:#fff
style EX fill:#16213e,stroke:#e94560,color:#fff
style PE fill:#16213e,stroke:#e94560,color:#fff
style EXF fill:#16213e,stroke:#e94560,color:#fff
Three strategies control this:
fixed-- Sequential execution through all 8 phases (reproducible, good for CI)adaptive-- Rule-based reordering: skips phases that won't yield results given current graph state, revisits phases when new capabilities are discoveredllm-adaptive-- LLM-driven planning: an LLM examines the knowledge graph after each phase and decides what to do next
Reports
Three output formats, generated automatically:
- HTML -- Interactive knowledge graph with attack path highlighting
- Markdown -- CI/CD-friendly summary tables
- JSON -- Machine-parseable for programmatic consumption
CI/CD Integration
Use ZIRAN as a quality gate in your pipeline. Templates are available for five CI systems:
| CI System | Template | SARIF Integration |
|---|---|---|
| GitHub Actions | ziran-scan.yml |
GitHub Security tab |
| GitLab CI | gitlab-ci.yml |
GitLab Security Dashboard |
| Jenkins | Jenkinsfile |
Warnings Next Generation Plugin |
| CircleCI | circleci-config.yml |
Build artifacts |
| Azure Pipelines | azure-pipelines.yml |
PublishBuildArtifacts |
GitHub Actions (official action)
# .github/workflows/security.yml
- uses: taoq-ai/ziran@v0
with:
command: ci
result-file: scan_results.json
severity-threshold: medium
sarif-output: results.sarif
GitLab CI
ziran-security-scan:
stage: test
image: python:3.12-slim
before_script:
- pip install ziran
script:
- ziran ci --result-file scan_results.json --severity-threshold medium --output sarif --sarif-file gl-sast-report.json
artifacts:
reports:
sast: gl-sast-report.json
Outputs: status (passed/failed), trust-score, total-findings, critical-findings, sarif-file.
See CI integrations docs for Jenkins, CircleCI, and Azure Pipelines examples, or browse the template directory.
Development
git clone https://github.com/taoq-ai/ziran.git && cd ziran
uv sync --group dev
uv run ruff check . # lint
uv run mypy ziran/ # type-check
uv run pytest --cov=ziran # test
Contributing
See CONTRIBUTING.md. Ways to help:
- Report bugs
- Request features
- Submit Skill CVEs for tool vulnerabilities
- Add attack vectors (YAML) or adapters
Citation
If you use ZIRAN in academic work, please cite:
@software{ziran2026,
title = {ZIRAN: AI Agent Security Testing},
author = {{TaoQ AI} and Lage Perdigao, Leone},
year = {2026},
url = {https://github.com/taoq-ai/ziran},
license = {Apache-2.0},
version = {0.25.0}
}
License
Apache License 2.0 -- See NOTICE for third-party attributions.
Built by TaoQ AI
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ziran-0.26.0.tar.gz.
File metadata
- Download URL: ziran-0.26.0.tar.gz
- Upload date:
- Size: 3.6 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6e0ddcfb57a0b86be4acc0c4712bc089acf989f2b4bf69a02a802874c8d7bf11
|
|
| MD5 |
a1149e98bbc6b3ce1552200e86cef3a1
|
|
| BLAKE2b-256 |
c0821e52f374c2f5849caeaf04b2ef8f8eef51013c93f8bb2c104bab196def9a
|
Provenance
The following attestation bundles were made for ziran-0.26.0.tar.gz:
Publisher:
release.yml on taoq-ai/ziran
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ziran-0.26.0.tar.gz -
Subject digest:
6e0ddcfb57a0b86be4acc0c4712bc089acf989f2b4bf69a02a802874c8d7bf11 - Sigstore transparency entry: 1264311933
- Sigstore integration time:
-
Permalink:
taoq-ai/ziran@0cd888c0011ffcf2b098ef5d8616b50dbef8c3cf -
Branch / Tag:
refs/tags/v0.26.0 - Owner: https://github.com/taoq-ai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@0cd888c0011ffcf2b098ef5d8616b50dbef8c3cf -
Trigger Event:
push
-
Statement type:
File details
Details for the file ziran-0.26.0-py3-none-any.whl.
File metadata
- Download URL: ziran-0.26.0-py3-none-any.whl
- Upload date:
- Size: 543.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
58185524101d8e94d224f31f5f28b8d606fcff63c4e5a9cbccbe219af48310f7
|
|
| MD5 |
40185f3ab862a921cd326fcea1d36e20
|
|
| BLAKE2b-256 |
2ccfb0a64dee6807631a511d67bbefa306f7636f3857d957d5cbd247dfffdd0e
|
Provenance
The following attestation bundles were made for ziran-0.26.0-py3-none-any.whl:
Publisher:
release.yml on taoq-ai/ziran
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
ziran-0.26.0-py3-none-any.whl -
Subject digest:
58185524101d8e94d224f31f5f28b8d606fcff63c4e5a9cbccbe219af48310f7 - Sigstore transparency entry: 1264312037
- Sigstore integration time:
-
Permalink:
taoq-ai/ziran@0cd888c0011ffcf2b098ef5d8616b50dbef8c3cf -
Branch / Tag:
refs/tags/v0.26.0 - Owner: https://github.com/taoq-ai
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@0cd888c0011ffcf2b098ef5d8616b50dbef8c3cf -
Trigger Event:
push
-
Statement type: