AI Agent Security Testing — discovers dangerous tool chain compositions via knowledge graph analysis

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

leoneperdigao

These details have not been verified by PyPI

Project description

Find vulnerabilities in your AI agents.

Star us ❤️ → · 📚 Docs · 🧪 Examples · 📦 PyPI · 🐛 Issues

ZIRAN finds vulnerabilities in AI agents — not just LLMs, but agents with tools, memory, and multi-step reasoning. It models your agent as a graph of capabilities and tests what happens when they combine — surfacing dangerous tool chains, execution-level side effects, and multi-phase exploits that single-prompt scanners miss.

Graph-based · tool-chain discovery · Execution-aware · side-effect detection · Adaptive · 8-phase campaigns

Install · Quick Start · Web UI · Examples · Docs

ZIRAN Dashboard — web UI showing campaign results, attack library, and knowledge graph

Benchmarks

639 attack vectors · 11 categories · 100% OWASP LLM Top 10 · 72/86 MITRE ATLAS techniques · 20 benchmarks analyzed

Benchmark	Coverage
OWASP LLM Top 10	10/10 categories (strong or comprehensive)
MITRE ATLAS (Oct 2025)	72/86 techniques, 14/14 agent-specific
AgentHarm (ICLR 2025)	100% harm categories
JailbreakBench (NeurIPS 2024)	100% categories, 175 vectors
Agent Security Bench	100% vectors (639/400)
HarmBench (ICML 2024)	55.6% tactics, 175 jailbreak vectors
R-Judge	100% risk types
ALERT	100% micro categories (32/32)
TensorTrust / WildJailbreak / ToolEmu / CyberSecEval	Representative pattern families
LLMail-Inject / RAG Poisoning	Retrieval-ranked vectors across 4 document framings

Full results: benchmarks/ · docs

Why ZIRAN?

Most security tools test prompts and tools in isolation. But agent vulnerabilities emerge from how tools interact -- an agent with read_file and http_request has a data exfiltration path, even though neither tool is dangerous alone. Testing each tool individually misses this entirely.

ZIRAN models your agent as a graph of capabilities and tests what happens when they combine.

Capability	ZIRAN	Promptfoo	Invariant (Snyk)	Garak	PyRIT	Inspect AI
Tool chain discovery (graph-based)	Yes	--	Policy-based	--	--	--
Side-effect detection (execution-level)	Yes	--	Trace-based	--	--	Sandbox
Multi-phase campaigns w/ graph feedback	Yes	Turn-level	Flow analysis	--	Composable	Multi-turn
Autonomous pentesting agent	Yes	--	--	--	--	--
Multi-agent coordination	Yes	--	--	--	--	--
Knowledge graph tracking	Yes	--	Policy lang.	--	--	--
Agent-aware (tools + memory)	Yes	Partial	Yes	--	--	Partial
A2A protocol support	Yes	--	--	--	--	--
MCP protocol support	Yes	Partial	Yes	--	--	--
Encoding/obfuscation attacks	Yes (8)	Yes (12+)	--	--	--	--
Industry compliance plugins	--	Yes (46)	--	--	--	--
Streaming (SSE/WebSocket)	Yes	--	--	--	--	--
CI/CD quality gate	Yes	Yes	--	--	--	--
Open source	Apache-2.0	MIT	Partial	Apache-2.0	MIT	MIT

What these capabilities catch:

Tool-chain discovery — graph beats list

Side-by-side comparison: a list-based scanner sees four individually-safe tools (read_file, http_request, sql_query, exec_code) and reports no findings, while ZIRAN walks the capability graph and surfaces dangerous transitive compositions — read_file→http_request as critical data exfiltration, sql_query→exec_code as high-severity SQL-to-RCE.

Individual tools pass security review in isolation, but their compositions create vulnerabilities. Graph-based analysis finds transitive attack paths — read_file → http_request for data exfiltration, sql_query → exec_code for SQL-to-RCE — that list-based testing misses entirely.

Side-effect detection — chat is not the truth

Two stacked layers: on the surface, the agent replies 'I can't do that — request refused' to 'Delete user 42' and a chat-only scanner marks it safe; on the execution layer below, ZIRAN intercepts the actual tool call delete_user(id=42) firing silently and flags it as critical.

Agents can refuse a request in their text response while still executing the dangerous tool call underneath. ZIRAN intercepts at the execution layer and flags these silent failures — chat-only scanners mark them as safe.

Adaptive 8-phase campaigns — the graph drives the next move

A live knowledge graph grows as the scan progresses, and the graph picks the next phase — not a fixed sequence. A critical chain found mid-campaign immediately routes to Exploit Setup, while phases like Trust Building or Persistence are skipped when graph state shows they would not yield results. Three strategies control this: fixed (sequential, reproducible for CI), adaptive (rule-based reordering), and llm-adaptive (LLM examines the graph after each phase to plan).

And…

Multi-Agent Coordination -- In multi-agent systems, an agent may trust messages from peers without validation. Testing cross-agent trust boundaries reveals lateral movement paths.
A2A + MCP Protocols -- Tests Agent-to-Agent and MCP agents through their native protocols, exercising the actual attack surface rather than a simplified proxy.
Framework Agnostic -- LangChain, CrewAI, Bedrock, MCP, browser UIs, remote HTTPS agents, or custom adapters.

What ZIRAN Is / What ZIRAN Is Not

ZIRAN is an agent security scanner that discovers dangerous tool compositions via graph analysis, detects execution-level side effects, and runs multi-phase campaigns that model real attacker behavior.

ZIRAN is not:

An LLM safety/alignment tool -- for prompt injection breadth, jailbreak templates, and compliance testing, use Promptfoo or Garak
A runtime guardrail -- for real-time input/output protection, use NeMo Guardrails, Lakera Guard, or LLM Guard
A general-purpose eval framework -- for model evaluation and benchmarking, use Inspect AI or Deepeval

Works With

ZIRAN is complementary to other tools in the AI security ecosystem:

Pre-deploy testing:

Promptfoo for attack breadth (encoding strategies, jailbreak templates, compliance plugins) + ZIRAN for agent depth (tool chains, side-effects, campaigns)
Garak for LLM-layer vulnerability scanning + ZIRAN for agent-layer tool chain analysis

Runtime governance:

NeMo Guardrails / Lakera for runtime input/output protection + ZIRAN for pre-deployment testing
Invariant (Snyk) for runtime policy enforcement + ZIRAN for pre-deploy tool chain analysis

Observability:

Langfuse for production trace analytics + ZIRAN analyze-traces for security evaluation of production behavior
LangSmith for debugging and eval + ZIRAN for security-focused campaign testing

See the Agent Security Landscape for a full mapping of tools across pre-deploy, runtime, and observability layers.

Install

pip install ziran

# with framework adapters
pip install ziran[langchain]    # LangChain support
pip install ziran[crewai]       # CrewAI support
pip install ziran[a2a]          # A2A protocol support
pip install ziran[streaming]    # SSE/WebSocket streaming
pip install ziran[pentest]      # autonomous pentesting agent
pip install ziran[otel]         # OpenTelemetry tracing
pip install ziran[ui]            # web dashboard
pip install ziran[all]          # everything

Web UI

ZIRAN includes a built-in web dashboard for visual security analysis. Install the UI extra and start:

pip install ziran[ui]
ziran ui
# Dashboard: http://127.0.0.1:8484

Or with Docker:

docker compose up
# Dashboard: http://localhost:8484

Attack Library -- 639 vectors across 11 categories

Attack Library

Scan Configuration

New Run

Quick Start

CLI

# scan a LangChain agent (in-process)
ziran scan --framework langchain --agent-path my_agent.py

# scan a remote agent over HTTPS
ziran scan --target target.yaml

# adaptive campaign with LLM-driven strategy
ziran scan --target target.yaml --strategy llm-adaptive

# stream responses in real-time
ziran scan --target target.yaml --streaming

# scan with encoding bypass variants (Base64 + ROT13)
ziran scan --target target.yaml --encoding base64 --encoding rot13

# scan with OpenTelemetry tracing
ziran scan --target target.yaml --otel

# scan a multi-agent system
ziran multi-agent-scan --target target.yaml

# discover capabilities of a remote agent
ziran discover --target target.yaml

# autonomous pentesting agent
ziran pentest --target target.yaml

# interactive red-team mode
ziran pentest --target target.yaml --interactive

# view the interactive HTML report
open reports/campaign_*_report.html

Python API

import asyncio
from ziran.application.agent_scanner.scanner import AgentScanner
from ziran.application.attacks.library import AttackLibrary
from ziran.infrastructure.adapters.langchain_adapter import LangChainAdapter

adapter = LangChainAdapter(agent=your_agent)
scanner = AgentScanner(adapter=adapter, attack_library=AttackLibrary())

result = asyncio.run(scanner.run_campaign())
print(f"Vulnerabilities found: {result.total_vulnerabilities}")
print(f"Dangerous tool chains: {len(result.dangerous_tool_chains)}")

See examples/ for 22 runnable demos -- from static analysis to autonomous pentesting.

Remote Agent Scanning

ZIRAN can test any published agent over HTTPS -- no source code or in-process access required. Define your target in a YAML file:

# target.yaml
name: my-agent
url: https://agent.example.com
protocol: auto  # auto | rest | openai | mcp | a2a

auth:
  type: bearer
  token_env: AGENT_API_KEY

tls:
  verify: true

Supported protocols:

Protocol	Use Case	Auto-detected via
REST	Generic HTTP endpoints	Fallback default
OpenAI-compatible	Chat completions API (`/v1/chat/completions`)	Path probing
MCP	Model Context Protocol agents (JSON-RPC 2.0)	JSON-RPC response
A2A	Google Agent-to-Agent protocol	`/.well-known/agent.json`

# auto-detect protocol and scan
ziran scan --target target.yaml

# force a specific protocol
ziran scan --target target.yaml --protocol openai

# A2A agent with Agent Card discovery
ziran scan --target a2a_target.yaml --protocol a2a

See examples/15-remote-agent-scan/ for ready-to-use target configurations.

What ZIRAN Finds

Prompt-level -- injection, system prompt extraction, memory poisoning, chain-of-thought manipulation.

Tool-level -- tool manipulation, privilege escalation, data exfiltration chains.

Tool chains -- automatic graph analysis of dangerous tool compositions:

+----------+---------------------+-----------------------------+--------------------------------------+
| Risk     | Type                | Tools                       | Description                          |
+----------+---------------------+-----------------------------+--------------------------------------+
| critical | data_exfiltration   | read_file -> http_request   | File contents sent to external server|
| critical | sql_to_rce          | sql_query -> execute_code   | SQL results executed as code         |
| high     | pii_leakage         | get_user_info -> external_api| User PII sent to third-party API    |
+----------+---------------------+-----------------------------+--------------------------------------+

How It Works

Five sequential stages: DISCOVER probes tools, permissions, and data access; MAP builds a NetworkX MultiDiGraph of capabilities; ANALYZE walks the graph against 30+ dangerous-chain patterns; ATTACK runs multi-phase exploits informed by the graph; REPORT emits scored findings with remediation guidance.

Campaign phases

The ATTACK stage runs an 8-phase campaign — reconnaissance, trust building, capability mapping, vulnerability discovery, exploitation setup, execution, persistence, exfiltration. Phases are not linear: the live knowledge graph drives execution order, so a discovery during exploitation may trigger a return to reconnaissance, and revealed tools cause capability mapping to re-run with updated context. (See Adaptive 8-phase campaigns above for an animated walk-through, including how Trust Building and Persistence are skipped when graph state makes them irrelevant.)

Three strategies control this:

fixed -- Sequential execution through all 8 phases (reproducible, good for CI)
adaptive -- Rule-based reordering: skips phases that won't yield results given current graph state, revisits phases when new capabilities are discovered
llm-adaptive -- LLM-driven planning: an LLM examines the knowledge graph after each phase and decides what to do next

See adaptive campaigns docs.

Reports

Three output formats, generated automatically:

HTML -- Interactive knowledge graph with attack path highlighting
Markdown -- CI/CD-friendly summary tables
JSON -- Machine-parseable for programmatic consumption

Mock-up of a ZIRAN HTML campaign report — header with target metadata, severity counters (3 critical, 7 high, 12 medium, 28 low), a findings table listing the top tool-chain vulnerabilities (data exfiltration, SQL-to-RCE, PII leakage, prompt injection, multi-agent trust boundary), and a live knowledge graph with the critical attack paths highlighted.

CI/CD Integration

Use ZIRAN as a quality gate in your pipeline. Templates are available for five CI systems:

CI System	Template	SARIF Integration
GitHub Actions	`ziran-scan.yml`	GitHub Security tab
GitLab CI	`gitlab-ci.yml`	GitLab Security Dashboard
Jenkins	`Jenkinsfile`	Warnings Next Generation Plugin
CircleCI	`circleci-config.yml`	Build artifacts
Azure Pipelines	`azure-pipelines.yml`	PublishBuildArtifacts

GitHub Actions (official action)

# .github/workflows/security.yml
- uses: taoq-ai/ziran@v0
  with:
    command: ci
    result-file: scan_results.json
    severity-threshold: medium
    sarif-output: results.sarif

GitLab CI

ziran-security-scan:
  stage: test
  image: python:3.12-slim
  before_script:
    - pip install ziran
  script:
    - ziran ci --result-file scan_results.json --severity-threshold medium --output sarif --sarif-file gl-sast-report.json
  artifacts:
    reports:
      sast: gl-sast-report.json

Outputs: status (passed/failed), trust-score, total-findings, critical-findings, sarif-file.

See CI integrations docs for Jenkins, CircleCI, and Azure Pipelines examples, or browse the template directory.

Development

git clone https://github.com/taoq-ai/ziran.git && cd ziran
uv sync --group dev

uv run ruff check .            # lint
uv run mypy ziran/             # type-check
uv run pytest --cov=ziran      # test

Contributing

See CONTRIBUTING.md. Ways to help:

Report bugs
Request features
Submit Skill CVEs for tool vulnerabilities
Add attack vectors (YAML) or adapters

Citation

If you use ZIRAN in academic work, please cite:

@software{ziran2026,
  title     = {ZIRAN: AI Agent Security Testing},
  author    = {{TaoQ AI} and Lage Perdigao, Leone},
  year      = {2026},
  url       = {https://github.com/taoq-ai/ziran},
  license   = {Apache-2.0},
  version   = {0.25.0}
}

License

Apache License 2.0 -- See NOTICE for third-party attributions.

Built by TaoQ AI

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

leoneperdigao

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.31.0

May 22, 2026

0.30.0

Apr 22, 2026

0.29.0

Apr 22, 2026

0.28.0

Apr 21, 2026

0.27.0

Apr 21, 2026

0.26.0

Apr 9, 2026

0.25.0

Apr 4, 2026

0.23.0

Mar 24, 2026

0.21.0

Mar 23, 2026

0.20.0

Mar 23, 2026

0.19.0

Mar 23, 2026

0.18.4

Mar 23, 2026

0.18.3

Mar 23, 2026

0.18.2

Mar 21, 2026

0.18.1

Mar 21, 2026

0.18.0

Mar 21, 2026

0.17.0

Mar 21, 2026

0.16.0

Mar 21, 2026

0.15.0

Mar 21, 2026

0.14.0

Mar 20, 2026

0.13.0

Mar 19, 2026

0.12.0

Mar 17, 2026

0.11.0

Mar 16, 2026

0.10.2

Mar 15, 2026

0.10.1

Mar 14, 2026

0.10.0

Mar 14, 2026

0.9.0

Mar 14, 2026

0.8.0

Mar 11, 2026

0.7.0

Mar 11, 2026

0.6.1

Mar 7, 2026

0.6.0

Mar 5, 2026

0.5.0

Feb 27, 2026

0.4.0

Feb 25, 2026

0.3.3

Feb 14, 2026

0.3.2

Feb 13, 2026

0.3.1

Feb 12, 2026

0.3.0

Feb 12, 2026

0.2.0

Feb 12, 2026

0.1.1

Feb 12, 2026

0.1.0

Feb 11, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ziran-0.31.0.tar.gz (3.8 MB view details)

Uploaded May 22, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ziran-0.31.0-py3-none-any.whl (594.8 kB view details)

Uploaded May 22, 2026 Python 3

File details

Details for the file ziran-0.31.0.tar.gz.

File metadata

Download URL: ziran-0.31.0.tar.gz
Upload date: May 22, 2026
Size: 3.8 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ziran-0.31.0.tar.gz
Algorithm	Hash digest
SHA256	`813af109e2d8f78ed90011e617f45ad03df10ca05d7ea224484842cf238b7107`
MD5	`2928b80616d47d691f1cde8083593102`
BLAKE2b-256	`03b22643de85cdf11bfcdabeb670b1d076913a6fd796a44282f5d7fe808a52c4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ziran-0.31.0.tar.gz:

Publisher: release.yml on taoq-ai/ziran

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ziran-0.31.0.tar.gz
- Subject digest: 813af109e2d8f78ed90011e617f45ad03df10ca05d7ea224484842cf238b7107
- Sigstore transparency entry: 1605680346
- Sigstore integration time: May 22, 2026
Source repository:
- Permalink: taoq-ai/ziran@ccc92f9f77d430ccf2c6d07d7495565793254c70
- Branch / Tag: refs/tags/v0.31.0
- Owner: https://github.com/taoq-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@ccc92f9f77d430ccf2c6d07d7495565793254c70
- Trigger Event: push

File details

Details for the file ziran-0.31.0-py3-none-any.whl.

File metadata

Download URL: ziran-0.31.0-py3-none-any.whl
Upload date: May 22, 2026
Size: 594.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for ziran-0.31.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6c83cddbe6bfbf2a0a4baa23205b09fcd398cdc3ed4766d66a42c88c5372f1e3`
MD5	`8e2aea6bea8293b2c9198db5133266ec`
BLAKE2b-256	`684a6ea1d23e941464a95184c5ad09dbc51cdac1b53c3c34853e448a16c99bda`

See more details on using hashes here.

Provenance

The following attestation bundles were made for ziran-0.31.0-py3-none-any.whl:

Publisher: release.yml on taoq-ai/ziran

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: ziran-0.31.0-py3-none-any.whl
- Subject digest: 6c83cddbe6bfbf2a0a4baa23205b09fcd398cdc3ed4766d66a42c88c5372f1e3
- Sigstore transparency entry: 1605680489
- Sigstore integration time: May 22, 2026
Source repository:
- Permalink: taoq-ai/ziran@ccc92f9f77d430ccf2c6d07d7495565793254c70
- Branch / Tag: refs/tags/v0.31.0
- Owner: https://github.com/taoq-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@ccc92f9f77d430ccf2c6d07d7495565793254c70
- Trigger Event: push

ziran 0.31.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

Find vulnerabilities in your AI agents.

Benchmarks

Why ZIRAN?

Tool-chain discovery — graph beats list

Side-effect detection — chat is not the truth

Adaptive 8-phase campaigns — the graph drives the next move

And…

What ZIRAN Is / What ZIRAN Is Not

Works With

Install

Web UI

Attack Library -- 639 vectors across 11 categories

Scan Configuration

Quick Start

CLI

Python API

Remote Agent Scanning

What ZIRAN Finds

How It Works

Campaign phases

Reports

CI/CD Integration

GitHub Actions (official action)

GitLab CI

Development

Contributing

Citation

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance