Multi-agent attack simulation framework for AI systems
Project description
AgentHive
Multi-agent attack simulation framework for AI systems.
AgentHive extends mcpwn from the single-agent domain to the multi-agent domain, modeling attacks that are only possible when multiple AI agents interact with each other.
Installation
pip install agenthive-sim
For development:
pip install -e ".[dev,lab]"
Quick Start
# Generate a scenario template
agenthive scenario my-scenario -o scenario.yaml
# Run a simulation
agenthive simulate scenario.yaml -v
# List available attack categories
agenthive list-scenarios
# Start the vulnerable lab server
agenthive lab
Scenario YAML Format
Scenarios define the agent topology and attacks to simulate:
name: my-scenario
description: "Multi-agent SSRF through agent chain"
max_steps: 50 # max simulation steps before timeout
timeout_seconds: 300
agents:
- role: attacker # attacker | victim | observer | coordinator
name: red-agent-1
capabilities:
- prompt_injection
- tool_manipulation
- role: victim
name: blue-agent-1
capabilities:
- data_processing
- tool_usage
- role: victim
name: blue-agent-2
capabilities:
- collaboration
- file_operations
- role: observer
name: observer-1
capabilities: [monitoring]
attacks:
- category: tool_drift # must match AttackCategory
name: "Tool Drift"
description: "Drift tool definitions via shared memory"
severity: high # critical | high | medium | low | info
parameters: {} # attack-specific parameters
mitre_atlas: ["ATLAS-001"]
metadata:
environment: lab
difficulty: medium
Ecosystem Integration
AgentHive integrates with the MCP security ecosystem:
- mcp-taxonomy: Findings are convertible via
agenthive_finding_to_taxonomy()for unified correlation - MCPscop: SARIF and JSON reports are consumable by MCPscop dashboards
- mcpwn: Extends mcpwn patterns from single-agent to multi-agent domain
Attack Scenarios
| Category | Description | Severity |
|---|---|---|
tool_drift |
Exploit tool derivation caused by shared memory between agents | High |
long_horizon |
RL-based attacks spanning multiple agents in sequence | Critical |
collaboration_attack |
Manipulation of agent-to-agent collaboration | High |
authority_hijack |
Hijacking the authority chain between agents | Critical |
cross_agent_injection |
Prompt injection that propagates across agents | Critical |
multi_agent_ssrf |
Coordinated SSRF through multiple agents | High |
swarm_poisoning |
Poison one agent that propagates to the swarm | Critical |
identity_spoofing |
Identity spoofing between agents | High |
Academic References
- Evo-Attacker: Memory-Augmented RL for Long-Horizon Tool Attacks (ACL 2026)
- Memory-Induced Tool-Drift in LLM Agents
- Behind EvoMap: Agent-to-Agent Collaboration Network
- Authority Frontier Framework for Runtime Actuarial Control
- Deep-Research Agents Can Be Poisoned (Shmatikov et al.)
- MITRE ATLAS — Multi-Agent System Attack Patterns
Ecosystem
- Extends mcpwn — same stack, multi-agent domain
- Uses mcp-taxonomy
- Reports consumable by MCPscop
License
MIT — see LICENSE
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file agenthive_sim-0.1.0.tar.gz.
File metadata
- Download URL: agenthive_sim-0.1.0.tar.gz
- Upload date:
- Size: 37.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8a5ddba953604906fda47f9e4408877f3a64638f3ad78de36e882bade569338a
|
|
| MD5 |
c70d95df752aa8adaf4a38ba4cc31bce
|
|
| BLAKE2b-256 |
73d0dc5e4231ac5cc850bb20fc3f0935c0af55a95c77daf11df816d093719b9a
|
File details
Details for the file agenthive_sim-0.1.0-py3-none-any.whl.
File metadata
- Download URL: agenthive_sim-0.1.0-py3-none-any.whl
- Upload date:
- Size: 37.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
04ec87c1b4ae7f366292cb4a68fcb2bde0233ab065463e5a2d612205672cbcdc
|
|
| MD5 |
f1b6ecafb8def7492c335350b60a31c7
|
|
| BLAKE2b-256 |
bfd586437f5214558d83ffe4078089b4a1dac60f1cafc64189b13852493c51d6
|