Black box for AI agents — capture decisions, generate forensic reports for EU AI Act compliance

These details have not been verified by PyPI

Project links

Homepage

Project description

Agent Forensics

Black box for AI agents. Capture every decision, auto-detect failure patterns, generate forensic reports for EU AI Act compliance.

When an AI agent makes a wrong purchase, leaks data, or fails silently — you need to know why. Agent Forensics records every decision point, tool call, and LLM interaction, then reconstructs the causal chain and auto-classifies what went wrong.

Agent Forensics Dashboard — Incident Session

Why

EU AI Act (Aug 2026): High-risk AI systems must provide decision traceability. Fines up to €35M or 7% of global revenue.
AI agents are already causing incidents: Meta Sev-1 data leak (Mar 2026), unauthorized purchases, fabricated customer responses.
No existing tool reconstructs the why behind agent failures. Monitoring tools watch in real-time. Forensics analyzes after the fact.

Install

From PyPI

pip install agent-forensics              # Core only (manual recording)
pip install agent-forensics[langchain]   # + LangChain integration
pip install agent-forensics[openai-agents]  # + OpenAI Agents SDK
pip install agent-forensics[crewai]      # + CrewAI
pip install agent-forensics[all]         # Everything

From Source

git clone https://github.com/ilflow4592/agent-forensics.git
cd agent-forensics
pip install -e ".[all]"

Quick Start — Full Walkthrough

Step 1: Install

pip install agent-forensics[all]

Step 2: Record agent actions

Choose one of the methods below depending on your setup.

Option A: Manual recording (any framework or custom agent)

# save this as demo.py
from agent_forensics import Forensics

f = Forensics(session="order-123", agent="shopping-agent")

# Record decisions and tool calls
f.decision("search_products", input={"query": "mouse"}, reasoning="User requested product search")
f.tool_call("search_api", input={"q": "mouse"}, output={"results": [{"name": "Logitech M750", "price": 45}]})

# Record external context injections (RAG, memory, etc.)
f.context_injection("vector_db", content={
    "document": "refund_policy.md",
    "chunk": "Refunds available within 30 days",
    "similarity_score": 0.92,
})

# Track system prompt changes (auto-detects drift)
f.prompt_state("You are a shopping assistant. Buy the cheapest option.")

# Guardrail checkpoints — was this action allowed?
f.guardrail(intent="check price", action="purchase item", allowed=True, reason="Within budget")

# Record errors and final output
f.error("purchase_failed", output={"reason": "Out of stock"})
f.finish("Could not complete purchase due to stock unavailability.")

Option B: LangChain — auto-capture with one line

from agent_forensics import Forensics

f = Forensics(session="order-123")
agent.invoke({"input": "..."}, config={"callbacks": [f.langchain()]})

Prompt drift detection is automatic — no manual calls needed.

Option C: OpenAI Agents SDK — auto-capture with one line

from agent_forensics import Forensics
from agents import Agent, Runner

f = Forensics(session="order-123")
agent = Agent(name="shopper", tools=[...], hooks=f.openai_agents())
result = await Runner.run(agent, "Buy a wireless mouse")

Model config (name, temperature, seed) is automatically captured for deterministic replay.

Option D: CrewAI — auto-capture with callbacks

from agent_forensics import Forensics

f = Forensics(session="order-123")
hooks = f.crewai()
agent = Agent(role="shopper", step_callback=hooks.step_callback)
task = Task(description="...", agent=agent, callback=hooks.task_callback)

Step 3: Generate reports

# Full Markdown report — timeline + decisions + causal chain + failure classification
print(f.report())

# Save as files
f.save_markdown()   # → forensics-report-order-123.md
f.save_pdf()        # → forensics-report-order-123.pdf

Step 4: Auto-classify failure patterns

# Detect failure patterns in current session
failures = f.classify()
for fail in failures:
    print(f"[{fail['severity']}] {fail['type']} — {fail['description']}")

# Aggregate patterns across all sessions
stats = f.failure_stats()
print(f"Total failures: {stats['total_failures']}")
for ftype, info in stats['by_type'].items():
    print(f"  {ftype}: {info['count']}x")

Step 5: Deterministic replay

# Extract model config + step sequence from a recorded session
config = f.get_replay_config("order-123")
print(config["model_config"])  # {'model': 'gpt-4o', 'temperature': 0, 'seed': 42}

# After re-running your agent with the same config into a new session:
diff = f.replay_diff("order-123", "order-123-replay")
print(f"Matching: {diff['matching']}")
for d in diff['divergences']:
    print(f"  Step {d['step']}: {d['type']}")

Step 6: Launch the web dashboard

f.dashboard(port=8080)  # → http://localhost:8080

Or from the command line:

python -c "from agent_forensics import Forensics; Forensics(db_path='forensics.db').dashboard()"

Step 7: Access raw event data (optional)

events = f.events()
for e in events:
    print(f"[{e.event_type}] {e.action} — {e.reasoning}")

print(f.sessions())  # ['order-123', 'order-456', ...]

All events are stored in a local SQLite file (forensics.db by default).

Features

Forensic Report

Every report includes:

Timeline — Chronological record of all agent actions
Decision Chain — Each decision with its reasoning
Incident Analysis — Automatic error detection + root cause chain
Causal Chain — Decision → Tool Call → Result → Error trace
Failure Classification — Auto-detected failure patterns with severity and evidence
Prompt Drift Analysis — Detects when system prompt changes between steps
Context Injections — Which RAG documents / memory influenced each decision
Compliance Notes — EU AI Act Article 14 (Human Oversight) support

Failure Auto-Classification

Agent Forensics automatically detects these failure patterns:

Pattern	Severity	What It Detects
`HALLUCINATED_TOOL_OUTPUT`	HIGH	Agent ignored a tool error and proceeded as if it succeeded
`MISSING_APPROVAL`	HIGH	Critical action (purchase, delete, send) without guardrail check
`SILENT_SUBSTITUTION`	HIGH	Final output differs from user's original request without approval
`PROMPT_DRIFT_CAUSED`	MEDIUM	Decision influenced by a system prompt change between steps
`REPEATED_FAILURE`	MEDIUM	Same failing action retried without changing approach
`RETRIEVAL_MISMATCH`	MEDIUM	Low-similarity RAG context used (potentially irrelevant)

Guardrail Checkpoints

Record whether critical actions were allowed or blocked:

f.guardrail(
    intent="buy Apple Magic Mouse per user request",
    action="purchase Logitech M750",
    allowed=False,
    reason="User explicitly requested Apple Magic Mouse — substitution not allowed"
)

Blocked actions trigger incident detection and appear in the causal chain as [GUARDRAIL BLOCKED].

Context Injection Tracking

Trace which external data influenced each decision:

f.context_injection("vector_db", content={
    "document": "refund_policy_v2.md",
    "similarity_score": 0.92,
}, reasoning="Retrieved refund policy from vector store")

Shows up in the causal chain as [CONTEXT] nodes — "this decision was influenced by this specific document."

Prompt Drift Detection

Automatically detects when the system prompt changes between agent steps:

f.prompt_state("You are a helpful assistant.")
# ... agent does work ...
f.prompt_state("You are a helpful assistant. Always choose the cheapest option.")
# → PROMPT DRIFT automatically detected and flagged

For LangChain and OpenAI Agents SDK, drift detection is automatic — no manual calls needed.

Deterministic Replay

Extract model config from a recorded session and compare results:

config = f.get_replay_config("order-123")
# → {'model': 'gpt-4o', 'temperature': 0, 'seed': 42}

diff = f.replay_diff("order-123", "order-123-replay")
# → Shows exactly which step diverged and how

Web Dashboard

Dark-themed dashboard with session selector, color-coded timeline, and causal chain visualization.

Causal Chain — Root Cause Analysis

Data Leak Incident

Output Formats

Markdown — f.save_markdown()
PDF — f.save_pdf() (requires pip install agent-forensics[pdf])
Web Dashboard — f.dashboard(port=8080)
Raw Events — f.events() returns list[Event]
Failure Data — f.classify() returns list[dict]

Architecture

Your Agent (any framework)
    │
    │  Callback / Hook (1 line of code)
    ▼
┌───────────────────────────┐
│  Forensics Collector       │  Captures decisions, tool calls, LLM interactions
├───────────────────────────┤
│  Context & Prompt Tracker  │  Tracks RAG injections + prompt drift
├───────────────────────────┤
│  Event Store (SQLite)      │  Immutable event log with session isolation
├───────────────────────────┤
│  Failure Classifier        │  Auto-detects 6 failure patterns
├───────────────────────────┤
│  Report Generator          │  Markdown / PDF / Dashboard
├───────────────────────────┤
│  Replay Engine             │  Deterministic trace reproduction + diff
└───────────────────────────┘

Supported Frameworks

Framework	Integration	Method
Any (manual)	`f.decision()`, `f.tool_call()`, `f.error()`	Direct API
LangChain / LangGraph	`f.langchain()`	Callback Handler (auto prompt drift)
OpenAI Agents SDK	`f.openai_agents()`	AgentHooks (auto model config capture)
CrewAI	`f.crewai()`	step_callback / task_callback

Event Types

Type	When	Why It Matters
`decision`	Agent decides what to do next	Core of forensics — the why
`tool_call_start/end`	Tool execution	What tool, what input, what result
`llm_call_start/end`	LLM request/response	What was asked, what was answered
`error`	Something went wrong	Incident detection
`final_decision`	Agent produces final output	End of decision chain
`context_injection`	RAG/memory context injected	Which data influenced the decision
`prompt_state`	System prompt recorded	Baseline for drift detection
`prompt_drift`	System prompt changed	Instruction drift flagged
`guardrail_pass`	Action allowed by guardrail	Approval checkpoint
`guardrail_block`	Action blocked by guardrail	Prevention checkpoint

API Reference

Core

Method	Description
`Forensics(session, agent, db_path)`	Initialize with session ID and agent name
`f.decision(action, input, reasoning)`	Record a decision
`f.tool_call(action, input, output)`	Record a tool call
`f.llm_call(input, output, model, temperature, seed)`	Record an LLM call with model config
`f.error(action, output, reasoning)`	Record an error
`f.finish(output, reasoning)`	Record final output
`f.context_injection(source, content, reasoning)`	Record RAG/memory context injection
`f.prompt_state(system_prompt)`	Record prompt state (auto drift detection)
`f.guardrail(intent, action, allowed, reason)`	Record guardrail checkpoint

Analysis

Method	Description
`f.report()`	Generate Markdown forensic report
`f.save_markdown(path)`	Save report as Markdown file
`f.save_pdf(path)`	Save report as PDF
`f.classify(session_id)`	Auto-classify failure patterns
`f.failure_stats(session_ids)`	Aggregate failures across sessions
`f.get_replay_config(session_id)`	Extract model config for replay
`f.replay_diff(original, replay)`	Compare original vs replayed session
`f.events()`	Get raw events for current session
`f.sessions()`	List all sessions
`f.dashboard(port)`	Launch web dashboard

Framework Integrations

Method	Returns
`f.langchain()`	LangChain `BaseCallbackHandler`
`f.openai_agents()`	OpenAI Agents SDK `AgentHooks`
`f.crewai()`	CrewAI callback collection (`.step_callback`, `.task_callback`)

Documentation

Full documentation is available at the docs site:

Used By

Using Agent Forensics? Add your project here via pull request!

License

MIT

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

0.4.0

Mar 29, 2026

0.3.0

Mar 29, 2026

0.2.0

Mar 28, 2026

0.1.0

Mar 28, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_forensics-0.4.0.tar.gz (48.0 kB view details)

Uploaded Mar 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agent_forensics-0.4.0-py3-none-any.whl (39.3 kB view details)

Uploaded Mar 29, 2026 Python 3

File details

Details for the file agent_forensics-0.4.0.tar.gz.

File metadata

Download URL: agent_forensics-0.4.0.tar.gz
Upload date: Mar 29, 2026
Size: 48.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for agent_forensics-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`b975339a493a7b388aaf5c6bde8ce731b585aa2edd4be744fe62436e2813e65b`
MD5	`edca0cf0d672c795eecb4c61280200ad`
BLAKE2b-256	`7d5f9692adce191fb98c4a9a72be0c2ccdc61e9ee09398a9378c4534919d6793`

See more details on using hashes here.

File details

Details for the file agent_forensics-0.4.0-py3-none-any.whl.

File metadata

Download URL: agent_forensics-0.4.0-py3-none-any.whl
Upload date: Mar 29, 2026
Size: 39.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for agent_forensics-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cfe21de26fab2920de5dfd969ef855e7a14e82825bd267c9925c76945efce7bb`
MD5	`d3f4f5554e52c1d6f15125bb96c04255`
BLAKE2b-256	`5c0da159ed99fd3d70f27556ad2b6c7a35b099362350c5fc57d54ee56d581583`

See more details on using hashes here.

agent-forensics 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Agent Forensics

Why

Install

From PyPI

From Source

Quick Start — Full Walkthrough

Step 1: Install

Step 2: Record agent actions

Step 3: Generate reports

Step 4: Auto-classify failure patterns

Step 5: Deterministic replay

Step 6: Launch the web dashboard

Step 7: Access raw event data (optional)

Features

Forensic Report

Failure Auto-Classification

Guardrail Checkpoints

Context Injection Tracking

Prompt Drift Detection

Deterministic Replay

Web Dashboard

Output Formats

Architecture

Supported Frameworks

Event Types

API Reference

Core

Analysis

Framework Integrations

Documentation

Used By

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes