
AgentDbg


The step-through debugger for AI agents.

AgentDbg captures a structured trace of every agent run - LLM calls, tool calls, errors, state updates, loop warnings - and gives you a clean local timeline to see exactly what happened.

Add @trace, run your agent, then run:

agentdbg view

In under 10 minutes, you can inspect a full execution timeline with inputs, outputs, status, and failure evidence - all on your machine.

No cloud. No accounts. No telemetry.

Why AgentDbg?

Agents fail in ways logs don't explain:

  • Silent loops that burn tokens
  • Tool schema mismatches and malformed arguments
  • Prompt regressions ("it worked yesterday")
  • Flaky, non-deterministic runs
  • "Why did it do that?"

AgentDbg makes agent executions legible.

Instead of scattered logs, you get:

  • A chronological timeline of events
  • Expandable LLM calls (prompt, response, usage)
  • Tool calls with args, results, and error status
  • Highlighted loop warnings with evidence
  • A self-contained run artifact you can export
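The highlighted loop warnings can be approximated with a simple heuristic: flag tool calls that repeat with an identical name and arguments. The sketch below is illustrative only, not AgentDbg's actual detection logic, and the event field names (`type`, `name`, `args`) are assumptions about the schema:

```python
from collections import Counter

def find_loops(events, threshold=3):
    """Flag tool calls whose (name, args) signature repeats threshold+ times.
    Field names are illustrative, not AgentDbg's exact event schema."""
    signatures = Counter(
        (e["name"], tuple(sorted(e.get("args", {}).items())))
        for e in events
        if e.get("type") == "tool_call"
    )
    return [sig for sig, count in signatures.items() if count >= threshold]

# Three identical search_db calls in a row: a classic silent loop.
events = [{"type": "tool_call", "name": "search_db", "args": {"query": "active users"}}] * 3
print(find_loops(events))
```

A real detector would also weigh timing and intervening events, but even this signature count surfaces the token-burning loops that plain logs hide.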

What you get (per run)

Each run produces a local artifact:

  • run.json - metadata, status, counts
  • events.jsonl - full structured event stream

In the UI, you see:

  • Run status (ok / error)
  • Duration
  • LLM call count
  • Tool call count
  • Error count
  • Loop warnings (if any)

Everything is written to ~/.agentdbg/ as plain JSON files. Nothing leaves your machine.

What AgentDbg is

  • A development-time debugger for AI agents
  • Local-first: traces stored as JSONL on disk
  • Framework-agnostic: works with any Python code
  • Redacted by default: secrets scrubbed before writing to disk
  • Built for the "why did it do that?" moment

What AgentDbg is NOT (v0.1 scope)

  • Not a hosted service
  • Not a production observability platform
  • Not dashboards or alerting
  • Not deterministic replay (planned v0.2+)
  • Not tied to a single framework

If observability tells you how your system behaves in production, AgentDbg helps you understand why your agent behaved that way while you're building it.

Get running in 5 minutes

Three commands. No config files, no API keys, no sign-up:

  1. Install (one-time)
  2. Run example
  3. agentdbg view

Step 1: Install

git clone https://github.com/AgentDbg/AgentDbg.git
cd AgentDbg
uv venv && uv sync && uv pip install -e .

No uv? Use pip instead:

python -m venv .venv && source .venv/bin/activate
pip install -e .

Step 2: Run the example agent

python examples/demo/pure_python.py

This runs a tiny simulated agent that makes several tool and LLM calls, triggering loop warnings and errors along the way. Trace data lands in ~/.agentdbg/runs/.

Step 3: Open the timeline

agentdbg view

A browser tab opens at http://127.0.0.1:8712 showing the full run timeline - every event, with inputs, outputs, and timing.

(Screenshot: the pure-Python example agent's timeline in the AgentDbg UI.)

That's it. You're debugging.

Instrument your own agent

Add a decorator and a few record calls to any Python agent:

from agentdbg import trace, record_llm_call, record_tool_call

@trace
def run_agent():
    # ... your existing agent code ...

    record_tool_call(
        name="search_db",
        args={"query": "active users"},
        result={"count": 42},
    )

    record_llm_call(
        model="gpt-4",
        prompt="Summarize the search results.",
        response="There are 42 active users.",
        usage={"prompt_tokens": 12, "completion_tokens": 8, "total_tokens": 20},
    )

run_agent()

Then run agentdbg view to see the timeline.

What gets captured

Event           Recorded by            What you see
Run start/end   @trace (automatic)     Duration, status, error if any
LLM calls       record_llm_call()      Model, prompt, response, token usage
Tool calls      record_tool_call()     Tool name, args, result, status
State updates   record_state()         Arbitrary state snapshots
Errors          @trace (automatic)     Exception type, message, stack trace
Loop warnings   Automatic detection    Repetitive pattern + evidence
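A minimal version of the chronological view the timeline UI builds from these events might look like the sketch below. The field names (`ts`, `type`, `name`, `model`) are illustrative assumptions, not AgentDbg's exact schema:

```python
def render_timeline(events):
    """Render events as a sorted, fixed-width text timeline.
    Assumes each event dict has ts (seconds), type, and an optional name/model."""
    lines = []
    for e in sorted(events, key=lambda e: e.get("ts", 0.0)):
        label = e.get("name") or e.get("model") or ""
        lines.append(f"{e.get('ts', 0.0):8.3f}s  {e.get('type', '?'):<12} {label}")
    return "\n".join(lines)

events = [
    {"ts": 0.120, "type": "tool_call", "name": "search_db"},
    {"ts": 0.005, "type": "llm_call", "model": "gpt-4"},
]
print(render_timeline(events))
```

Sorting by timestamp rather than arrival order is what keeps the view honest when events are flushed asynchronously.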

CLI reference

List recent runs

agentdbg list              # last 20 runs
agentdbg list --limit 50   # more runs
agentdbg list --json       # machine-readable output

View a run timeline

agentdbg view              # opens latest run
agentdbg view <RUN_ID>     # specific run
agentdbg view --no-browser # just print the URL

Export a run

agentdbg export <RUN_ID> --out run.json

Redaction & privacy

Redaction is ON by default. AgentDbg scrubs values for keys matching sensitive patterns (case-insensitive) before writing to disk. Large fields are truncated and marked with __TRUNCATED__.

Default redacted keys: api_key, token, authorization, cookie, secret, password.

# Override defaults via environment variables
export AGENTDBG_REDACT=1                    # on by default
export AGENTDBG_REDACT_KEYS="api_key,token,authorization,cookie,secret,password"
export AGENTDBG_MAX_FIELD_BYTES=20000       # truncation limit

You can also configure redaction in .agentdbg/config.yaml (project root) or ~/.agentdbg/config.yaml.
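The behavior above can be sketched as a recursive scrub pass. Only the __TRUNCATED__ marker, the default key list, and the 20000-byte limit come from the docs above; the "[REDACTED]" placeholder and the substring-matching rule are assumptions for illustration:

```python
SENSITIVE_KEYS = ("api_key", "token", "authorization", "cookie", "secret", "password")

def scrub(value, keys=SENSITIVE_KEYS, max_bytes=20000):
    """Recursively redact values under sensitive keys (case-insensitive
    substring match, an assumption) and truncate oversized strings."""
    if isinstance(value, dict):
        return {
            k: "[REDACTED]" if any(s in k.lower() for s in keys)
            else scrub(v, keys, max_bytes)
            for k, v in value.items()
        }
    if isinstance(value, list):
        return [scrub(v, keys, max_bytes) for v in value]
    if isinstance(value, str) and len(value.encode("utf-8")) > max_bytes:
        return value[:max_bytes] + "__TRUNCATED__"
    return value

print(scrub({"API_KEY": "sk-123", "query": "active users"}))
```

Scrubbing before serialization (rather than post-processing the files) is the property that matters: secrets never touch disk.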

Storage

All data is local. Plain files, easy to inspect or delete.

~/.agentdbg/
└── runs/
    └── <run_id>/
        ├── run.json        # run metadata (status, counts, timing)
        └── events.jsonl    # append-only event log

Override the location:

export AGENTDBG_DATA_DIR=/path/to/traces
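Because everything is plain files, runs can be post-processed with the stdlib alone. This sketch relies only on the run.json/events.jsonl layout above; field names inside the files (such as status) are illustrative, so check an actual run for the exact schema:

```python
import json
from pathlib import Path

def summarize_run(run_dir):
    """Summarize one run directory containing run.json and events.jsonl.
    The 'status' field is an assumed key; inspect a real run.json to confirm."""
    run_dir = Path(run_dir)
    meta = json.loads((run_dir / "run.json").read_text())
    with (run_dir / "events.jsonl").open() as f:
        events = [json.loads(line) for line in f if line.strip()]
    return {"status": meta.get("status"), "events": len(events)}
```

Point it at any directory under ~/.agentdbg/runs/ (or wherever AGENTDBG_DATA_DIR points).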

Integrations

AgentDbg is framework-agnostic at its core. The SDK works with any Python code.

LangChain / LangGraph (v0.1)

Optional callback handler that auto-records LLM and tool events. Requires langchain-core:

pip install -e ".[langchain]"

from agentdbg import trace
from agentdbg.integrations import AgentDbgLangChainCallbackHandler

@trace
def run_agent():
    handler = AgentDbgLangChainCallbackHandler()
    # pass to your chain: config={"callbacks": [handler]}
    ...

See examples/langchain/minimal.py for a runnable example.

Planned adapters

  • OpenAI Agents SDK
  • Agno
  • Others (AutoGen, CrewAI, custom loops)

Until an adapter exists for your framework, use the core SDK: @trace + record_llm_call / record_tool_call.

Development

uv venv && uv sync && uv pip install -e ".[langchain]"
uv run pytest

Roadmap

Works today (v0.1):

  • @trace decorator + record_llm_call / record_tool_call / record_state
  • Local JSONL storage with automatic redaction
  • agentdbg list, agentdbg view (timeline UI), agentdbg export
  • Loop detection (LOOP_WARNING events)
  • LangChain/LangGraph callback handler

Planned (v0.2+):

  • Deterministic replay / tool mocking
  • OpenAI Agents SDK adapter
  • Eval + regression CI
  • Optional hosted trace store

License

Licensed under the Apache License, Version 2.0. See LICENSE.


