Deterministic record-and-replay debugger for AI agent runs

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

rich4188

These details have not been verified by PyPI

Project description

agentrr

Deterministic record-and-replay debugger for AI agent runs.

When an AI agent does something wrong in production, you usually can't reproduce it. Run it again and it takes a different path — the model samples differently, the tool returns different data, and the bug you saw is gone.

agentrr records every nondeterministic boundary an agent crosses — every LLM call, tool call, clock read, and random draw — then replays the run deterministically and offline. The agent's real logic runs again; every external answer is served from the recording. No API calls, no side effects, no cost. You step through the exact failing run as many times as you need.

It's rr / time-travel debugging, for AI agents.

Install (PyPI)

Alpha releases on PyPI:

pip install agentrr
agentrr version

Optional local web UI:

pip install agentrr-ui
agentrr-ui   # http://127.0.0.1:8765 — see docs/ui.md

Quick start (from source)

git clone https://github.com/ip174/agentrr.git
cd agentrr
uv sync --group dev
export PYTHONPATH=examples

1. Record a run

Use python -m … so the log stores a stable entrypoint for replay:

uv run python -m agents.deterministic_support
# run_id: deterministic_support-<id>
# log: .agentrr/runs/deterministic_support-<id>.jsonl

2. Replay in the CLI

uv run agentrr replay deterministic_support-<id>

Entrypoint is read from the log header (0.1.0a2+); override only when needed.

Edit the agent and replay again — strict mode stops at the first divergence:

DivergenceError: divergence at seq 5: signature mismatch

3. Inspect in the web UI (optional)

# dev checkout: install UI + built frontend
cd packages/agentrr-ui/frontend && npm ci && npm run build
cd ../../..
uv pip install -e . -e packages/agentrr-ui

export PYTHONPATH=examples
export AGENTRR_LOG_DIR=.agentrr/runs   # optional; this is the default
agentrr-ui

Open http://127.0.0.1:8765 — pick a session, read What happened, then Check replay and Next to step through AI/tool steps. Replay matched means today's run followed the same path as the recording.

See docs/ui.md for security, nginx, and troubleshooting.

What it guarantees

Faithful replay for every captured boundary — the replayed boundary sequence exactly matches the recording (verified in CI).
Offline and safe — replay makes zero live LLM calls and never re-executes tools. Replaying an agent that issued a refund does not issue another.
Crash-safe recording — an event is durably on disk (fsync) before the agent acts on it. Verified with real SIGKILL in CI. A killed run produces a truncated log, never a holed one.
Honest divergence — when replay can't reproduce faithfully, it halts at the exact point and tells you, with a diff. It never silently guesses or serves a mismatched response.

What it does NOT do (by design)

Single-process, synchronous agents. No marketplace, no backend, no hosted service. Concurrency, streaming-chunk replay, and multi-agent pipelines are out of scope for v0.1. See docs/contract.md.

How it works

Layer	Recorded	Served on replay
LLM calls (OpenAI, Anthropic)	full request + response + metadata	recorded response
Tool calls	name, args, return/error	recorded result (tool never runs)
Clock / RNG / IDs	every read and draw	recorded values, in order

Matching is sequence-primary, signature-validated — no fuzzy search. A request that doesn't match the next expected event is divergence.

Development

uv sync --group dev
export PYTHONPATH=examples
make test          # full suite (excludes durability subdir by default in Makefile)
make durability    # SIGKILL write-before-return gate
make ui-build      # compile React → agentrr_ui/static/
make lint
gitleaks detect    # before you push

Reference agents

Agent	Purpose
`examples/agents/deterministic_support.py`	Golden path (mock LLM, registered tools, shims)
`examples/agents/unstable_loop.py`	Unwrapped `random` — diverges on replay (by design)
`examples/agents/tool_caller.py`	LLM → tool → LLM loop
`examples/agents/broken_replay_cases.py`	Negative scenarios

Docs

Doc	Topic
docs/ui.md	Web UI install and run
docs/RELEASING.md	PyPI release checklist
docs/contract.md	Guarantees and exclusions
docs/replay-worker-protocol.md	UI worker IPC
CONTRIBUTING.md	Contributor workflow

License

Apache-2.0 — see LICENSE.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

rich4188

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.0a3 pre-release

May 29, 2026

This version

0.1.0a2 pre-release

May 29, 2026

0.1.0a1 pre-release

May 28, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentrr-0.1.0a2.tar.gz (23.4 kB view details)

Uploaded May 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentrr-0.1.0a2-py3-none-any.whl (40.8 kB view details)

Uploaded May 29, 2026 Python 3

File details

Details for the file agentrr-0.1.0a2.tar.gz.

File metadata

Download URL: agentrr-0.1.0a2.tar.gz
Upload date: May 29, 2026
Size: 23.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agentrr-0.1.0a2.tar.gz
Algorithm	Hash digest
SHA256	`f530a776c832b62108f2f5b1a3cec15a00815c7c7a1b9b0273f9fe25098a50f9`
MD5	`8a25e8a2a2b936f88ffe2ea4ae0b960e`
BLAKE2b-256	`18bba64ad66e27477f6116f1b16e987a9f52221efe2e666cefc80d944fc5a0f6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agentrr-0.1.0a2.tar.gz:

Publisher: release.yml on ip174/agentrr

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agentrr-0.1.0a2.tar.gz
- Subject digest: f530a776c832b62108f2f5b1a3cec15a00815c7c7a1b9b0273f9fe25098a50f9
- Sigstore transparency entry: 1664875633
- Sigstore integration time: May 29, 2026
Source repository:
- Permalink: ip174/agentrr@af62786604a17c5061fe0c0fd48a969232e5558d
- Branch / Tag: refs/tags/v0.1.0a2
- Owner: https://github.com/ip174
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@af62786604a17c5061fe0c0fd48a969232e5558d
- Trigger Event: push

File details

Details for the file agentrr-0.1.0a2-py3-none-any.whl.

File metadata

Download URL: agentrr-0.1.0a2-py3-none-any.whl
Upload date: May 29, 2026
Size: 40.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agentrr-0.1.0a2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b1958778dd90c93ca131cfe1a12997d6d4ded74080307da6f710a8398fe67682`
MD5	`d7a5c63822ff3b1c2a7175ef0ec093d9`
BLAKE2b-256	`c5d549035d4429c8d4168a12540e45e86fa3560ee4d3f863fb191382dbfc1df5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agentrr-0.1.0a2-py3-none-any.whl:

Publisher: release.yml on ip174/agentrr

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agentrr-0.1.0a2-py3-none-any.whl
- Subject digest: b1958778dd90c93ca131cfe1a12997d6d4ded74080307da6f710a8398fe67682
- Sigstore transparency entry: 1664875947
- Sigstore integration time: May 29, 2026
Source repository:
- Permalink: ip174/agentrr@af62786604a17c5061fe0c0fd48a969232e5558d
- Branch / Tag: refs/tags/v0.1.0a2
- Owner: https://github.com/ip174
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@af62786604a17c5061fe0c0fd48a969232e5558d
- Trigger Event: push

agentrr 0.1.0a2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

agentrr

Install (PyPI)

Quick start (from source)

1. Record a run

2. Replay in the CLI

3. Inspect in the web UI (optional)

What it guarantees

What it does NOT do (by design)

How it works

Development

Reference agents

Docs

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance