Record, replay, fork & share AI agent executions

retrace-sdk

The execution replay engine for AI agents. Record every LLM call, tool invocation, and error your AI agent makes. Replay step-by-step. Fork from any point. Share interactive traces via URL.

Install

pip install retrace-sdk

Requires Python 3.10+.

Quick Start

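import retrace
from openai import OpenAI  # the call below uses the OpenAI SDK's chat completions API

retrace.configure(api_key="rt_live_...")  # Get your key at retrace.yashbogam.me/settings
client = OpenAI()  # reads OPENAI_API_KEY from the environment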

@retrace.record(name="my-agent")
def run_agent(prompt: str):
    response = client.chat.completions.create(
        model="gpt-5.5",
        messages=[{"role": "user", "content": prompt}]
    )
    return response.choices[0].message.content

run_agent("What is quantum computing?")

Auto-Instrumentation

Retrace automatically captures LLM calls made through the OpenAI, Anthropic, and Google Gemini SDKs:

# OpenAI — captured automatically
# Anthropic — captured automatically
# Google Gemini — captured automatically

No extra setup needed. Install the provider SDK alongside retrace-sdk and calls are captured.
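
One common way to capture calls without changes at the call site is to wrap the provider's client method once, up front. The sketch below shows that general technique on a stand-in class. `FakeCompletions`, `instrument`, and `captured_calls` are hypothetical names, and nothing here is confirmed to match how retrace-sdk works internally.

```python
import functools
import time

captured_calls = []  # hypothetical in-memory sink for recorded calls


class FakeCompletions:
    """Stand-in for a provider SDK class (not a real SDK)."""

    def create(self, model, messages):
        return {"model": model, "content": f"echo: {messages[-1]['content']}"}


def instrument(cls, method_name):
    """Replace cls.method_name with a wrapper that records each call."""
    original = getattr(cls, method_name)

    @functools.wraps(original)
    def wrapper(self, *args, **kwargs):
        started = time.perf_counter()
        result = original(self, *args, **kwargs)
        captured_calls.append({
            "method": f"{cls.__name__}.{method_name}",
            "kwargs": kwargs,
            "duration_s": time.perf_counter() - started,
        })
        return result

    setattr(cls, method_name, wrapper)


# Patch once; existing call sites need no changes.
instrument(FakeCompletions, "create")

reply = FakeCompletions().create(
    model="demo-model",
    messages=[{"role": "user", "content": "hi"}],
)
```

After the call, `captured_calls` holds one entry describing the method, its keyword arguments, and how long it took.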

Features

  • Record — One decorator captures every LLM call, tool call, and error
  • Replay — Step through executions with play/pause/speed controls
  • Fork — Branch from any step, modify input, watch a new path diverge
  • Share — Publish traces as shareable "tapes" with interactive playback
  • Retrace AI — Built-in evaluations, memory extraction, and semantic search

Resumable Execution (Cascade Replay)

Mark a function as resumable to enable full cascade replay from the dashboard:

@retrace.record(name="my-agent", resumable=True)
def run_agent(prompt: str):
    plan = call_planner(prompt)
    result = call_executor(plan)
    return summarize(result)

When you fork at any span in the dashboard, the SDK re-executes the entire function with the modified input, so every subsequent LLM call can diverge from the original run.

Error Handling

from retrace import RetraceError, RetraceAuthError, RetraceCreditsExhaustedError, RetraceRateLimitError
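
Typed errors let callers react differently to each failure. The sketch below retries only on rate limits and lets authentication and credit errors propagate. To stay runnable without the package, it defines stand-in classes with the same names. That the specific errors subclass `RetraceError` is an assumption, and `call_with_retry` with its parameters is a hypothetical helper, not part of the SDK.

```python
import time


# Stand-ins so the sketch runs without the package. In real code, use the
# import shown above instead. Inheriting from RetraceError is an assumption.
class RetraceError(Exception): ...
class RetraceAuthError(RetraceError): ...
class RetraceCreditsExhaustedError(RetraceError): ...
class RetraceRateLimitError(RetraceError): ...


def call_with_retry(func, *args, retries=3, base_delay=0.5, sleep=time.sleep):
    """Retry on rate limits with exponential backoff; other errors propagate."""
    for attempt in range(retries + 1):
        try:
            return func(*args)
        except RetraceRateLimitError:
            if attempt == retries:
                raise
            sleep(base_delay * 2 ** attempt)
        # Auth and credit errors are not caught here: a bad key or an
        # empty balance will not fix itself on retry.


attempts = []


def flaky(prompt):
    """Fails twice with a rate limit, then succeeds."""
    attempts.append(prompt)
    if len(attempts) < 3:
        raise RetraceRateLimitError("slow down")
    return "ok"


result = call_with_retry(flaky, "hello", sleep=lambda seconds: None)
```

Passing `sleep` as a parameter keeps the demo fast and makes the backoff easy to test.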

Sampling

retrace.configure(api_key="rt_live_...", sample_rate=0.1)  # Record 10% of traces
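
A rate of 0.1 means roughly one trace in ten is kept. The sketch below shows one plausible way such a decision can be made: an independent random draw per trace. Whether retrace-sdk samples this way is an assumption, and `should_record` is a hypothetical helper.

```python
import random


def should_record(sample_rate: float, rng: random.Random) -> bool:
    """Return True for roughly `sample_rate` of calls (0.0 = never, 1.0 = always)."""
    return rng.random() < sample_rate


rng = random.Random(42)  # seeded only to make the demo repeatable
kept = sum(should_record(0.1, rng) for _ in range(10_000))
fraction = kept / 10_000  # close to 0.1, but not exactly 0.1
```

With independent draws the kept fraction only approximates the configured rate, so low-traffic agents may record noticeably more or fewer than 10% of their runs.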

Changelog

0.2.2

  • Version sync with TypeScript SDK

0.2.1

  • Offline buffer — stores up to 1000 messages when WebSocket disconnects, flushes on reconnect
  • Dedicated listener thread — receives server 'resume' commands without needing active sends
  • Cascade replay — resumable=True registers the function for SDK-level re-execution
  • Fixed — duplicate except block in transport, proper close() cleanup
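
The offline buffer described above can be pictured as a bounded queue that fills while the connection is down and drains in order on reconnect. This is a minimal sketch, not the SDK's transport: `BufferedSender` and its methods are hypothetical, and dropping the oldest message when the buffer is full is an assumption the changelog does not confirm.

```python
from collections import deque


class BufferedSender:
    """Toy transport: queue messages while offline, flush in order on reconnect."""

    def __init__(self, send, max_buffered=1000):
        self._send = send  # callable that delivers one message
        # A deque with maxlen drops the oldest entry when full (assumption).
        self._buffer = deque(maxlen=max_buffered)
        self.connected = True

    def emit(self, message):
        if self.connected:
            self._send(message)
        else:
            self._buffer.append(message)

    def on_disconnect(self):
        self.connected = False

    def on_reconnect(self):
        self.connected = True
        while self._buffer:
            self._send(self._buffer.popleft())


delivered = []
sender = BufferedSender(delivered.append, max_buffered=3)
sender.emit("a")
sender.on_disconnect()
for msg in ["b", "c", "d", "e"]:
    sender.emit(msg)  # "b" is dropped once the 3-slot buffer overflows
sender.on_reconnect()
```

The demo uses a 3-slot buffer so the overflow is visible; the changelog states a limit of 1000 messages.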

0.2.0

  • Typed errors (RetraceAuthError, RetraceCreditsExhaustedError, RetraceRateLimitError)
  • Trace sampling via sample_rate config
  • Auto-instrumentation for OpenAI, Anthropic, Gemini
  • WebSocket transport with auto-reconnect

Download files

Source Distribution

retrace_sdk-0.2.3.tar.gz (16.1 kB view details)

Uploaded Source

Built Distribution

retrace_sdk-0.2.3-py3-none-any.whl (16.8 kB view details)

Uploaded Python 3

File details

Details for the file retrace_sdk-0.2.3.tar.gz.

File metadata

  • Download URL: retrace_sdk-0.2.3.tar.gz
  • Size: 16.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for retrace_sdk-0.2.3.tar.gz
Algorithm Hash digest
SHA256 25270119f1451cae1c814560401db3a17bea2f32cd81ca513c51d5ea16d4b4c3
MD5 ee8a20546034ccb3749848d9466b20a6
BLAKE2b-256 dad9fec3b16cf180b6a8aca46d7a9e0aa45d2a888cfe9be2c10c21b822250323

File details

Details for the file retrace_sdk-0.2.3-py3-none-any.whl.

File metadata

  • Download URL: retrace_sdk-0.2.3-py3-none-any.whl
  • Size: 16.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.13

File hashes

Hashes for retrace_sdk-0.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 9521df751a9dd61c3b5852b75bc720560df2bcc89ba240256f7691ec70e9f27d
MD5 983984ddbabd27ab63cb0a3b9c68e6ce
BLAKE2b-256 7dc7a51408b954bb6864224436fc6199c9b9c8afd496e00306bfa48bf146ee18
