An automatic, non-intrusive execution tracer for Python for iterative IDE-less debugging and code-flow analysis

These details have not been verified by PyPI

Project links

Project description

🍞 Breadcrumb: Automatic Execution Tracing

Breadcrumb lets you do this:

# Run your code with automatic tracing (NO code changes needed!)
breadcrumb run python my_code.py

Output:

============================================================
BREADCRUMB RUN REPORT
============================================================

Key Metrics:
  Total Events: 22
  - Calls: 9
  - Returns: 9
  - Exceptions: 0
  Duration: 4.06 ms
  Status: completed

Call Tree:
-> __main__.<module>
  -> __main__.main
    -> __main__.add
    <- __main__.add (0.05ms)
    -> __main__.multiply
    <- __main__.multiply (0.05ms)
    -> __main__.greet
    <- __main__.greet (0.04ms)
    -> __main__.calculate_total
      -> __main__.multiply
      <- __main__.multiply (0.08ms)
      -> __main__.multiply
      <- __main__.multiply (0.08ms)
      -> __main__.add
      <- __main__.add (0.04ms)
    <- __main__.calculate_total (0.51ms)
  <- __main__.main (1.46ms)
<- __main__.<module> (1.61ms)

Top 10 Most Called Functions:
    __main__.multiply: 3 calls
    __main__.add: 2 calls
    __main__.greet: 1 calls
    __main__.<module>: 1 calls
    __main__.main: 1 calls
    __main__.calculate_total: 1 calls

Recommendations on what to do next:
  Detailed Flow: breadcrumb query -c example --flow
  Untraced calls: breadcrumb query -c example --gaps
  View specific call: breadcrumb query -c example --call <call_id>
     Example 'multiply': breadcrumb query -c example --call multiply

Dig in:

breadcrumb query --call multiply

Output:

{
  "function": "multiply",
  "calls": [
    {
      "timestamp": "2025-10-11T22:50:39.444263",
      "args": {
        "a": 7,
        "b": 6
      },
      "return_value": 42,
      "duration_ms": 0.049,
      "called_by": "__main__.main",
      "calls_made": []
    },
    {
      "timestamp": "2025-10-11T22:50:39.444732",
      "args": {
        "a": 100,
        "b": 2
      },
      "return_value": 200,
      "duration_ms": 0.082,
      "called_by": "__main__.calculate_total",
      "calls_made": []
    },
    {
      "timestamp": "2025-10-11T22:50:39.444900",
      "args": {
        "a": 200,
        "b": 0.15
      },
      "return_value": 30.0,
      "duration_ms": 0.083,
      "called_by": "__main__.calculate_total",
      "calls_made": []
    }
  ]
}

# Find what wasn't traced
breadcrumb query --gaps

Output:

{
  "untraced_calls": [
    {
      "function": "json.dumps",
      "module": "json",
      "called_by": "__main__.format_response",
      "call_count": 3,
      "suggested_include": "json.*"
    },
    {
      "function": "requests.post",
      "module": "requests",
      "called_by": "__main__.send_webhook",
      "call_count": 1,
      "suggested_include": "requests.*"
    }
  ],
  "tip": "Add these to your config to trace them:\n  breadcrumb config edit myapp --add-include 'json.*'\n  breadcrumb config edit myapp --add-include 'requests.*'"
}

Expand tracing and re-run:

breadcrumb query --call post

Output:

{
  "function": "requests.post",
  "calls": [
    {
      "timestamp": "2025-10-11T15:30:42.123",
      "args": {"url": "https://api.example.com/webhook", "data": {"event": "user.signup"}},
      "return_value": {"status": 200, "body": "OK"},
      "duration_ms": 234.5,
      "called_by": "__main__.send_webhook",
      "calls_made": ["json.dumps"]
    }
  ]
}

---

## Why Would You Need This?

**You probably won't. But your AI coding agent will LOVE this.**

### The Problem: AI Agents Can't "Step Through" Code

When humans debug, we:
- Set breakpoints and step through execution
- Watch variables change in real-time
- See the actual call stack in our debugger
- Understand what the code *actually did* vs what we *thought* it would do

When AI agents debug, they:
- ❌ Can't set breakpoints (no interactive debugger access)
- ❌ Can't "watch" execution (no real-time visibility)
- ❌ Can't inspect the call stack (no execution context)
- ❌ Must **guess** what happened by reading static code

**Result:** AI agents spend 80% of debugging time making guesses, reading logs, and asking "what actually ran?"

### The Solution: Breadcrumb = X-Ray Vision for AI Agents

Breadcrumb captures **what actually happened** during execution:
- ✅ **Every function call** with arguments and return values
- ✅ **Exact execution flow** in chronological order
- ✅ **Gaps in coverage** showing untraced function calls
- ✅ **Call relationships** (who called what)
- ✅ **Performance data** (execution duration)

**No code changes. No manual logging. No guessing.**

Just run your code with `breadcrumb run` and query the execution trace with structured commands.

---

## Key Features

### 🎯 Gap Detection (The Killer Feature)

**Problem:** You can't trace EVERYTHING (performance overhead). So what aren't you tracing?

**Solution:** `breadcrumb query --gaps` shows you exactly which functions were called but not traced.

```bash
breadcrumb query -c myapp --gaps
# Shows: requests.post called 5 times by myapp.api_client
# Suggests: breadcrumb config edit myapp --add-include 'requests.*'

Workflow: Start minimal → discover gaps → expand tracing → repeat

This is iterative debugging - trace only what you need, when you need it.

🤖 AI-FIRST Design

Traditional tools: Optimize for human convenience (implicit context, smart defaults) Breadcrumb: Optimizes for AI reliability (explicit context, zero ambiguity)

# ❌ IMPLICIT (confusing for AI agents)
breadcrumb run python script.py          # Which database? Which config?
breadcrumb query --gaps                   # Where are the traces?

# ✅ EXPLICIT (perfect for AI agents)
breadcrumb run -c myapp python script.py  # Clear: using myapp config
breadcrumb query -c myapp --gaps          # Clear: querying myapp database

Design principle:

"It's only clunky for humans! AI agents have no issues writing -c myapp every time."

Result: Zero context confusion across sessions. AI agents can resume debugging workflows perfectly.

📊 Smart Queries (No SQL Required)

AI agents don't need to learn SQL. They just need structured data.

# Find untraced calls
breadcrumb query -c myapp --gaps

# Show function I/O
breadcrumb query -c myapp --call send_webhook

# Show execution timeline
breadcrumb query -c myapp --flow

# Filter by module
breadcrumb query -c myapp --flow --module myapp

All output is structured JSON, ready for AI consumption.

🚫 Zero Code Changes

OLD way (requires code modification):

import breadcrumb  # ❌ Must modify your code
breadcrumb.init()

def my_function():
    pass

NEW way (zero code changes):

# ✅ Just run with breadcrumb
breadcrumb run -c myapp --timeout 60 python main.py
# NO code modifications needed!

Your codebase stays clean. Tracing is external.

🔒 Automatic Secret Redaction

Breadcrumb automatically detects and redacts 50+ sensitive patterns:

Passwords, API keys, tokens
Credit cards, SSNs
JWTs, AWS keys, GitHub tokens

data = {"api_key": "sk-1234567890", "password": "secret"}
# Stored as: {"api_key": "[REDACTED]", "password": "[REDACTED]"}

Safe for production traces.

⚡ Low Overhead

PEP 669 backend (Python 3.12+): < 2% overhead
sys.settrace backend (Python 3.8+): < 5% overhead
Selective tracing: Only trace what you need
Async writes: Non-blocking trace persistence

Quick Start

Installation

# Using uv (recommended)
uv pip install -e .

# Or using pip
pip install -e .

The Complete Workflow

1️⃣ Initialize your project (one-time setup):

breadcrumb init myapp

Creates:

~/.breadcrumb/myapp.yaml (config with sensible defaults)
~/.breadcrumb/myapp-traces.duckdb (trace database)

2️⃣ Run your code with tracing:

breadcrumb run -c myapp --timeout 60 python main.py

Automatically traces execution. NO code changes needed.

3️⃣ Find gaps in coverage:

breadcrumb query -c myapp --gaps

Shows which functions were called but not traced.

4️⃣ Expand tracing based on gaps:

breadcrumb config edit myapp --add-include 'requests.*'
breadcrumb config edit myapp --add-include 'sqlalchemy.*'

5️⃣ Re-run with expanded tracing:

breadcrumb run -c myapp --timeout 60 python main.py

Now traces additional modules!

6️⃣ Query function details:

breadcrumb query -c myapp --call my_function

Shows args, returns, duration, caller/callee relationships.

7️⃣ View execution timeline:

breadcrumb query -c myapp --flow

Shows chronological execution order with call stack depth.

AI-FIRST Design Philosophy

Why `-c PROJECT` is REQUIRED

All commands require explicit project specification:

breadcrumb run -c myapp python script.py       # REQUIRED
breadcrumb query -c myapp --gaps               # REQUIRED
breadcrumb config edit myapp --add-include 'x' # REQUIRED

This eliminates:

❌ "Which database am I querying?"
❌ "Which config is active?"
❌ "Where did my traces go?"
❌ Context loss between AI agent sessions

Why it matters:

Humans: "Ugh, typing -c myapp every time is annoying!"
AI Agents: "Perfect! No ambiguity, no context confusion!"

Design philosophy: Explicit > Implicit for AI workflows.

AI agents don't care about typing extra characters. They care about reliability across sessions.

Command Reference

Core Commands

# Initialize project
breadcrumb init <project-name>

# Run with tracing
breadcrumb run -c <project> --timeout <seconds> <command>

# Smart queries
breadcrumb query -c <project> --gaps           # Find untraced calls
breadcrumb query -c <project> --call <func>    # Show function I/O
breadcrumb query -c <project> --flow           # Show execution timeline

# Config management
breadcrumb config create <name>
breadcrumb config edit <name> --add-include <pattern>
breadcrumb config edit <name> --remove-include <pattern>
breadcrumb config show <name>
breadcrumb config list

Query Examples

# Find what's not traced
breadcrumb query -c myapp --gaps

# Show function with args/returns
breadcrumb query -c myapp --call process_payment

# Show execution flow for module
breadcrumb query -c myapp --flow --module myapp

# Raw SQL query (still supported)
breadcrumb query -c myapp --sql "SELECT * FROM traces WHERE status='failed'"

Config Examples

# Start with minimal tracing
breadcrumb init myapp
# Creates config with include: ['__main__']

# Expand based on gaps
breadcrumb config edit myapp --add-include 'requests.*'
breadcrumb config edit myapp --add-include 'sqlalchemy.*'
breadcrumb config edit myapp --add-include 'myapp.services.*'

# Remove noisy modules
breadcrumb config edit myapp --remove-include 'logging.*'

# View current config
breadcrumb config show myapp

Use Cases for AI Coding Agents

1. Understanding Legacy Code

Human approach:

Read docs (if they exist)
Step through with debugger
Ask teammates

AI agent approach:

# Run the code
breadcrumb run -c legacy --timeout 120 python legacy_system.py

# See what actually executed
breadcrumb query -c legacy --flow

# Find entry points
breadcrumb query -c legacy --gaps

Result: AI agent sees actual execution flow, not just static code.

2. Debugging API Integration Issues

Human approach:

Add print statements
Check API logs
Use network inspector

AI agent approach:

# Trace the integration
breadcrumb config edit myapp --add-include 'requests.*'
breadcrumb run -c myapp --timeout 60 python test_api.py

# See exact API calls
breadcrumb query -c myapp --call 'requests.post'

Result: AI agent sees exact request args, response values, and timing.

3. Performance Investigation

Human approach:

Use profiler
Add timing code
Analyze bottlenecks

AI agent approach:

# Run with tracing
breadcrumb run -c myapp --timeout 300 python load_test.py

# Find slow functions
breadcrumb query -c myapp --sql "SELECT function_name, AVG(duration_ms) FROM trace_events GROUP BY function_name ORDER BY AVG(duration_ms) DESC LIMIT 10"

Result: AI agent identifies performance bottlenecks from real execution data.

4. Test Failure Analysis

Human approach:

Re-run test
Add debug output
Check assertions

AI agent approach:

# Run failing test with tracing
breadcrumb run -c tests --timeout 60 pytest tests/test_payment.py::test_refund

# See what actually happened
breadcrumb query -c tests --call process_refund

# Check execution flow
breadcrumb query -c tests --flow --module myapp.payments

Result: AI agent sees exact execution path that led to failure.

MCP Integration (Claude Desktop)

Breadcrumb includes an MCP server for Claude Desktop integration.

Setup:

Add to claude_desktop_config.json:

{
  "mcpServers": {
    "breadcrumb": {
      "command": "breadcrumb",
      "args": ["serve-mcp"],
      "env": {}
    }
  }
}

Available Tools:

breadcrumb__query_traces - Execute SQL queries
breadcrumb__get_trace - Get trace details by ID
breadcrumb__find_exceptions - Find exceptions in traces
breadcrumb__analyze_performance - Analyze function performance

Usage:

Tell Claude: "Use breadcrumb to find the last exception in my app"
Claude will use: breadcrumb__find_exceptions

Advanced Configuration

Python API (Optional)

If you need programmatic control, you can still use the Python API:

import breadcrumb

# Basic configuration
breadcrumb.init(
    enabled=True,
    sample_rate=1.0,
    db_path="~/.breadcrumb/myapp-traces.duckdb",
    include=["myapp.*", "requests.*"]
)

# Your code here
def my_function():
    pass

But 99% of use cases should use breadcrumb run instead.

Environment Variables

# Override config settings
export BREADCRUMB_ENABLED=true
export BREADCRUMB_DB_PATH="/custom/path/traces.duckdb"
export BREADCRUMB_SAMPLE_RATE=0.5

Database Schema

Breadcrumb stores traces in DuckDB with these tables:

traces - One row per execution:

CREATE TABLE traces (
    id VARCHAR PRIMARY KEY,
    started_at TIMESTAMP,
    ended_at TIMESTAMP,
    status VARCHAR,  -- 'running', 'completed', 'failed'
    duration_ms DOUBLE
);

trace_events - Function calls and returns:

CREATE TABLE trace_events (
    id VARCHAR PRIMARY KEY,
    trace_id VARCHAR,
    timestamp TIMESTAMP,
    event_type VARCHAR,  -- 'call', 'return', 'exception', 'call_site'
    function_name VARCHAR,
    module_name VARCHAR,
    data JSON  -- Contains: args, kwargs, return_value, caller info
);

Direct SQL access:

import duckdb
conn = duckdb.connect('~/.breadcrumb/myapp-traces.duckdb')
result = conn.execute("SELECT * FROM traces").fetchall()

How It Works

Architecture

PEP 669 Backend (Python 3.12+)
- Uses sys.monitoring API (low overhead)
- Captures function calls/returns automatically
- Detects untraced calls for gap detection
Trace Storage (DuckDB)
- Fast columnar database
- JSON support for flexible data
- SQL queryable for complex analysis
Smart Query Layer
- Abstracts SQL complexity
- Returns structured JSON
- Optimized for AI agent consumption
CLI Interface
- Typer-based commands
- Explicit project specification
- AI-FIRST design principles

Include-Only Workflow

Default behavior: Trace only __main__ module (minimal overhead)

Iterative expansion:

breadcrumb run -c myapp python script.py     # Trace __main__ only
breadcrumb query -c myapp --gaps             # Find what's not traced
breadcrumb config edit myapp --add-include 'requests.*'  # Expand
breadcrumb run -c myapp python script.py     # Trace more

Philosophy: Start minimal, expand based on what you discover.

This is much better than tracing everything and filtering later.

Performance

Breadcrumb is designed for production use:

Overhead benchmarks (1M function calls):

Baseline: 0.85s
With Breadcrumb (PEP 669): 0.87s (2.4% overhead)
With Breadcrumb (settrace): 0.89s (4.7% overhead)

Optimization strategies:

Selective tracing (include patterns)
Async database writes
Value truncation (prevent bloat)
Smart sampling (optional)

Security

Automatic secret redaction prevents sensitive data from reaching traces:

Detected patterns:

Password fields (password, passwd, pwd, secret)
API keys (api_key, apikey, token, bearer)
Credit cards (16-digit patterns)
SSNs (XXX-XX-XXXX format)
JWTs (eyJ... tokens)
AWS keys (AKIA... patterns)
GitHub tokens (ghp_... patterns)

Custom patterns:

from breadcrumb.capture.secret_redactor import configure_redactor
configure_redactor(custom_patterns=['my_secret_*', 'internal_token'])

Requirements

Python 3.12+ (recommended for PEP 669 backend)
Python 3.10+ (falls back to sys.settrace backend)
DuckDB 1.4.1+
FastMCP 2.12.4+ (for MCP server)
Typer 0.19.2+ (for CLI)

Status

Current version: 0.2.0 (Beta)

Production-ready features:

✅ PEP 669 tracing backend
✅ Zero-code-change execution (breadcrumb run)
✅ Gap detection (--gaps command)
✅ Smart queries (no SQL needed)
✅ AI-FIRST design (explicit -c everywhere)
✅ Automatic secret redaction
✅ DuckDB storage
✅ MCP server integration

Coming soon:

🚧 --call command (show function I/O details)
🚧 --flow command (execution timeline)
📦 PyPI package
📊 Performance visualizations
☁️ Cloud storage backends

Contributing

We welcome contributions! This project is built for AI coding agents.

Development setup:

git clone https://github.com/AndreRatzenberger/breadcrumb-tracer.git
cd breadcrumb-tracer
uv pip install -e .
pytest tests/

Key principles:

AI-FIRST design (explicit over implicit)
Zero code changes required
Performance matters (< 5% overhead)
Test-driven development

License

MIT License - see LICENSE file for details.

Acknowledgments

Built with:

PEP 669 (sys.monitoring) - Low-overhead monitoring API
DuckDB - Embedded analytical database
FastMCP - MCP server framework
Typer - CLI framework

Special thanks to the Model Context Protocol team at Anthropic for enabling AI-native tooling.

The Bottom Line

Humans debug with debuggers. AI agents debug with execution traces.

Breadcrumb gives AI coding agents the X-ray vision they need to understand what your code actually does - not just what it's supposed to do.

No code changes. No manual logging. No guessing.

Just structured execution data, queryable in real-time.

That's Breadcrumb. 🍞

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.14

Oct 12, 2025

0.2.13

Oct 12, 2025

0.2.12

Oct 12, 2025

0.2.10

Oct 11, 2025

0.2.9

Oct 11, 2025

0.2.8

Oct 11, 2025

0.2.7

Oct 11, 2025

This version

0.2.6

Oct 11, 2025

0.2.5

Oct 11, 2025

0.2.4

Oct 11, 2025

0.2.3

Oct 11, 2025

0.2.2

Oct 11, 2025

0.2.1

Oct 11, 2025

0.2.0

Oct 11, 2025

0.1.0

Oct 11, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

breadcrumb-0.2.6.tar.gz (334.2 kB view details)

Uploaded Oct 11, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

breadcrumb-0.2.6-py3-none-any.whl (99.7 kB view details)

Uploaded Oct 11, 2025 Python 3

File details

Details for the file breadcrumb-0.2.6.tar.gz.

File metadata

Download URL: breadcrumb-0.2.6.tar.gz
Upload date: Oct 11, 2025
Size: 334.2 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.1

File hashes

Hashes for breadcrumb-0.2.6.tar.gz
Algorithm	Hash digest
SHA256	`5f7396413f9ccf4085fe2433203364a2f95f8e77ec6b2a0d7a2d384ff8badab8`
MD5	`5c1dc4078c4745538b89ce0af0b7f324`
BLAKE2b-256	`8918c7976693ccd0a0134251f1c87d8e86bf92c754447897387ee41836dc8882`

See more details on using hashes here.

File details

Details for the file breadcrumb-0.2.6-py3-none-any.whl.

File metadata

Download URL: breadcrumb-0.2.6-py3-none-any.whl
Upload date: Oct 11, 2025
Size: 99.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.9.1

File hashes

Hashes for breadcrumb-0.2.6-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1ef2cf62ca02bd70b40a676cc7362ac4b27e21df173272ca6bbb6437aea9d7fe`
MD5	`3cbe30b645f47ca0ff787c42ef801b8d`
BLAKE2b-256	`fc089ab1c0d4a69cdc534694ddf87ccb5a5ecb8bdb3efc9136c47bf9c3ff244f`

See more details on using hashes here.

breadcrumb 0.2.6

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

🍞 Breadcrumb: Automatic Execution Tracing

🤖 AI-FIRST Design

📊 Smart Queries (No SQL Required)

🚫 Zero Code Changes

🔒 Automatic Secret Redaction

⚡ Low Overhead

Quick Start

Installation

The Complete Workflow

AI-FIRST Design Philosophy

Why -c PROJECT is REQUIRED

Command Reference

Core Commands

Query Examples

Config Examples

Use Cases for AI Coding Agents

1. Understanding Legacy Code

2. Debugging API Integration Issues

3. Performance Investigation

4. Test Failure Analysis

MCP Integration (Claude Desktop)

Advanced Configuration

Python API (Optional)

Environment Variables

Database Schema

How It Works

Architecture

Include-Only Workflow

Performance

Security

Requirements

Status

Contributing

License

Acknowledgments

The Bottom Line

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Why `-c PROJECT` is REQUIRED