Structured debug snapshots for LLM-assisted debugging

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

nschuler

These details have not been verified by PyPI

Project description

llmdebug logo

llmdebug

Structured debug snapshots for LLM-assisted debugging.

llmdebug captures failure-time evidence — exception details, prioritized stack frames, local variables, and execution context — as a machine-readable artifact that works for both humans and coding agents. The goal is to make the first failing run useful, rather than reconstructing state after the fact.

The Debugging Loop

Without structured evidence, a typical loop looks like:

fail → infer missing state → guess patch → rerun → repeat

With llmdebug, the loop becomes:

fail → read snapshot → ranked hypotheses → minimal patch → verify

Installation

pip install 'llmdebug[cli]'   # Recommended: pytest plugin + CLI
pip install llmdebug           # Core library + pytest plugin only
pip install 'llmdebug[mcp]'    # MCP server for IDE integration

Other extras: jupyter, toon, evals — see Configuration Reference.

Start Here

Use the package: start with the quick start below, then the Configuration Reference, CLI Reference, and MCP Reference.
Contribute to the package: read Contributing, Testing Guide, and Quality Map.
Run evals locally or in Docker: start with the Eval Quickstart, then Eval Framework, Eval Configs, and the Local/Docker Eval Runbook.
Find any command fast: Master Runbook — setup, quality, debug CLI, eval ops, analysis, and maintenance all in one place.
Browse canonical docs first: Docs Index separates active product docs from dated research notes and analyses.

Quick Start

Pytest (automatic)

Failing tests automatically create .llmdebug/latest.json:

pytest                                  # Failures create .llmdebug/latest.json
llmdebug                                # View crash summary
llmdebug show --detail context          # Full context (git, env, repro command)
llmdebug diff                           # Compare latest vs previous

Decorator

from llmdebug import SnapshotConfig, debug_snapshot

cfg = SnapshotConfig(out_dir=".llmdebug", redaction_profile="ci")

@debug_snapshot(config=cfg)
def main():
    data = load_data()
    process(data)

Context Manager

from llmdebug import SnapshotConfig, snapshot_section

cfg = SnapshotConfig(out_dir=".llmdebug", redaction_profile="prod")

with snapshot_section("data_processing", config=cfg):
    result = transform(data)

More entry points (production hooks, web middleware, Jupyter) in the Configuration Reference.

Features

Automatic capture — pytest plugin, decorator, context manager, production hooks, web middleware
Rich snapshots — exception chain, prioritized frames, typed locals (array shapes, dtypes), source context
Layered detail — crash (~2K tokens) → full (~5K) → context (~10K) disclosure levels
CLI inspection — show, list, frames, diff, git-context, clean
Shareable exports — llmdebug export generates single-file artifacts or redacted bundle reports for CI/team sharing
MCP server — 9 tools for Claude Code, Cursor, and other MCP-capable editors
Hypothesis engine — auto-generated ranked debugging hypotheses from snapshot patterns
Privacy controls — PII redaction profiles (dev / ci / prod), pattern-based redaction
Jupyter integration — cell-error banners + %llmdebug magic commands
Compact formats — json_compact (~40% smaller) and toon (~50% smaller) for LLM context

CLI

llmdebug                                # Latest snapshot (crash detail)
llmdebug show --detail full             # All frames
llmdebug show --detail context          # Everything (git, env, repro)
llmdebug show --json --detail context   # JSON output
llmdebug list                           # List snapshots
llmdebug list --search "KeyError"       # Search indexed snapshot metadata
llmdebug diff                           # Compare latest vs previous
llmdebug export latest --format html    # Shareable HTML artifact
llmdebug export --bundle-dir reports    # Redacted CI-safe bundle with manifest + index
llmdebug clean -k 5                     # Keep 5 most recent

Level	Content	~Tokens
`crash` (default)	Exception + crash frame	~2K
`full`	All frames + traceback	~5K
`context`	Everything (repro, git, env, coverage)	~10K

Full reference: docs/cli-reference.md

MCP Server

pip install 'llmdebug[mcp]'
llmdebug-mcp  # Start the MCP server (stdio transport)

Tool	Description
`llmdebug_diagnose`	Concise crash summary optimized for LLM consumption
`llmdebug_show`	Full expanded JSON snapshot with detail level control
`llmdebug_list`	List available snapshots with metadata
`llmdebug_frame`	Detailed view of a specific stack frame
`llmdebug_git_context`	On-demand enhanced git metadata for crash triage
`llmdebug_diff`	Compare two snapshots to show what changed
`llmdebug_rca_status`	Show latest RCA state for a session
`llmdebug_rca_history`	Show RCA attempt history
`llmdebug_rca_advance`	Manually advance RCA state machine

Claude Code configuration (.mcp.json):

{
  "mcpServers": {
    "llmdebug": {
      "command": "llmdebug-mcp"
    }
  }
}

Full reference: docs/mcp-reference.md

Output

On failure, .llmdebug/latest.json stores a versioned DebugSession envelope:

{
  "schema_version": "2.0",
  "kind": "llmdebug.debug_session",
  "session": {
    "name": "test_training_step",
    "timestamp_utc": "2026-01-27T14:30:52Z"
  },
  "snapshot": {
    "exception": {
      "type": "ValueError",
      "message": "operands could not be broadcast together..."
    },
    "frames": [
      {
        "file": "training.py",
        "line": 42,
        "function": "train_step",
        "code": "output = model(x) + residual",
        "locals": {
          "x": {"shape": [32, 64], "dtype": "float32"},
          "residual": {"shape": [32, 128], "dtype": "float32"}
        }
      }
    ]
  }
}

Configuration

Prefer config=SnapshotConfig(...) for decorator/context-manager capture settings. Legacy flat kwargs still work temporarily but emit a DeprecationWarning. Pytest plugin settings can still be driven by environment variables:

LLMDEBUG_OUTPUT_FORMAT=json pytest          # Use pretty JSON
LLMDEBUG_REDACTION_PROFILE=ci pytest        # Use CI redaction profile
LLMDEBUG_INCLUDE_GIT=false pytest           # Disable git context

Full parameter reference, output formats, and redaction profiles: docs/configuration.md

Documentation

Document	Description
Docs Index	Canonical docs map, grouped by reference, guides, ops, and research notes
Configuration Reference	Parameters, env vars, output formats, API surface
CLI Reference	Full CLI command reference
MCP Reference	MCP server JSON schemas and parameters
Architecture	System layers, capture pipeline, and design decisions
Testing Guide	Test workflows, snapshot-driven debugging, and integration guidance
Troubleshooting	Common setup/runtime failures and recovery steps
Eval Quickstart	Fastest copy-paste path for eval container builds, local model serving, and baseline/native runs
Eval Framework	Canonical starting point for running and interpreting evals
Eval Configs	Ready-made eval templates and TOML config mapping
Local/Docker Eval Runbook	Canonical eval-ops guide for local model endpoints, Docker wrapper runs, and SWE-Bench scoring
Contributing	Development setup and quality gates
Quality Map	Which checks block package changes vs staged eval work

Dated research notes, experiment logs, and roadmap documents remain under docs/, but they are supporting material rather than the primary starting point for package usage or eval operations.

License

MIT

Authors

Nicolas Schuler — schuler.nicolas@proton.me
Balint Mate
Vincenzo Scotti
Raffaela Mirandola

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

nschuler

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

3.0.0

Apr 6, 2026

2.27.0

Mar 14, 2026

2.25.0

Mar 7, 2026

2.24.0

Mar 5, 2026

2.23.0

Mar 4, 2026

2.22.0

Mar 4, 2026

2.21.0

Mar 3, 2026

2.20.0

Mar 3, 2026

2.19.2

Mar 2, 2026

2.19.1

Mar 2, 2026

2.19.0

Mar 1, 2026

2.18.0

Mar 1, 2026

2.17.2

Feb 27, 2026

2.17.1

Feb 27, 2026

2.17.0

Feb 27, 2026

2.16.0

Feb 23, 2026

2.15.0

Feb 22, 2026

2.14.7

Feb 22, 2026

2.14.6

Feb 21, 2026

2.14.5

Feb 20, 2026

2.14.4

Feb 20, 2026

2.14.3

Feb 20, 2026

2.14.2

Feb 19, 2026

2.14.1

Feb 19, 2026

2.14.0

Feb 19, 2026

2.13.1

Feb 19, 2026

2.13.0

Feb 19, 2026

2.12.0

Feb 18, 2026

2.11.4

Feb 18, 2026

2.11.3

Feb 18, 2026

2.11.2

Feb 18, 2026

2.10.1

Feb 17, 2026

2.10.0

Feb 16, 2026

2.9.0

Feb 16, 2026

2.8.5

Feb 16, 2026

2.8.4

Feb 16, 2026

2.8.3

Feb 15, 2026

2.8.2

Feb 14, 2026

2.8.1

Feb 14, 2026

2.8.0

Feb 13, 2026

2.7.0

Feb 13, 2026

2.6.0

Feb 11, 2026

2.5.1

Feb 10, 2026

2.5.0

Feb 10, 2026

2.4.0

Feb 10, 2026

2.3.0

Feb 10, 2026

2.2.0

Feb 10, 2026

2.1.0

Feb 8, 2026

2.0.1

Feb 8, 2026

2.0.0

Jan 31, 2026

1.0.1

Jan 28, 2026

0.2.0

Jan 28, 2026

0.1.4

Jan 27, 2026

0.1.3

Jan 27, 2026

0.1.2

Jan 27, 2026

0.1.1

Jan 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llmdebug-3.0.0.tar.gz (7.7 MB view details)

Uploaded Apr 6, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llmdebug-3.0.0-py3-none-any.whl (146.8 kB view details)

Uploaded Apr 6, 2026 Python 3

File details

Details for the file llmdebug-3.0.0.tar.gz.

File metadata

Download URL: llmdebug-3.0.0.tar.gz
Upload date: Apr 6, 2026
Size: 7.7 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.3 {"installer":{"name":"uv","version":"0.11.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for llmdebug-3.0.0.tar.gz
Algorithm	Hash digest
SHA256	`c9a3236c26daea08bb9ba3cd637e425f6108cc83908658798237e9eef5d03d3a`
MD5	`18d07bc498b6736e1f84fcc144160277`
BLAKE2b-256	`6f77027a02560bab2e20b4916154b7f81749598388c1515f1947e05d0efca1d7`

See more details on using hashes here.

File details

Details for the file llmdebug-3.0.0-py3-none-any.whl.

File metadata

Download URL: llmdebug-3.0.0-py3-none-any.whl
Upload date: Apr 6, 2026
Size: 146.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.3 {"installer":{"name":"uv","version":"0.11.3","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for llmdebug-3.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`69d02ae86c3dce9857a0a01ddb04206588f587f3e19c037be80bfb2d3353cc57`
MD5	`8a116d1d5d3399e6f4bcf9198a21d5d9`
BLAKE2b-256	`1ef01312a23bb7a3289482d90c65c217de6f98a3c81eab8158027a381aff45d6`

See more details on using hashes here.

llmdebug 3.0.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

llmdebug

The Debugging Loop

Installation

Start Here

Quick Start

Pytest (automatic)

Decorator

Context Manager

Features

CLI

MCP Server

Output

Configuration

Documentation

License

Authors

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes