Error log → runnable reproduction code generator

These details have not been verified by PyPI

Project links

Project description

log2repro

Error log → runnable reproduction code generator

Paste a Python traceback, get a self-contained reproduce.py that triggers the original error — with requirements.txt, mock data, and a verification README.

45 min → 3 min average reproduction time.

Why log2repro?

Engineers spend 60%+ of debug time on "guess parameters, dig through databases, mock third-party APIs".

Sentry tells you where the error happened. log2repro gives you a replayable scene.

	Sentry	log2repro
Output	Stack trace, breadcrumbs, user context	`reproduce.py` + `requirements.txt` + mock data
Integration	Requires SDK / code changes	Zero-invasion: paste text, CI log, or file
Solves	Monitor → discover → locate	Locate → construct env → verify fix

Features

AST-powered context extraction — extracts real variable names, function signatures, and imports from source code to constrain LLM output (no hallucinated libraries)
Sandbox verification — runs generated code in an isolated venv with network disabled, verifies the original error is actually reproduced
Auto-fix loop — if the script fails (SyntaxError, ModuleNotFoundError, etc.), feeds the error back to the LLM for targeted repair (up to 3 rounds)
Graceful degradation — after 3 failed fixes, outputs human-readable repair suggestions instead of silently failing
Multi-format support — Python tracebacks, chained exceptions, async errors, C extension errors, dynamic imports
LLM-agnostic — uses litellm, supports OpenAI, Anthropic, local models, etc.

Installation

# With uv (recommended)
uv pip install log2repro

# With pip
pip install log2repro

# From source
git clone https://github.com/ChanChiChoi/log2repro.git
cd log2repro
uv sync

Quick Start

# Full pipeline: parse → generate → sandbox → auto-fix → output
log2repro run error.log --output-dir ./repro_out

# From stdin
cat error.log | log2repro run - --output-dir ./repro_out

# Paste directly
log2repro run 'Traceback (most recent call last):
  File "app.py", line 10, in process
    result = api.fetch(user_id)
requests.exceptions.ConnectionError: Connection refused'

# Parse only (no LLM calls)
log2repro run error.log --dry-run

Output:

repro_out/
├── reproduce.py        # Minimal script that triggers the original error
├── requirements.txt    # pip dependencies
├── mock_data.json      # Test fixtures / mock data
└── README_repro.md     # Usage instructions + auto-fix history

Example reproduce.py:

"""Minimal reproduction for ConnectionError."""
from unittest.mock import patch, MagicMock

def test_api_connection():
    with patch("requests.get") as mock_get:
        mock_get.side_effect = ConnectionError("Connection refused")
        import requests
        requests.get("http://api/users/123")  # raises ConnectionError

if __name__ == "__main__":
    test_api_connection()

How It Works

┌─────────────┐     ┌──────────────┐     ┌──────────────┐     ┌───────────────┐
│  Parse Log  │────▶│  AST Extract │────▶│  LLM Generate│────▶│ Sandbox Verify│
│  (regex)    │     │  (ast module)│     │  (litellm)   │     │  (venv+subproc)│
└─────────────┘     └──────────────┘     └──────────────┘     └───────┬───────┘
                                                                      │
                                                    ┌─────────────────┼─────────────────┐
                                                    │ reproduced?     │ fixable error?   │
                                                    ▼                 ▼                  ▼
                                                ✅ Done          LLM Fix (×3)      Degraded +
                                                                          │         Suggestions
                                                                          ▼
                                                                    Re-sandbox

Parse — regex-based parser extracts file, line, error type, call chain from traceback
AST Extract — Python ast module pulls real variable names, function signatures, imports from source
LLM Generate — sends structured prompt (error + AST context) to LLM, parses Markdown code blocks
Sandbox Verify — creates venv, installs deps, runs script with network disabled, checks if original error appears in stderr
Auto-fix — if sandbox fails with a fixable error (SyntaxError, ImportError, etc.), feeds stderr back to LLM for targeted repair

CLI Reference

log2repro run <input> [OPTIONS]

Arguments:
  input                File path, "-" for stdin, or raw traceback text

Options:
  -m, --model TEXT     LLM model (default: gpt-4o)
  -n, --dry-run        Parse only, skip LLM generation
  -d, --output-dir     Output directory (default: ./repro_out)
  -o, --output         Write JSON to file (legacy mode)
  --sandbox-timeout    Max seconds for sandbox execution (default: 10)
  --max-refine         Max sandbox→LLM refinement rounds (default: 2)
  -v, --verbose        Enable verbose logging

Supported Error Formats

Category	Examples
Python tracebacks	`ValueError`, `KeyError`, `TypeError`, `AttributeError`
Chained exceptions	`During handling of the above exception...`
Async errors	`TaskGroup`, `asyncio.TimeoutError`, async generators
C extensions	`numpy._UFuncNoLoopError`, `sqlite3.OperationalError`, `struct.error`
Dynamic imports	`importlib`, `__import__`, lazy imports, module reload
Deep call stacks	Decorators, middleware, recursion, context managers, callbacks
Network/DB	`requests`, `httpx`, `aiohttp`, `sqlalchemy`, `psycopg2`

Evaluation

from log2repro.eval_metrics import evaluate_batch, GenerationInput

batch = evaluate_batch([
    GenerationInput(trace_name="api_500", files=files, tokens_used=800, expected_error="ValueError: x"),
])
print(batch.report())

Metric	Description
Runnable Rate ↑	Script executes without ImportError/SyntaxError
Dep Conflict Rate ↓	requirements.txt has no version conflicts
Mock Coverage ↑	External calls (network, DB) are properly mocked
Token Efficiency ↑	Errors reproduced per 1,000 tokens

Documentation

Full documentation: docs/index.md

Table of Contents

Guide	Reference	Development
Getting Started	Parser Reference	Architecture
User Guide	LLM Prompt Design	Contributing
	Evaluation Guide	Changelog
		Roadmap

Development

git clone https://github.com/ChanChiChoi/log2repro.git
cd log2repro
uv sync --group dev

# Run tests (385 tests, ~2.5 min)
uv run pytest -v

# Run benchmarks
uv run python -m benchmarks.runner

# Lint
uv run ruff check src/ tests/

Project Structure

log2repro/
├── src/log2repro/
│   ├── cli.py              # Typer CLI entry point
│   ├── eval_metrics.py     # 4 quality metrics
│   ├── parsers/            # Log parsing (traceback, sentry, CI)
│   ├── extractors/         # AST context extraction
│   ├── generators/         # LLM generation + prompts
│   ├── validators/         # Sandbox + auto-fix
│   └── utils/              # I/O helpers
├── tests/                  # 385 tests, 30+ fixtures
└── benchmarks/             # Prompt variant benchmarking

Acknowledgements

litellm — unified LLM API
Typer — CLI framework
Rich — terminal formatting
Pydantic — data validation

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.3.0

May 5, 2026

0.2.1

May 5, 2026

This version

0.1.0

May 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

log2repro-0.1.0.tar.gz (81.6 kB view details)

Uploaded May 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

log2repro-0.1.0-py3-none-any.whl (42.7 kB view details)

Uploaded May 5, 2026 Python 3

File details

Details for the file log2repro-0.1.0.tar.gz.

File metadata

Download URL: log2repro-0.1.0.tar.gz
Upload date: May 5, 2026
Size: 81.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for log2repro-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`3aaa90702b35377cbd71c779297d9eb127daa91aa77678c1b8b1f8a9b7deab27`
MD5	`e9aec0a62ec0d4cace46a75b4fc3d6af`
BLAKE2b-256	`dcd7dabce3cd3adeac0fd20d35c995a99d27453bc385ae0e51e719882b08cfbe`

See more details on using hashes here.

File details

Details for the file log2repro-0.1.0-py3-none-any.whl.

File metadata

Download URL: log2repro-0.1.0-py3-none-any.whl
Upload date: May 5, 2026
Size: 42.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for log2repro-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7139a2f59da6ebd0ebe27f23daef5177a437453ce35d1f8c086389ebb108aab5`
MD5	`48b200b104dda788276729cf03f613b6`
BLAKE2b-256	`0af40db3a125d90e2b4b1c9644e2404e90e53167dbf87ee056306cfdcd724ce1`

See more details on using hashes here.

log2repro 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

log2repro

Why log2repro?

Features

Installation

Quick Start

How It Works

CLI Reference

Supported Error Formats

Evaluation

Documentation

Development

Acknowledgements

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes