Root-cause analysis for a PR or branch: classify feature vs bugfix, find the introducing commit (suspect set) or the blast radius.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

noordeen

These details have not been verified by PyPI

Project description

culprit

Root-cause analysis for a pull request or branch.

culprit looks at a PR (or the current branch), decides whether it's a bugfix or a feature, then:

Bugfix -> reconstructs the bug's life story. It blames the lines the fix removed/changed at the base revision to rank the commits that introduced it (the suspect set), surfaces what the author was trying to do (the introducing PR/commit + any linked issue), how long it lived and which releases shipped it, whether the file is a recurring hotspot, and whether the fix is actually complete (other untouched call sites, a missing test, a revert) - then explains why it broke.
Feature -> maps the blast radius: who imports the changed modules, which tests cover them, and which touched files live in high-risk shared/core areas.

It is read-only - it never modifies your repo or the PR.

Example

The visual report (rca --html report.html) for a bugfix - a one-line area formula silently broken by a perf commit and shipped across three releases before it was fixed. Top to bottom: the QA risk score with its factors, the introducing commit's intent (+ linked issue), the line-evolution timeline (created -> reformatted -> broke (red) -> fix (green)), the tests to run, the co-change files you may have missed, and suggested reviewers.

culprit RCA report

A self-contained HTML file (no server, no CDN) with deep links, the introducing PR's intent, a lifecycle strip (how long it lived and the releases that shipped it), a fix-completeness callout, a test-gap callout, and expandable per-step diffs.

Why the split design

The deterministic git work (diff parsing, git blame / git log -L, the suspect set, the reverse-import map) lives in a plain Python engine that emits structured JSON. The only LLM step - the "why it broke" narrative - is isolated behind a ReasoningAdapter:

HarnessAdapter - used by the Claude Code skill. Returns the structured result + a markdown skeleton; the agent writes the narrative. No API key.
ClaudeAPIAdapter - used standalone. Calls the Claude API (claude-opus-4-8 by default, --fast -> claude-sonnet-4-6).

Same engine, two frontends.

How it works

Everything runs off one normalized context (ctx) and produces one structured result (JSON). Each step is a small, deterministic module that reads git and writes a slice of that result; the only optional, non-deterministic step is the LLM narrative.

  PR / branch ---.
  stack trace ---+--> pr_context  -->  ctx  (diff, changed files, commits, host links)
                          |
                          v
                    classify   (bugfix vs feature, with evidence)
                   /                                  \
          bugfix  v                                    v  feature
   suspect   (blame the lines the fix removed)     blast_radius
     -> evolution     (how the line evolved)        (who imports the changed code,
     -> intent / lifecycle / completeness            covering tests, high-risk modules)
     -> test_gap
                   \                                  /
                    v                                v
                  report.build  -->  QA risk score
                          |
            + test_impact . coupling . owners . coverage
                          |
                          v
   reasoning (optional LLM "why")  -->  output:
      JSON  |  HTML report  |  markdown  |  --select-tests  |  --fail-on (CI exit code)

Resolve the target into ctx - pr_context tries gh, then the GitHub/GitLab REST API, then plain local git; --trace instead turns stack frames into a synthetic diff so the same pipeline can run on a crash.
Classify bugfix vs feature from branch/label/title/commit signals.
Analyze down one path: a bugfix gets the suspect set, line-evolution timeline, and the "bug's life story" (intent, lifecycle, completeness); a feature gets the blast radius.
Score & augment - report.build rolls the signals into a QA risk score, then test impact, co-change, reviewers, and (optional) coverage are attached.
Render - the structured result becomes JSON, a self-contained HTML report, markdown, a test list, or a CI exit code. The LLM "why" is the only step that needs a key.

Every step is read-only (git status is unchanged after any run) and repo-agnostic (no hardcoded paths/hosts). For the full module map and data shapes, see docs/ARCHITECTURE.md.

Install

pip install culprit            # engine + CLI (rca / culprit)
pip install "culprit[api]"     # + Claude API reasoning layer (anthropic SDK)

Or with pipx for an isolated CLI: pipx install culprit. From source: pip install -e ".[dev]" then pytest.

PR metadata uses the GitHub CLI when available: brew install gh && gh auth login. For public repos you don't even need gh - rca --pr N falls back to the unauthenticated REST API (GitHub and GitLab) for metadata plus a read-only git fetch of the PR/MR head (set GITHUB_TOKEN / GITLAB_TOKEN to raise rate limits). With neither, culprit uses local git (base vs head) - fully offline, minus PR title/labels.

Any host, any language

Hosts: deep links (commit / PR / file) are generated for GitHub, GitLab, Bitbucket, and Gitea; the suspect-set + line-evolution timeline work on any git repo regardless of host. For a self-hosted forge the URL can't disambiguate, so set host = "gitlab" (or github/bitbucket/gitea) in .culprit.toml, or CULPRIT_HOST.
Languages: suspect/timeline are language-agnostic (pure git blame/log -L). Blast-radius + test-gap detect imports across JS/TS, Python, Go, Java/Kotlin, Ruby, C/C++, C#, PHP, Rust, Scala, Swift (quoted and bare/dotted import forms).

Usage

rca                      # current branch vs the configured base (or latest commit)
rca --last               # just the latest commit ("the change I just made")
rca --pr 16786           # a specific GitHub PR (uses the PR's own base)
rca --repo /path --base main
rca --mode api --fast    # standalone reasoning via the Claude API
rca --json               # structured result only
rca --html report.html --open   # self-contained visual report (timeline UI)
rca --pr 16889 --bisect "pytest tests/test_x.py::test_y"   # confirm the suspect via git bisect
rca --pr 16889 --fail-on high   # QA gate: exit non-zero when risk is high (for CI)
rca --select-tests              # print the tests to run for this change (CI-pipeable)
rca --trace crash.txt           # RCA from a stack trace (no fix/PR/test needed)

More than a smarter `git bisect`

git bisect finds one introducing commit after you already have a reliable failing test. culprit is a QA tool that also works before a bug ships and from a symptom:

QA risk score + gate - one explainable score over test gap, fix completeness, hotspot recurrence, blast radius, and churn; --fail-on high gates CI (see above). Pass --coverage <lcov|cobertura> to replace the import heuristic with ground truth - it pinpoints exactly which changed lines are uncovered.
Test impact analysis - --select-tests lists the existing tests that reach the changed code (direct + transitive via the reverse-import graph). Pipe it: pytest $(rca --select-tests).
RCA from a stack trace - rca --trace crash.txt (or ... --trace - from stdin) parses a Python / JS / Java / Go trace, resolves the frames to repo files, and blames the crashing lines to a suspect commit - no fix, PR, or failing test required.
Predictive signals - co-change flags a file that usually changes with the ones you touched but is missing here ("did you forget X?"); reviewer suggestions come from CODEOWNERS + git authorship.

All of this is read-only and ships in the same self-contained HTML report.

QA risk score

Every report carries a single QA risk score (0-100, low/medium/high) that combines the signals culprit already computes - test gap, fix completeness, hotspot recurrence, blast radius, churn - into one explainable number, with the contributing factors listed (no ML, fully deterministic). --fail-on {low,medium,high} makes culprit exit non-zero when the level meets or exceeds the threshold, so it can act as a CI quality gate.

Use in CI (GitHub Actions)

culprit runs as a read-only QA gate: it generates the HTML report as a build artifact and signals risk via the exit code - it never comments on or writes to the PR. Copy examples/github-actions/culprit-pr.yml into .github/workflows/:

- uses: actions/checkout@v4
  with: { fetch-depth: 0 }          # full history for blame / git log -L
- uses: actions/setup-python@v5
  with: { python-version: "3.12" }
- run: pip install culprit
- env: { GH_TOKEN: "${{ github.token }}" }   # read-only PR metadata
  run: rca --pr ${{ github.event.pull_request.number }} --html culprit-report.html --no-save --fail-on high
- if: always()
  uses: actions/upload-artifact@v4
  with: { name: culprit-report, path: culprit-report.html }

The job fails only when the QA risk is high; the report is uploaded either way. To gate in any other CI, run rca ... --fail-on high and check the exit status.

culprit vs `git bisect`

Same goal - find the commit that introduced a bug - but opposite method:

	`git bisect`	culprit
Method	Dynamic - checks out commits and runs a test at each	Static - blames the fix's lines + `git log -L`
Needs a failing test?	Required	No
Runs your code?	Yes (serial checkouts)	No
Speed	Minutes (~log2(N) runs)	Instant
Answers	"first commit where the test fails"	suspect + how the line evolved + why + the introducing PR's intent + releases shipped + hotspot + fix completeness + test gap
Confidence	Proof (if the test is reliable)	Strong heuristic

culprit is not a reimplementation of bisect - it reasons statically from the patch and gives you the story, no test required. But when you do have a repro, --bisect "<cmd>" runs a real bisect (in a throwaway git worktree, so your checkout and HEAD are never touched) and stamps "✓ confirmed by git bisect" when the first failing commit matches the blamed suspect. The command must exit non-zero when the bug is present; --good <ref> / --bad <ref> override the search bounds (defaults: the suspect's parent and the base).

Visual HTML report

--html PATH writes a single self-contained HTML file (inline CSS/JS, data embedded, no CDN - opens offline, shareable, CI-attachable). For a bugfix it renders a line-evolution timeline: for each line the fix touched, every commit that ever changed those lines, from creation -> ... -> the commit that broke it (red) -> the fix (green), each step expandable to its diff.

rca --pr 16889 --html rca.html --open                 # narrative via --mode api if key set
rca --pr 16889 --html rca.html --narrative-file why.md # embed a pre-written narrative

The timeline needs no API key. The "Analysis" prose comes from --narrative-file (e.g. written by the Claude Code /rca skill) or from --mode api.

The report also includes: a TL;DR banner naming the prime suspect with a lifecycle strip (how long the bug lived and the releases that shipped it); the introducing PR's intent (title, linked issue, message body) on the suspect card; a fix-completeness callout (other untouched references to the changed symbols, whether a test was added, revert detection); deep links on every commit / PR / file (derived from origin); weight bars ranking the suspects; expand/collapse-all and a per-file filter for the timeline; and a one-click copy-as-markdown to paste into the PR.

Choosing the base branch

The base differs per repo (main, master, develop, a long-lived release branch, ...). Resolution order: --base <ref> -> CULPRIT_BASE env -> .culprit.toml (base = "...") -> the latest commit. The static HTML report is generated for one base (shown in the footer with a regenerate hint). For an interactive base picker, use serve mode:

rca serve --repo /path/to/repo     # opens http://127.0.0.1:8722

It launches a local web app (stdlib only - no extra deps) with a form: enter a PR/branch, pick the base from a dropdown (auto = just your latest commit, or any local/remote branch to diff your whole branch against), choose classification + reasoning, and run a fresh analysis that renders the same visual report. The base picker repopulates when you point it at a different repo.

Optional Credentials (a collapsible section on the form) let you paste a GitHub token (enables the PR field / private repos, like gh auth) and an Anthropic API key (enables the "Claude API narrative" reasoning). They're kept in the server process's memory only - never written to disk, never put in a URL (submitted via POST). Binds to localhost only.

Base branch

In local mode (no PR), culprit needs a base to diff against. Resolution order:

--base <ref> on the CLI
CULPRIT_BASE environment variable
base = "..." in a .culprit.toml at the repo root
otherwise the latest commit (HEAD~1)

So pin your repo's real base once and forget it:

# .culprit.toml
base = "origin/main"   # whatever your repo is actually cut from

--last always forces the latest-commit view regardless of config.

Tests

pip install -e ".[dev]" && pytest

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

noordeen

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.3.0

Jun 24, 2026

0.2.0

Jun 22, 2026

0.1.2

Jun 20, 2026

0.1.1

Jun 20, 2026

0.1.0

Jun 20, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

culprit-0.3.0.tar.gz (997.4 kB view details)

Uploaded Jun 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

culprit-0.3.0-py3-none-any.whl (70.4 kB view details)

Uploaded Jun 24, 2026 Python 3

File details

Details for the file culprit-0.3.0.tar.gz.

File metadata

Download URL: culprit-0.3.0.tar.gz
Upload date: Jun 24, 2026
Size: 997.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for culprit-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`5164ae6aea50fca5856df0626eced8bb5361a3c0d538765d1ef73429913f87ae`
MD5	`3df7b8f12b979b2b51a0206a42be3f8f`
BLAKE2b-256	`05ea3f34727942e010deacf50afaecf8b4a5b226357405081f74cda13097af9e`

See more details on using hashes here.

Provenance

The following attestation bundles were made for culprit-0.3.0.tar.gz:

Publisher: publish.yml on noordeen123/culprit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: culprit-0.3.0.tar.gz
- Subject digest: 5164ae6aea50fca5856df0626eced8bb5361a3c0d538765d1ef73429913f87ae
- Sigstore transparency entry: 1940252813
- Sigstore integration time: Jun 24, 2026
Source repository:
- Permalink: noordeen123/culprit@e86c1dba311359ee7c2dadb7ea1b51995de0f7d4
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/noordeen123
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@e86c1dba311359ee7c2dadb7ea1b51995de0f7d4
- Trigger Event: release

File details

Details for the file culprit-0.3.0-py3-none-any.whl.

File metadata

Download URL: culprit-0.3.0-py3-none-any.whl
Upload date: Jun 24, 2026
Size: 70.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for culprit-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1ffa97622acddffd9cdce5cc9f2d0b920ab302d66f6a0f4e300b7c4500bd74e8`
MD5	`4dcea6b3d9d0df6a8cd16e683da47417`
BLAKE2b-256	`c51eae684e04d74eb052bd3dce700a40cd072b3ea52e00bbb2da5b03805849c5`

See more details on using hashes here.

Provenance

The following attestation bundles were made for culprit-0.3.0-py3-none-any.whl:

Publisher: publish.yml on noordeen123/culprit

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: culprit-0.3.0-py3-none-any.whl
- Subject digest: 1ffa97622acddffd9cdce5cc9f2d0b920ab302d66f6a0f4e300b7c4500bd74e8
- Sigstore transparency entry: 1940252938
- Sigstore integration time: Jun 24, 2026
Source repository:
- Permalink: noordeen123/culprit@e86c1dba311359ee7c2dadb7ea1b51995de0f7d4
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/noordeen123
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@e86c1dba311359ee7c2dadb7ea1b51995de0f7d4
- Trigger Event: release

culprit 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

culprit

Example

Why the split design

How it works

Install

Any host, any language

Usage

More than a smarter git bisect

QA risk score

Use in CI (GitHub Actions)

culprit vs git bisect

Visual HTML report

Choosing the base branch

Base branch

Tests

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

More than a smarter `git bisect`

culprit vs `git bisect`