Export codebase structure and contents for AI/LLM context

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

diffctx — smart diff context for LLM code review

diffctx selects the minimum code an LLM needs to review a git diff. Instead of pasting whole files, it walks the dependency graph from the changed lines outward and stops as soon as additional context stops paying for itself.

Why not just use `tree` or repomix?

	`tree`	repomix	Claude Code Review	diffctx
Primary use case	directory listing	full repo export	automated PR review	diff context for code review
Smart diff context	✗	✗	✓	✓
Works with any LLM	✓	✓	Claude only	✓
Free / local / offline	✓	✓	$15–25/review	✓
GitHub required	✗	✗	✓	✗
Multiple output formats	✗	limited	—	YAML/JSON/MD/txt
Python API	✗	✗	✗	✓
MCP server	✗	✗	✗	✓

Install (30 seconds)

pipx install diffctx                    # recommended — isolated CLI, no venv needed
pip install diffctx                     # or: into an active environment
pipx install 'diffctx[tree-sitter]'     # + AST parsing for smarter diff context
pipx install 'diffctx[mcp]'             # + MCP server for AI assistants

For everyday use, install once with pipx and call diffctx from any directory. Do not source the project's .venv to run diffctx from another repo — that runs a working-tree build and mutates the shell's PATH/PYTHONHOME for every subsequent command.

diffctx . --diff HEAD~1       # smart context for last commit → paste into Claude/ChatGPT
diffctx . -f md -c            # full export → clipboard in Markdown

diffctx demo: running diffctx . --diff HEAD~1 inside a git repo and copying the relevance-ranked YAML output to the clipboard for an LLM

Demo: diffctx . --diff HEAD~1 selects only the fragments — functions, imports, type definitions — that an LLM actually needs to review the last commit, instead of dumping every changed file in full.

Standalone binary (no Python required): download from the releases page.

Diff context mode works out of the box. Adding [tree-sitter] enables AST-level parsing for more accurate context selection across 30+ languages.

Diff Context Mode

Automatically finds the minimal set of code fragments needed to understand a change — imports, callers, type definitions, config dependencies — without dumping entire files. Understands 50+ file types.

name: myproject
type: diff_context
fragment_count: 5
fragments:
  - path: src/main.py
    lines: "10-25"
    kind: function
    symbol: process_data
    content: |
      def process_data(items):
          ...

How it works

Builds a code graph (imports, co-changes, type refs) and propagates relevance from changed lines outward across it. Three scoring modes are available — pick one with --scoring:

`--scoring`	What it does
`ego` (default)	Bounded ego-network expansion around changed nodes — fast, predictable radius, the current default
`ppr`	Personalized PageRank with damping `--alpha` — global, smoother decay, slower
`bm25`	Lexical fragment retrieval against the diff hunks — useful as a baseline / fallback when the graph is sparse

Selection stops when relevance drops below --tau (the minimum score a fragment must beat to be kept), or once --budget tokens have been emitted, whichever comes first.

Flag	Default	Description
`--scoring`	`ego`	Scoring mode: `ego`, `ppr`, or `bm25`
`--budget`	auto	Token cap. `auto` lets selection converge; `-1` disables the cap; `N` enforces a fixed cap
`--alpha`	0.60	How tightly context clusters around changes (PPR damping; 0–1, higher = more focused)
`--tau`	0.12	Minimum relevance required to include a fragment (lower = more context)
`--full`	false	Include every changed fragment; skip the smart-selection step entirely

Calibration of --alpha, --tau, and the edge-weight priors is documented in docs/parameter-strategy.md.

Theory: Context-Selection for Git Diff (Zenodo, 2026).

`graph` subcommand

For exploring the underlying dependency graph directly (without a diff), use the graph subcommand:

diffctx graph .                                  # Mermaid graph of directory deps (default)
diffctx graph . --summary                        # cycles, hotspots, coupling metrics
diffctx graph . --level fragment -f json         # fragment-level graph as JSON
diffctx graph . --level file -f graphml -o g.xml # file-level graph as GraphML

Flag	Default	Description
`-f/--format`	`mermaid`	Output format: `mermaid`, `json`, or `graphml`
`--level`	`directory`	Granularity: `fragment`, `file`, or `directory`
`--summary`	false	Print graph statistics (cycles, hotspots, coupling)

Usage

# full codebase export:
diffctx .                                # YAML to stdout + token count
diffctx . -f md -c                       # Markdown → clipboard
diffctx . -f json -o tree.json           # JSON → file
diffctx . --no-content                   # structure only, no file contents
diffctx . --max-depth 3                  # limit depth
diffctx . -i custom.ignore               # custom ignore patterns

# diff context mode (requires git repo):
diffctx . --diff                         # uncommitted changes (working tree vs HEAD)
diffctx . --diff HEAD~1                  # context for last commit
diffctx . --diff main..feature           # context for feature branch
diffctx . --diff HEAD~1 --budget 30000   # limit to ~30k tokens
diffctx . --diff HEAD~1 -c               # diff context to clipboard

Full codebase export output format:

name: myproject
type: directory
children:
  - name: main.py
    type: file
    content: |
      def hello():
          print("Hello, World!")
  - name: utils/
    type: directory
    children:
      - name: helpers.py
        type: file
        content: |
          def add(a, b):
              return a + b

Token Counting

Token count and size are always displayed on stderr:

12,847 tokens (o200k_base), 52.3 KB

For large outputs (>1MB), approximate counts with ~ prefix:

~125,000 tokens (o200k_base), 5.2 MB

Uses tiktoken with o200k_base encoding (GPT-4o tokenizer).

Clipboard Support

Copy output directly to clipboard with -c or --copy:

diffctx . -c                       # copy (stdout suppressed, stderr: token count)
diffctx . -c -o tree.yaml          # copy + save to file

System Requirements:

macOS: pbcopy (pre-installed)
Windows: clip (pre-installed)
Linux (Wayland): wl-copy
Linux (X11): xclip or xsel

Python API

from diffctx import map_directory
from diffctx import to_yaml, to_json, to_text, to_markdown

tree = map_directory(
    path,                     # directory path
    max_depth=None,           # limit traversal depth
    no_content=False,         # exclude file contents
    max_file_bytes=None,      # skip large files
    ignore_file=None,         # custom ignore file
    no_default_ignores=False, # disable default ignores
    whitelist_file=None,      # include-only filter
)

yaml_str = to_yaml(tree)
json_str = to_json(tree)
text_str = to_text(tree)
md_str = to_markdown(tree)

# Diff context mode
from pathlib import Path
from diffctx import build_diff_context, to_yaml

ctx = build_diff_context(
    Path("."),                # repository root
    "HEAD~1..HEAD",           # diff range; also accepts "main..feature"
    budget_tokens=None,       # None = convergence-based (default)
                              #   0  = diff only, no expansion (recall floor)
                              #  <0  = unlimited (10M-token soft ceiling)
                              #  >0  = explicit token cap
    alpha=0.6,                # PPR damping factor
    tau=0.12,                 # stopping threshold
    full=False,               # skip smart selection
    scoring_mode="ego",       # "ego" (default), "ppr", or "bm25"
    timeout=300,              # seconds before the pipeline aborts
)
yaml_str = to_yaml(ctx)

MCP Server

diffctx includes an MCP server that lets AI assistants (Claude Code, Cursor, Windsurf, etc.) call diff context analysis automatically during code review.

pip install 'diffctx[mcp]'

Add to your MCP client config (e.g. ~/.claude/mcp.json for Claude Code):

{
  "mcpServers": {
    "diffctx": {
      "command": "diffctx-mcp"
    }
  }
}

The server exposes a get_diff_context tool. Your AI assistant will automatically call it when reviewing PRs, explaining changes, or investigating broken tests — no manual invocation needed.

See src/diffctx/mcp/README.md for configs for Cursor, Continue, Windsurf, and Zed.

Ignore Patterns

Respects .gitignore and .diffctx/ignore automatically. Use --no-default-ignores to disable built-in patterns (.gitignore and .diffctx/ignore still apply).

Hierarchical: nested ignore files at each directory level
Negation patterns: !important.log un-ignores a file
Anchored patterns: /root_only.txt matches only in root
Output file is always auto-ignored

Auto-discovered files:

.diffctx/ignore — diffctx-specific ignore patterns
.diffctx/whitelist — Include-only filter (only matched files included)

Content Placeholders

<file too large: N bytes> — exceeds --max-file-bytes
<binary file: N bytes> — binary file detected
<unreadable content: not utf-8> — not valid UTF-8
<unreadable content> — permission denied or I/O error

License

Apache 2.0

Changelog
Security policy — threat model and vulnerability reporting
Parameter strategy — how --alpha, --tau, and edge weights are calibrated

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

nikolayer

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

1.10.2

Jul 1, 2026

1.10.1

Jul 1, 2026

1.10.0

Jun 20, 2026

1.9.1

Jun 14, 2026

1.8.1

Jun 14, 2026

1.7.1

May 16, 2026

1.7.0 yanked

May 16, 2026

0.1.0 yanked

May 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

diffctx-1.10.2.tar.gz (231.5 kB view details)

Uploaded Jul 1, 2026 Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

diffctx-1.10.2-cp310-abi3-win_amd64.whl (8.5 MB view details)

Uploaded Jul 1, 2026 CPython 3.10+Windows x86-64

diffctx-1.10.2-cp310-abi3-manylinux_2_28_x86_64.whl (8.8 MB view details)

Uploaded Jul 1, 2026 CPython 3.10+manylinux: glibc 2.28+ x86-64

diffctx-1.10.2-cp310-abi3-manylinux_2_28_aarch64.whl (8.6 MB view details)

Uploaded Jul 1, 2026 CPython 3.10+manylinux: glibc 2.28+ ARM64

diffctx-1.10.2-cp310-abi3-macosx_11_0_arm64.whl (8.8 MB view details)

Uploaded Jul 1, 2026 CPython 3.10+macOS 11.0+ ARM64

File details

Details for the file diffctx-1.10.2.tar.gz.

File metadata

Download URL: diffctx-1.10.2.tar.gz
Upload date: Jul 1, 2026
Size: 231.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for diffctx-1.10.2.tar.gz
Algorithm	Hash digest
SHA256	`55750bb17b1813200677d10ca18c64c27a9ef69f9a690501fa9918dbc274cec3`
MD5	`4956ef9042bfc55aada414223a27e6d0`
BLAKE2b-256	`5fa677852a57170fe8f2e7132831485842c2d0ba4a757e011c6c37d794df936b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diffctx-1.10.2.tar.gz:

Publisher: cd.yml on nikolay-e/diffctx

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diffctx-1.10.2.tar.gz
- Subject digest: 55750bb17b1813200677d10ca18c64c27a9ef69f9a690501fa9918dbc274cec3
- Sigstore transparency entry: 2040011516
- Sigstore integration time: Jul 1, 2026
Source repository:
- Permalink: nikolay-e/diffctx@2cbdcf311109b3485315e2e3292e95bf7df6df2e
- Branch / Tag: refs/heads/main
- Owner: https://github.com/nikolay-e
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: cd.yml@2cbdcf311109b3485315e2e3292e95bf7df6df2e
- Trigger Event: workflow_dispatch

File details

Details for the file diffctx-1.10.2-cp310-abi3-win_amd64.whl.

File metadata

Download URL: diffctx-1.10.2-cp310-abi3-win_amd64.whl
Upload date: Jul 1, 2026
Size: 8.5 MB
Tags: CPython 3.10+, Windows x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for diffctx-1.10.2-cp310-abi3-win_amd64.whl
Algorithm	Hash digest
SHA256	`6f76847f25d98fa3284a52e59af0e7d782cb2b06a043a635b807937327af2698`
MD5	`531e6c41ca099b017ecac0d0b188f02c`
BLAKE2b-256	`73012632a899182c0a3ea4b0e1cdf2f905177a2e1a24d1aa5b897c642c72a8bd`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diffctx-1.10.2-cp310-abi3-win_amd64.whl:

Publisher: cd.yml on nikolay-e/diffctx

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diffctx-1.10.2-cp310-abi3-win_amd64.whl
- Subject digest: 6f76847f25d98fa3284a52e59af0e7d782cb2b06a043a635b807937327af2698
- Sigstore transparency entry: 2040011842
- Sigstore integration time: Jul 1, 2026
Source repository:
- Permalink: nikolay-e/diffctx@2cbdcf311109b3485315e2e3292e95bf7df6df2e
- Branch / Tag: refs/heads/main
- Owner: https://github.com/nikolay-e
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: cd.yml@2cbdcf311109b3485315e2e3292e95bf7df6df2e
- Trigger Event: workflow_dispatch

File details

Details for the file diffctx-1.10.2-cp310-abi3-manylinux_2_28_x86_64.whl.

File metadata

Download URL: diffctx-1.10.2-cp310-abi3-manylinux_2_28_x86_64.whl
Upload date: Jul 1, 2026
Size: 8.8 MB
Tags: CPython 3.10+, manylinux: glibc 2.28+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for diffctx-1.10.2-cp310-abi3-manylinux_2_28_x86_64.whl
Algorithm	Hash digest
SHA256	`3d94b701cab5961bf9849a6cab0b63feba22248485092df8c490002c3d3e8a77`
MD5	`d432fac7b9a757a063595a2f426bd169`
BLAKE2b-256	`b838ea1ec48d7dc24f19d8746b59386c8ae77546aabbe341adc13395a594b690`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diffctx-1.10.2-cp310-abi3-manylinux_2_28_x86_64.whl:

Publisher: cd.yml on nikolay-e/diffctx

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diffctx-1.10.2-cp310-abi3-manylinux_2_28_x86_64.whl
- Subject digest: 3d94b701cab5961bf9849a6cab0b63feba22248485092df8c490002c3d3e8a77
- Sigstore transparency entry: 2040011588
- Sigstore integration time: Jul 1, 2026
Source repository:
- Permalink: nikolay-e/diffctx@2cbdcf311109b3485315e2e3292e95bf7df6df2e
- Branch / Tag: refs/heads/main
- Owner: https://github.com/nikolay-e
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: cd.yml@2cbdcf311109b3485315e2e3292e95bf7df6df2e
- Trigger Event: workflow_dispatch

File details

Details for the file diffctx-1.10.2-cp310-abi3-manylinux_2_28_aarch64.whl.

File metadata

Download URL: diffctx-1.10.2-cp310-abi3-manylinux_2_28_aarch64.whl
Upload date: Jul 1, 2026
Size: 8.6 MB
Tags: CPython 3.10+, manylinux: glibc 2.28+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for diffctx-1.10.2-cp310-abi3-manylinux_2_28_aarch64.whl
Algorithm	Hash digest
SHA256	`e2dba8d60282cc8dec566468b8ae52d6e76566d40fe7e4cc6c8407d5afdecea5`
MD5	`a242bc7dbc6a12d404658a4be7a1475f`
BLAKE2b-256	`089e852833a3d634a41eab50bec551604ae20ec091b55f6018c6a1466ff19b90`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diffctx-1.10.2-cp310-abi3-manylinux_2_28_aarch64.whl:

Publisher: cd.yml on nikolay-e/diffctx

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diffctx-1.10.2-cp310-abi3-manylinux_2_28_aarch64.whl
- Subject digest: e2dba8d60282cc8dec566468b8ae52d6e76566d40fe7e4cc6c8407d5afdecea5
- Sigstore transparency entry: 2040011688
- Sigstore integration time: Jul 1, 2026
Source repository:
- Permalink: nikolay-e/diffctx@2cbdcf311109b3485315e2e3292e95bf7df6df2e
- Branch / Tag: refs/heads/main
- Owner: https://github.com/nikolay-e
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: cd.yml@2cbdcf311109b3485315e2e3292e95bf7df6df2e
- Trigger Event: workflow_dispatch

File details

Details for the file diffctx-1.10.2-cp310-abi3-macosx_11_0_arm64.whl.

File metadata

Download URL: diffctx-1.10.2-cp310-abi3-macosx_11_0_arm64.whl
Upload date: Jul 1, 2026
Size: 8.8 MB
Tags: CPython 3.10+, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for diffctx-1.10.2-cp310-abi3-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`a64edeb9feab22adba63f02e8654d269e9a8971b88023aa29e2e947adab94490`
MD5	`a62d2a45beeae6bacb58e8630386ffe3`
BLAKE2b-256	`44226d93014c5e15577d7bf3e087154717c55b89704d934f47a5144a84220b13`

See more details on using hashes here.

Provenance

The following attestation bundles were made for diffctx-1.10.2-cp310-abi3-macosx_11_0_arm64.whl:

Publisher: cd.yml on nikolay-e/diffctx

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: diffctx-1.10.2-cp310-abi3-macosx_11_0_arm64.whl
- Subject digest: a64edeb9feab22adba63f02e8654d269e9a8971b88023aa29e2e947adab94490
- Sigstore transparency entry: 2040011752
- Sigstore integration time: Jul 1, 2026
Source repository:
- Permalink: nikolay-e/diffctx@2cbdcf311109b3485315e2e3292e95bf7df6df2e
- Branch / Tag: refs/heads/main
- Owner: https://github.com/nikolay-e
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: cd.yml@2cbdcf311109b3485315e2e3292e95bf7df6df2e
- Trigger Event: workflow_dispatch

diffctx 1.10.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

diffctx — smart diff context for LLM code review

Why not just use tree or repomix?

Install (30 seconds)

Diff Context Mode

How it works

graph subcommand

Usage

Token Counting

Clipboard Support

Python API

MCP Server

Ignore Patterns

Content Placeholders

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distributions

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Why not just use `tree` or repomix?

`graph` subcommand