Skip to main content

Local context compiler for AI coding assistants — smallest correct context bundle with rationale

Project description

context-compiler

A local-first MCP server that indexes your Python and TypeScript codebase into a dependency graph and returns the smallest correct context bundle for any coding task, with a one-line rationale for every included symbol.

No cloud. No LLM API calls. No data leaves your machine.


The problem

When you ask Claude to fix a bug or add a feature, it reads files by guessing which ones are relevant. It over-reads (wastes tokens) or misses the file that actually matters. The bigger the codebase, the worse this gets.

How it works

Your task: "fix the payment retry logic"
         ↓
  Classify → BUG_FIX
         ↓
  Find entry nodes → payment_processor.py (BM25 + docstring matching)
         ↓
  Traverse graph → payment_processor.py + retry_handler.py + test_processor.py
         ↓
  Score + budget → 870 tokens (within 8000 limit)
         ↓
  Return symbol-level slices with line ranges + rationale per symbol

Everything (classification, traversal, scoring, rationale) is deterministic. Same repo + same task = same bundle, every time.


Getting started

pip install claude-context-compiler
cd /your/project
context-compiler init

That's it. init does three things in one step:

  1. Indexes your codebase into a local dependency graph
  2. Registers the MCP server with Claude Code
  3. Adds context-retrieval instructions to CLAUDE.md

Then open Claude Code in your project. It will call get_context automatically before reading files.

Requires Python 3.11+.

Multi-repo projects

If your project spans multiple repositories, pass them as dependencies:

context-compiler init --dependencies ../sbc-pay,../sbc-web

Each repo is indexed into its own graph. The dependency list is saved alongside the primary graph and picked up automatically when the MCP server starts, no need to re-specify. get_context queries all graphs and returns the best-matching symbols across all repos.

Other commands

# Re-index after large changes
context-compiler index

# Preview what context a task would produce (no Claude needed)
context-compiler explain --task "<Prompt>"

All commands default to the current directory. Pass --repo <path> to target a different path.

Optional: semantic fallback

For better matching when task terms don't appear in symbol names (e.g. "fix login flow" → finds authenticate_user):

pip install "claude-context-compiler[semantic]"

Downloads a 23MB ONNX model once, no PyTorch required.


MCP tools

get_context(task, budget=8000)

Returns the minimal symbol-level context bundle for a coding task.

{
  "slices": [
    {
      "file_path": "/abs/path/payments/processor.py",
      "line_start": 6,
      "line_end": 24,
      "rationale": "Included PaymentProcessor as primary task location (matched 'payment')"
    },
    {
      "file_path": "/abs/path/payments/retry_handler.py",
      "line_start": 12,
      "line_end": 38,
      "rationale": "Included RetryHandler because it is called by PaymentProcessor (depth 1)"
    }
  ],
}

Each slice points to the specific function or class that's relevant. Claude reads only those lines rather than the entire file.

refresh(changed_files)

Re-indexes the repository after file changes. Claude calls this automatically after making code edits to keep the graph current.


What makes it different

Task-type-aware traversal. A bug fix traverses inbound callers and test coverage at depth 2. A new feature traverses imports and sibling modules. A refactor traverses everything at depth 3.

Symbol-level slices. Returns exact line ranges for each relevant function or class, not whole files. Claude reads only what's needed. A 500-line file with one relevant function costs 40 tokens, not 500.

Rationale per symbol. Every included slice has a one-line explanation of why it's there. You can see exactly what Claude will read before it reads it.

Local-only. Embedded KuzuDB graph, no server, no port, no auth. Works offline.


Supported languages

Language Parsing Docstrings
Python tree-sitter-python ✓ (first line of docstring)
TypeScript / TSX tree-sitter-typescript ✓ (JSDoc /** */)
JavaScript / JSX tree-sitter-javascript ✓ (JSDoc /** */)

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

claude_context_compiler-1.0.1.tar.gz (64.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

claude_context_compiler-1.0.1-py3-none-any.whl (34.4 kB view details)

Uploaded Python 3

File details

Details for the file claude_context_compiler-1.0.1.tar.gz.

File metadata

  • Download URL: claude_context_compiler-1.0.1.tar.gz
  • Upload date:
  • Size: 64.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for claude_context_compiler-1.0.1.tar.gz
Algorithm Hash digest
SHA256 9af2a41be40f382adeca90af8417371ec21dbfaf6d0e14e3ff52e3d2fa69cf76
MD5 000cc2d92c34b98b32a6094b7876c5e6
BLAKE2b-256 6797c8b827121a180baf9fe1f9f577051b951377108266ae96782a319e8bd00b

See more details on using hashes here.

File details

Details for the file claude_context_compiler-1.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for claude_context_compiler-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 dc7e0688d122864b9395fd69cae1364ee6636a24977b8c9c0ea4208ec9ec9bb9
MD5 230400e56f0197ac9dfa94351ac9b53d
BLAKE2b-256 f102f3a09f16619ca6fa5e91859373126429b0ad2e4a31782507974c90d4d482

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page