Skip to main content

Local context compiler for AI coding assistants — smallest correct context bundle with rationale

Project description

context-compiler

A local-first MCP server that indexes your Python and TypeScript codebase into a dependency graph and returns the smallest correct context bundle for any coding task, with a one-line rationale for every included symbol.

No cloud. No LLM API calls. No data leaves your machine.


The problem

When you ask Claude to fix a bug or add a feature, it reads files by guessing which ones are relevant. It over-reads (wastes tokens) or misses the file that actually matters. The bigger the codebase, the worse this gets.

How it works

Your task: "fix the payment retry logic"
         ↓
  Classify → BUG_FIX
         ↓
  Find entry nodes → payment_processor.py (BM25 + docstring matching)
         ↓
  Traverse graph → payment_processor.py + retry_handler.py + test_processor.py
         ↓
  Score + budget → 870 tokens (within 8000 limit)
         ↓
  Return symbol-level slices with line ranges + rationale per symbol

Everything (classification, traversal, scoring, rationale) is deterministic. Same repo + same task = same bundle, every time.


Getting started

pip install claude-context-compiler
cd /your/project
context-compiler init

That's it. init does three things in one step:

  1. Indexes your codebase into a local dependency graph
  2. Registers the MCP server with Claude Code
  3. Adds context-retrieval instructions to CLAUDE.md

Then open Claude Code in your project. It will call get_context automatically before reading files.

Requires Python 3.11+.

Multi-repo projects

If your project spans multiple repositories, pass them as dependencies:

context-compiler init --dependencies ../sbc-pay,../sbc-web

Each repo is indexed into its own graph. The dependency list is saved alongside the primary graph and picked up automatically when the MCP server starts, no need to re-specify. get_context queries all graphs and returns the best-matching symbols across all repos.

Other commands

# Re-index after large changes
context-compiler index

# Preview what context a task would produce (no Claude needed)
context-compiler explain --task "<Prompt>"

All commands default to the current directory. Pass --repo <path> to target a different path.

Optional: semantic fallback

For better matching when task terms don't appear in symbol names (e.g. "fix login flow" → finds authenticate_user):

pip install "claude-context-compiler[semantic]"

Downloads a 23MB ONNX model once, no PyTorch required.


MCP tools

get_context(task, budget=8000)

Returns the minimal symbol-level context bundle for a coding task.

{
  "slices": [
    {
      "file_path": "/abs/path/payments/processor.py",
      "line_start": 6,
      "line_end": 24,
      "rationale": "Included PaymentProcessor as primary task location (matched 'payment')"
    },
    {
      "file_path": "/abs/path/payments/retry_handler.py",
      "line_start": 12,
      "line_end": 38,
      "rationale": "Included RetryHandler because it is called by PaymentProcessor (depth 1)"
    }
  ],
}

Each slice points to the specific function or class that's relevant. Claude reads only those lines rather than the entire file.

refresh(changed_files)

Re-indexes the repository after file changes. Claude calls this automatically after making code edits to keep the graph current.


What makes it different

Task-type-aware traversal. A bug fix traverses inbound callers and test coverage at depth 2. A new feature traverses imports and sibling modules. A refactor traverses everything at depth 3.

Symbol-level slices. Returns exact line ranges for each relevant function or class, not whole files. Claude reads only what's needed. A 500-line file with one relevant function costs 40 tokens, not 500.

Rationale per symbol. Every included slice has a one-line explanation of why it's there. You can see exactly what Claude will read before it reads it.

Local-only. Embedded KuzuDB graph, no server, no port, no auth. Works offline.


Supported languages

Language Parsing Docstrings
Python tree-sitter-python ✓ (first line of docstring)
TypeScript / TSX tree-sitter-typescript ✓ (JSDoc /** */)

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

claude_context_compiler-1.0.0.tar.gz (64.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

claude_context_compiler-1.0.0-py3-none-any.whl (34.1 kB view details)

Uploaded Python 3

File details

Details for the file claude_context_compiler-1.0.0.tar.gz.

File metadata

  • Download URL: claude_context_compiler-1.0.0.tar.gz
  • Upload date:
  • Size: 64.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.4

File hashes

Hashes for claude_context_compiler-1.0.0.tar.gz
Algorithm Hash digest
SHA256 f0a23c68d28eddcc24adbae983b95d923beb10a84af80bb1e446b94f95d11121
MD5 27c0f0ba74ee55d6191074c07adf4bf9
BLAKE2b-256 7b82399d812eca0be189f8c721b1eb4a021feb2290f2eb1b10fd9e6dcee0dc1b

See more details on using hashes here.

File details

Details for the file claude_context_compiler-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for claude_context_compiler-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bfb940fdbca8ec1f202b7a87dc89eb02c827afa11de4e24aa177f1453b098cdd
MD5 1e740c7888faa23493f3cfcd437c69b5
BLAKE2b-256 0fa45e276aca033286c58642957c020c60ad3b3280bd01d691030fea29bbf74b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page