Deterministic codebase context retrieval for LLMs

Project description

llm-diet

Stop sending your entire codebase to Claude. Inject only what matters.

Deterministic context retrieval for AI coding tools. Parses your repo into a call graph, scores every function against your query, and injects the top matches — before Claude starts reasoning. No embeddings, no vector DB, no LLM calls in the retrieval path.

The Problem

Every prompt you send carries your entire codebase as dead weight:

Without llm-diet:  Your prompt + 946,210 tokens of codebase → Claude → answer
With llm-diet:     Your prompt +     186 tokens of context  → Claude → same answer

Benchmark

This repo (185 nodes, 46k tokens)

Query	Baseline	With llm-diet	Reduction	Time
fix authentication bug	46,661	434	99.1%	187ms
add a new API endpoint	46,661	106	99.8%	203ms
debug memory leak	46,661	428	99.1%	172ms
add logging to the pipeline	46,661	58	99.9%	141ms

46,661 → 275 tokens average. 176ms overhead.

FastAPI repo (946k tokens — repo never seen before)

Query	Baseline	With llm-diet	Reduction	Time
fix authentication bug	946,210	87	>99.9%	359ms
add a new API endpoint	946,210	120	>99.9%	359ms
how does the database connection work	946,210	130	>99.9%	391ms
debug memory leak	946,210	436	>99.9%	1062ms
add input validation	946,210	244	>99.9%	375ms
explain the caching logic	946,210	114	>99.9%	313ms
fix error handling	946,210	221	>99.9%	406ms
add logging to the pipeline	946,210	136	>99.9%	328ms

946,210 → 186 tokens average. 432ms overhead.

How It Works

repo files (.py, .js, .ts, .jsx, .tsx)
   ↓
AST parser  (no LLM — pure tree-sitter)
   ↓
call graph  (.cecl/graph.json)
   ↓
query  →  keyword expansion  →  BFS traversal  →  top 5 functions
   ↓
injected into Claude before reasoning starts

Same query + same graph = same result. Deterministic by design.

Quick Start

pip install llm-diet
context-engine install    # indexes repo + configures your AI tool
# open Claude Code and start coding

Commands

Command	Description
`context-engine install`	Index repo and configure Claude Code / Cursor / Windsurf
`context-engine index .`	(Re)build the call graph
`context-engine query "fix auth bug"`	See what would be injected for a query
`context-engine apply "add endpoint"`	Plan → diff → validate → patch (needs `ANTHROPIC_API_KEY`)
`context-engine watch .`	Auto-reindex on file save

Platform Support

Platform	Integration	Token reduction
Claude Code	`UserPromptSubmit` hook — dynamic injection on every prompt	Full (186 tokens avg)
Cursor	Static rules file written to `.cursor/rules/`	Guides AI; no dynamic injection
Windsurf	Static rules file written to `.windsurf/rules/`	Guides AI; no dynamic injection

Dynamic injection for Cursor and Windsurf is on the roadmap.

Why Not RAG?

	llm-diet	Embeddings / RAG	code-review-graph
Retrieval method	AST + call graph	Vector similarity	AST + SQLite
LLM calls to retrieve	0	1+	0
Deterministic	Yes	No	Yes
Setup	`pip install` + `index`	Model + DB infra	`pip install` + `build`
Languages	Python, JS, TS, JSX, TSX	Any	23 languages
Autonomous apply	Yes	No	No
Works offline	Yes	No	Yes

We do less. What we do, we do surgically.

Contributing

Good first issues:

Dynamic injection for Cursor/Windsurf — extend beyond Claude Code's UserPromptSubmit
More language parsers — add Go, Rust, Java following the FileParseResult interface in parser.py
Better keyword expansion — improve domain-specific term mapping in retrieval.py

Open an issue or send a PR.

License

MIT

Project details

Release history Release notifications | RSS feed

0.1.13

Apr 23, 2026

0.1.12

Apr 23, 2026

0.1.11

Apr 23, 2026

0.1.10

Apr 23, 2026

0.1.9

Apr 23, 2026

0.1.8

Apr 23, 2026

0.1.7

Apr 23, 2026

0.1.6

Apr 22, 2026

This version

0.1.5

Apr 22, 2026

0.1.4

Apr 21, 2026

0.1.3

Apr 21, 2026

0.1.2

Apr 21, 2026

0.1.1

Apr 21, 2026

0.1.0

Apr 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

llm_diet-0.1.5.tar.gz (62.5 kB view details)

Uploaded Apr 22, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

llm_diet-0.1.5-py3-none-any.whl (68.9 kB view details)

Uploaded Apr 22, 2026 Python 3

File details

Details for the file llm_diet-0.1.5.tar.gz.

File metadata

Download URL: llm_diet-0.1.5.tar.gz
Upload date: Apr 22, 2026
Size: 62.5 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.7

File hashes

Hashes for llm_diet-0.1.5.tar.gz
Algorithm	Hash digest
SHA256	`aa26571a3566f9e86fa10c7b8ee4cb1b81bf335890008a7c8f49f6ded62fe636`
MD5	`c48c522c26b2b478ea3340118f93e784`
BLAKE2b-256	`ece2bb624b722ba7f6d42640e09eb0aaee1437f7bb60cd567db83a79043ddebe`

See more details on using hashes here.

File details

Details for the file llm_diet-0.1.5-py3-none-any.whl.

File metadata

Download URL: llm_diet-0.1.5-py3-none-any.whl
Upload date: Apr 22, 2026
Size: 68.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.7

File hashes

Hashes for llm_diet-0.1.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`78ef7072c05acd3466d900db07845ddd1371a3ed64d1ced8b6dfdaf4ccd7e2ea`
MD5	`47b08182b6132cc4e32cca97f84c37d4`
BLAKE2b-256	`9e8a8d6d0dd3e72669b6cf9a98de3d399a776393807d961ce678ff5d74036f01`

See more details on using hashes here.

llm-diet 0.1.5

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

llm-diet

The Problem

Benchmark

How It Works

Quick Start

Commands

Platform Support

Why Not RAG?

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes