Compress any code repo into a compact skeleton to reduce LLM token usage

These details have not been verified by PyPI

Project links

Project description

rtt — reducethemtokens

Give any LLM a complete map of your codebase in a single, cheap read.

rtt extracts every file's imports, function signatures, class hierarchies, and method lists into a compact plain-text skeleton — typically 90% smaller than the raw source — and wires it into your agent's config so the map is available from the first message of every session.

The problem it solves

Modern coding agents (Cursor, Claude Code, Copilot) are good at retrieving context for specific, targeted queries. Ask about one bug, one function, one file — they find it.

But they struggle with the orientation problem: starting a session on an unfamiliar codebase, or asking questions that span the whole structure. Before an agent can retrieve the right context, it needs to understand what exists and where. Without that map, it either scans files speculatively (burning tokens) or makes wrong assumptions about structure.

rtt solves this by providing that map upfront, once, cheaply. The agent reads the skeleton at session start, knows the full API surface, and then opens only the specific files it actually needs.

rtt is not a replacement for agent retrieval. Retrieval is better for targeted, implementation-level tasks. rtt is the orientation layer that makes retrieval more accurate by giving the agent the right mental model before it starts searching.

When to use it

Use rtt when:

Starting a session on a codebase the agent hasn't seen before
The task involves cross-cutting changes across many files (a refactor, a rename, adding a feature that touches multiple layers)
You're using a chat interface (ChatGPT, Claude.ai, direct API) that has no built-in retrieval — every session starts from zero
You're building a CI pipeline, a code review bot, or any automated workflow where reproducible, deterministic context matters
You want to give an LLM repo context without setting up a vector store or any additional infrastructure

rtt is less useful when:

You're asking about one specific file or function — just open it
Your agent already has full retrieval and you're working on targeted, well-scoped tasks

Installation

pip install reducethemtokens

Requires Python 3.9+.

Quick start

cd your-repo
rtt install .

This writes .rtt/context.txt (the skeleton) and adds a short instruction to every supported agent config file — CLAUDE.md, AGENTS.md, .cursor/rules/, and others. The instruction tells the agent to read the skeleton at session start for orientation, then work normally from there.

Commit both files. Every collaborator and every future session gets the map automatically.

# After code changes — regenerate the skeleton
rtt update .

# See how many tokens the skeleton saves vs raw source
rtt compare .

Sample skeleton output for one file:

# rtt/bench.py [python]
imports: os, random, re, dataclasses.dataclass, dataclasses.field, pathlib.Path, typing.Optional, rtt.RepoIndex, rtt.Symbol
class BenchQuestion
class QuestionResult
class BenchReport
  def heuristic_score(self) -> float
  def heuristic_by_kind(self) -> dict[str, tuple[int, int]]
  def heuristic_failing(self) -> list[QuestionResult]
  def llm_score(self) -> Optional[float]
def generate_questions(repo: RepoIndex) -> list[BenchQuestion]
def score_heuristic(questions: list[BenchQuestion], repo: RepoIndex) -> list[QuestionResult]
def run_bench(path: str, use_llm: bool, llm_sample: int) -> BenchReport

Benchmark — Django (3,020 files)

Metric	Value
Raw codebase	6,464,961 tokens
rtt skeleton	585,421 tokens
Reduction	90.9%
Heuristic bench score	100.0% (13,665 / 13,670 questions)
Audit coverage (Python)	99.9% (34,454 / 34,480 symbols)
Audit coverage (JavaScript)	97.9% (46 / 47 symbols)

The heuristic bench auto-generates factual questions from the index — parameter names, return types, method lists, imports — and verifies every answer appears in the skeleton. 100% means no structural information was lost in compression.

Commands

`rtt install`

Index the repo, write the skeleton to .rtt/context.txt, and inject orientation instructions into every supported agent config file. Also installs a git pre-commit hook that regenerates the skeleton automatically on every commit.

rtt install .
rtt install . --platform claude    # single agent only
rtt install . --force              # overwrite existing rtt sections

Supported agents:

Agent	Config file
Claude Code	`CLAUDE.md`
Cursor	`.cursor/rules/rtt.mdc`
Windsurf	`.windsurfrules`
Codex / OpenAI	`AGENTS.md`
GitHub Copilot	`.github/copilot-instructions.md`
Kiro	`.kiro/steering/rtt.md`
Gemini CLI	`GEMINI.md`
Aider	`.aider/prompts/conventions.md`
Zed	`.rules`

The instruction added to each config file tells the agent to read .rtt/context.txt once at session start for orientation, then work normally. It does not restrict the agent from reading source files or using its own retrieval.

`rtt update`

Regenerate .rtt/context.txt after code changes. Does not touch agent config files. The git hook installed by rtt install runs this automatically on every commit.

rtt update .
rtt update . --diff    # show what symbols changed

`rtt uninstall`

Remove rtt instructions from all agent config files.

rtt uninstall .
rtt uninstall . --platform cursor
rtt uninstall . --clean    # also delete .rtt/context.txt

`rtt index`

Generate the skeleton and print to stdout, or write to a file. Useful for piping into other tools or building custom workflows.

rtt index .
rtt index /path/to/repo --output context.txt

`rtt compare`

Show token reduction statistics with a per-file breakdown.

rtt compare .
rtt compare . --diff HEAD~3..HEAD    # token delta for a git range

`rtt bench`

Measure how much structural information the skeleton retains.

rtt bench .                        # heuristic only, free, instant
rtt bench . --llm --sample 30      # semantic equivalence via Claude

The --llm mode requires ANTHROPIC_API_KEY and pip install "reducethemtokens[llm]".

`rtt audit`

Verify extraction accuracy: symbols found vs expected, and signature correctness.

rtt audit .

`rtt vs`

Compare token footprint against another repo-indexing tool (currently supports graphify).

pip install graphifyy
rtt vs .

`rtt view`

Render the skeleton as markdown and open in a pager.

rtt view .
rtt view . --output overview.md

Keeping the skeleton current

rtt install sets up a git pre-commit hook that runs rtt update automatically on every commit. For most solo workflows that is enough.

For teams, the hook only runs on machines where rtt is installed. A new contributor who clones the repo without installing rtt will not regenerate the skeleton. Two approaches:

Add a CI step that regenerates and commits the skeleton on every merge to main:

# .github/workflows/rtt.yml
name: Update rtt index
on:
  push:
    branches: [main]
jobs:
  rtt:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4
      - run: pip install reducethemtokens
      - run: rtt update .
      - uses: stefanzweifel/git-auto-commit-action@v5
        with:
          commit_message: "chore: update rtt index"
          file_pattern: ".rtt/context.txt"

Or document it in your contributing guide:

# after pulling changes
rtt update .
git add .rtt/context.txt

The first line of .rtt/context.txt includes the generation timestamp and file count, so agents can detect a stale index without reading the whole file.

Large repos and context window limits

If the skeleton is too large for your model's context window, use --max-tokens to trim it to fit. rtt will keep non-test files with the most symbols and drop the rest:

rtt install . --max-tokens 100000    # fits in most 128k-window models
rtt install . --max-tokens 50000     # conservative

Or limit to specific directories:

rtt install . --include 'src/**' --include 'lib/**' --exclude 'tests/**'

Python API

import rtt

repo = rtt.index("/path/to/repo")

print(repo.token_count)    # int
print(repo.text)           # full skeleton as a string

for file in repo.files:
    print(file.path, file.language)
    print(file.imports)    # e.g. ["pathlib.Path", "typing.Optional"]
    for sym in file.symbols:
        print(sym.name, sym.kind, sym.signature)
        for child in sym.children:
            print(" ", child.signature)

report = rtt.compare("/path/to/repo")
print(f"{report.reduction_pct:.1f}% reduction")
print(f"{report.raw_tokens:,} → {report.compressed_tokens:,} tokens")

Supported languages

Python, JavaScript, TypeScript, Go, Rust, Java, C, C++, Ruby.

How it works

rtt parses each file with tree-sitter and walks the AST to collect top-level definitions: functions, classes, methods, and imports. Function bodies are discarded. The output is one line per symbol, indented to show class membership, with imports resolved to specific symbols (from pathlib import Path → pathlib.Path).

Results are cached by file content hash. Subsequent runs on large repos are fast.

Development

git clone https://github.com/yttrium400/reducethemtokens-rtt-
cd reducethemtokens-rtt-
pip install -e ".[dev]"
pytest tests/

91 tests. No network calls required.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.6.1

May 16, 2026

0.6.0

May 16, 2026

0.5.2

May 16, 2026

0.5.1

May 16, 2026

0.5.0

May 16, 2026

0.4.1

May 15, 2026

0.4.0

May 14, 2026

This version

0.3.0

May 14, 2026

0.2.0

May 14, 2026

0.1.0

May 14, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

reducethemtokens-0.3.0.tar.gz (47.1 kB view details)

Uploaded May 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

reducethemtokens-0.3.0-py3-none-any.whl (42.8 kB view details)

Uploaded May 14, 2026 Python 3

File details

Details for the file reducethemtokens-0.3.0.tar.gz.

File metadata

Download URL: reducethemtokens-0.3.0.tar.gz
Upload date: May 14, 2026
Size: 47.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for reducethemtokens-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`fafc0eb92138f41b40d0addc09d384ec673735bcdb198d42c74434ba03fb56f0`
MD5	`e30978b31de6414c35bb1871d3df4b96`
BLAKE2b-256	`a960e8205a2ba70f3076564384a401a6a6158e739683705fa33c904371b2e9d1`

See more details on using hashes here.

File details

Details for the file reducethemtokens-0.3.0-py3-none-any.whl.

File metadata

Download URL: reducethemtokens-0.3.0-py3-none-any.whl
Upload date: May 14, 2026
Size: 42.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for reducethemtokens-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a7d62fd880a99782a79d39cc66df5dd8a8f1eea4ad68930c5db68626cd1fcb73`
MD5	`ac0da865ba738579bd554dd15753a0b8`
BLAKE2b-256	`ff111bb6273abde3889795a864e1f86788ad684eb631efc745c6615adb27817c`

See more details on using hashes here.

reducethemtokens 0.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

rtt — reducethemtokens

The problem it solves

When to use it

Installation

Quick start

Benchmark — Django (3,020 files)

Commands

rtt install

rtt update

rtt uninstall

rtt index

rtt compare

rtt bench

rtt audit

rtt vs

rtt view

Keeping the skeleton current

Python API

Supported languages

How it works

Development

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

`rtt install`

`rtt update`

`rtt uninstall`

`rtt index`

`rtt compare`

`rtt bench`

`rtt audit`

`rtt vs`

`rtt view`