Skip to main content

Crisp repos. Sharp AI. Give AI the blueprint, not the code.

Project description

Cresbee Crespo

Version License Python Downloads Stars

Give AI the blueprint, not the code.

Stop burning your context window on raw source files.
Crespo extracts what matters — and compresses everything else.

pip install crespo && crespo ./myproject

The Problem

You paste a codebase into any LLM. It hits the context limit but you problem remains unfinished. You paste one file at a time. The AI loses the big picture. You give AI the full code. The AI reads 40,000 tokens linearly and still misses the architecture.

Crespo solves this differently.

Instead of concatenating raw files, Crespo uses Tree-sitter AST parsing to extract only the structural DNA of your repository — imports, classes, functions, module connections — and emits a compact XML blueprint. Same architectural understanding. A fraction of the tokens.


Usage

# Structure mode (default)
crespo ./myproject

# Summary mode — requires Groq key
crespo ./myproject --mode summary --groq YOUR_KEY

# Save your Groq key for future runs
crespo --groq YOUR_KEY

# Concat mode — full source, redacted
crespo ./myproject --mode concat

# Analyse a GitHub repo directly
crespo --git https://github.com/user/repo

# Custom output filename
crespo ./myproject --output blueprint.xml

How It Works

your repo
    │
    ▼
 walker          respects .gitignore · skips tests · skips build artifacts
    │
    ▼
 tree-sitter     real AST parse · 10 languages · no regex
    │
    ▼
 extractor       imports · classes · functions · structs · enums
    │
    ▼
 blueprint XML   compact · structured · LLM-ready

No heuristics. No regex scraping. Real language grammars via Tree-sitter — the same parser used by GitHub, Neovim, and Zed.


Modes

Mode What it produces Best for
structure AST skeleton — imports, classes, functions Architecture review, onboarding, LLM context
summary Structure + AI one-line descriptions per file Deeper codebase understanding
concat Full source, secrets redacted, in structured XML Passing entire repos to LLMs safely

Languages supported

Python · JavaScript · TypeScript · JSX · TSX · Rust · Go · Java · C · C++


Example output

<?xml version='1.0' encoding='utf-8'?>
<repo n="kara" s="Gesture-controlled PDF viewer using PyQt6, MediaPipe, and OpenCV.">
  <meta>
    <dep>cv2,mediapipe,numpy,PyQt6,fitz,groq,markdown</dep>
  </meta>
  <files>
    <f p="Ui.py" e=".py" s="Main PyQt6 window coordinating PDF rendering, gesture input, and AI summarisation.">
      <imp>PyQt6,fitz,render,summarise,gesture,markdown</imp>
      <cls n="Window">
        <fn n="summary" p="(self)" />
        <fn n="startGest" p="(self, state)" />
        <fn n="gestZoom" p="(self, state: int)" />
      </cls>
    </f>
    <f p="gesture.py" e=".py" s="MediaPipe hand tracking with gesture classification and debouncing.">
      <imp>mediapipe,cv2,numpy</imp>
      <cls n="GestureController">
        <fn n="detect" p="(self, frame)" />
        <fn n="classify" p="(self, landmarks)" />
      </cls>
    </f>
  </files>
</repo>

Benchmarks

Tested on real open-source repositories. Structure accuracy evaluated by asking an LLM three questions from the blueprint alone — no access to the original source.

Structure mode accuracy

Repo Components & connections Dependencies Entry point Score
Axios ✅ correct and specific ✅ correct and specific ✅ correct and specific 3/3
Express ✅ correct and specific ✅ correct and specific ✅ correct and specific 3/3
Kara ✅ correct and specific ✅ correct and specific ✅ correct and specific 3/3
Moodilist ✅ correct and specific ✅ correct and specific ✅ correct and specific 3/3
Requests ✅ correct and specific ✅ correct and specific ⚠️ partially correct 2/3
Urai ✅ correct and specific ✅ correct and specific ⚠️ partially correct 2/3
FastAPI ✅ correct and specific ✅ correct and specific ⚠️ partially correct 2/3
Flask ✅ correct and specific ✅ correct and specific ⚠️ partially correct 2/3
Average 2.75 / 3

⚠️ Entry points are partially correct on framework-level repos (FastAPI, Flask) and convention-driven repos (Next.js) where the entry point is implicit rather than explicit. Architecture and dependency accuracy remains perfect across all tested repos.

Token reduction — structure mode

Repo Raw tokens Blueprint tokens Reduction
Kara ~4,667 ~934 ~80%
Moodilist ~8,580 ~1,396 ~84%
Axios ~61,494 ~6,989 ~89%
Express ~17,222 ~707 ~96%
FastAPI ~145,606 ~124,993 ~14%
Flask ~77,402 ~13,848 ~82%
Requests ~49,556 ~9,585 ~81%
Urai ~ 17,418 ~2,304 ~87%
Average ~86%

FastAPI (14% reduction) excluded from average — as a framework repo its structure IS the content. Crespo correctly preserves it rather than discarding it.

Framework-heavy repos compress slightly less because the preserved structure is genuinely useful — there is less noise to discard.

Token Counting is done using tiktoken python library.

Compression Depends on Repo Type


Security

Crespo redacts secrets before writing any output. Patterns covered:

  • Quoted assignments — api_key = "...", token: '...'
  • Raw .env style — GROQ_KEY=abc123
  • Known key prefixes — Groq (gsk_), OpenAI (sk-), Anthropic (sk-ant-), GitHub (ghp_), AWS (AKIA), Slack (xox)

Groq Setup

Summary mode uses Groq to generate one-line descriptions per file and function. The free tier is more than enough.

# pass once — saved to ~/.crespo/config
crespo --groq YOUR_KEY

# all future summary runs pick it up automatically
crespo ./myproject --mode summary

Your key is stored locally at ~/.crespo/config and never sent anywhere except Groq's API.


Roadmap

  • Something for Humans coming soon!
  • More aggressive compression preset
  • More language support (Ruby, PHP, Swift, Kotlin)
  • .crespoignore support

Troubleshooting

crespo: command not found

This usually means Crespo was installed successfully, but the executable isn't on your system PATH.

Verify installation

python -m pip show crespo

If Crespo appears in the output but the crespo command still isn't recognized, it's a PATH issue — follow the steps below for your OS.

Windows

Add your Python Scripts directory to PATH (typically):

C:\Users\<you>\AppData\Local\Programs\Python\Python3x\Scripts

Restart your terminal afterwards.

macOS / Linux

Find your user scripts directory:

python3 -m site --user-base

Then add its bin folder to your shell configuration:

export PATH="$HOME/.local/bin:$PATH"

Restart your shell and try again.

Multiple Python installations

If you have multiple Python versions installed, make sure installation and execution use the same interpreter. Check which python/pip you're actually using:

where python      # Windows
which -a python3  # macOS / Linux

Reinstall using that same interpreter explicitly if needed:

python -m pip install --force-reinstall crespo

Recommended: use pipx

For CLI tools, pipx avoids most PATH-related issues entirely:

pipx install crespo
crespo ./myproject

Contributing

Contributions are welcome. If you have ideas for new output modes, better parsing, or additional language support, open an issue or PR.


License

MIT © Hrudul Krishna K V




Built by a CS student who kept hitting AI's free limit.
GitHub · PyPI

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

crespo-1.0.14.tar.gz (643.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

crespo-1.0.14-py3-none-any.whl (639.6 kB view details)

Uploaded Python 3

File details

Details for the file crespo-1.0.14.tar.gz.

File metadata

  • Download URL: crespo-1.0.14.tar.gz
  • Upload date:
  • Size: 643.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for crespo-1.0.14.tar.gz
Algorithm Hash digest
SHA256 49c60aa9591e8689647708bed3c3daca711fa49b9b9519534ab8dd62aecbfb53
MD5 87c15733bc8c1375b81f13ec18a08f5f
BLAKE2b-256 6f6c53a359fa4ddbcfaa7b4dbfcb85e8bf3ff93a8059cd82e395a5e1ed4657cd

See more details on using hashes here.

Provenance

The following attestation bundles were made for crespo-1.0.14.tar.gz:

Publisher: publish.yml on hrudulmmn/crespo

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file crespo-1.0.14-py3-none-any.whl.

File metadata

  • Download URL: crespo-1.0.14-py3-none-any.whl
  • Upload date:
  • Size: 639.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for crespo-1.0.14-py3-none-any.whl
Algorithm Hash digest
SHA256 fd6f39480fa9a341e0801047157b01e40b0f37fdad30c3cae1edf48867225894
MD5 403857be0632f938d036569e4c3d419c
BLAKE2b-256 0064e3dd1f12efdd4f67939fd24cfdc6589eea38a281ae7e741581ba3e2100af

See more details on using hashes here.

Provenance

The following attestation bundles were made for crespo-1.0.14-py3-none-any.whl:

Publisher: publish.yml on hrudulmmn/crespo

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page