Crisp repos. Sharp AI. Give AI the blueprint, not the code.
Project description
Give AI the blueprint, not the code.
Stop burning your context window on raw source files.
Crespo extracts what matters — and compresses everything else.
pip install crespo && crespo ./myproject
The Problem
You paste a codebase into any LLM. It hits the context limit but you problem remains unfinished. You paste one file at a time. The AI loses the big picture. You give AI the full code. The AI reads 40,000 tokens linearly and still misses the architecture.
Crespo solves this differently.
Instead of concatenating raw files, Crespo uses Tree-sitter AST parsing to extract only the structural DNA of your repository — imports, classes, functions, module connections — and emits a compact XML blueprint. Same architectural understanding. A fraction of the tokens.
Usage
# Structure mode (default)
crespo ./myproject
# Summary mode — requires Groq key
crespo ./myproject --mode summary --groq YOUR_KEY
# Save your Groq key for future runs
crespo --groq YOUR_KEY
# Concat mode — full source, redacted
crespo ./myproject --mode concat
# Analyse a GitHub repo directly
crespo --git https://github.com/user/repo
# Custom output filename
crespo ./myproject --output blueprint.xml
How It Works
your repo
│
▼
walker respects .gitignore · skips tests · skips build artifacts
│
▼
tree-sitter real AST parse · 10 languages · no regex
│
▼
extractor imports · classes · functions · structs · enums
│
▼
blueprint XML compact · structured · LLM-ready
No heuristics. No regex scraping. Real language grammars via Tree-sitter — the same parser used by GitHub, Neovim, and Zed.
Modes
| Mode | What it produces | Best for |
|---|---|---|
structure |
AST skeleton — imports, classes, functions | Architecture review, onboarding, LLM context |
summary |
Structure + AI one-line descriptions per file | Deeper codebase understanding |
concat |
Full source, secrets redacted, in structured XML | Passing entire repos to LLMs safely |
Languages supported
Python · JavaScript · TypeScript · JSX · TSX · Rust · Go · Java · C · C++
Example output
<?xml version='1.0' encoding='utf-8'?>
<repo n="kara" s="Gesture-controlled PDF viewer using PyQt6, MediaPipe, and OpenCV.">
<meta>
<dep>cv2,mediapipe,numpy,PyQt6,fitz,groq,markdown</dep>
</meta>
<files>
<f p="Ui.py" e=".py" s="Main PyQt6 window coordinating PDF rendering, gesture input, and AI summarisation.">
<imp>PyQt6,fitz,render,summarise,gesture,markdown</imp>
<cls n="Window">
<fn n="summary" p="(self)" />
<fn n="startGest" p="(self, state)" />
<fn n="gestZoom" p="(self, state: int)" />
</cls>
</f>
<f p="gesture.py" e=".py" s="MediaPipe hand tracking with gesture classification and debouncing.">
<imp>mediapipe,cv2,numpy</imp>
<cls n="GestureController">
<fn n="detect" p="(self, frame)" />
<fn n="classify" p="(self, landmarks)" />
</cls>
</f>
</files>
</repo>
Benchmarks
Tested on real open-source repositories. Structure accuracy evaluated by asking an LLM three questions from the blueprint alone — no access to the original source.
Structure mode accuracy
| Repo | Components & connections | Dependencies | Entry point | Score |
|---|---|---|---|---|
| Axios | ✅ correct and specific | ✅ correct and specific | ✅ correct and specific | 3/3 |
| Express | ✅ correct and specific | ✅ correct and specific | ✅ correct and specific | 3/3 |
| Kara | ✅ correct and specific | ✅ correct and specific | ✅ correct and specific | 3/3 |
| Moodilist | ✅ correct and specific | ✅ correct and specific | ✅ correct and specific | 3/3 |
| Requests | ✅ correct and specific | ✅ correct and specific | ⚠️ partially correct | 2/3 |
| Urai | ✅ correct and specific | ✅ correct and specific | ⚠️ partially correct | 2/3 |
| FastAPI | ✅ correct and specific | ✅ correct and specific | ⚠️ partially correct | 2/3 |
| Flask | ✅ correct and specific | ✅ correct and specific | ⚠️ partially correct | 2/3 |
| Average | 2.75 / 3 |
⚠️ Entry points are partially correct on framework-level repos (FastAPI, Flask) and convention-driven repos (Next.js) where the entry point is implicit rather than explicit. Architecture and dependency accuracy remains perfect across all tested repos.
Token reduction — structure mode
| Repo | Raw tokens | Blueprint tokens | Reduction |
|---|---|---|---|
| Kara | ~4,667 | ~934 | ~80% |
| Moodilist | ~8,580 | ~1,396 | ~84% |
| Axios | ~61,494 | ~6,989 | ~89% |
| Express | ~17,222 | ~707 | ~96% |
| FastAPI | ~145,606 | ~124,993 | ~14% |
| Flask | ~77,402 | ~13,848 | ~82% |
| Requests | ~49,556 | ~9,585 | ~81% |
| Urai | ~ 17,418 | ~2,304 | ~87% |
| Average | ~86% |
FastAPI (14% reduction) excluded from average — as a framework repo its structure IS the content. Crespo correctly preserves it rather than discarding it.
Framework-heavy repos compress slightly less because the preserved structure is genuinely useful — there is less noise to discard.
Token Counting is done using tiktoken python library.
Compression Depends on Repo Type
Security
Crespo redacts secrets before writing any output. Patterns covered:
- Quoted assignments —
api_key = "...",token: '...' - Raw
.envstyle —GROQ_KEY=abc123 - Known key prefixes — Groq (
gsk_), OpenAI (sk-), Anthropic (sk-ant-), GitHub (ghp_), AWS (AKIA), Slack (xox)
Groq Setup
Summary mode uses Groq to generate one-line descriptions per file and function. The free tier is more than enough.
# pass once — saved to ~/.crespo/config
crespo --groq YOUR_KEY
# all future summary runs pick it up automatically
crespo ./myproject --mode summary
Your key is stored locally at ~/.crespo/config and never sent anywhere except Groq's API.
Roadmap
- Something for Humans coming soon!
- More aggressive compression preset
- More language support (Ruby, PHP, Swift, Kotlin)
.crespoignoresupport
Troubleshooting
crespo: command not found
This usually means Crespo was installed successfully, but the executable isn't on your system PATH.
Verify installation
python -m pip show crespo
If Crespo appears in the output but the crespo command still isn't recognized, it's a PATH issue — follow the steps below for your OS.
Windows
Add your Python Scripts directory to PATH (typically):
C:\Users\<you>\AppData\Local\Programs\Python\Python3x\Scripts
Restart your terminal afterwards.
macOS / Linux
Find your user scripts directory:
python3 -m site --user-base
Then add its bin folder to your shell configuration:
export PATH="$HOME/.local/bin:$PATH"
Restart your shell and try again.
Multiple Python installations
If you have multiple Python versions installed, make sure installation and execution use the same interpreter. Check which python/pip you're actually using:
where python # Windows
which -a python3 # macOS / Linux
Reinstall using that same interpreter explicitly if needed:
python -m pip install --force-reinstall crespo
Recommended: use pipx
For CLI tools, pipx avoids most PATH-related issues entirely:
pipx install crespo
crespo ./myproject
Contributing
Contributions are welcome. If you have ideas for new output modes, better parsing, or additional language support, open an issue or PR.
License
MIT © Hrudul Krishna K V
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file crespo-1.0.15.tar.gz.
File metadata
- Download URL: crespo-1.0.15.tar.gz
- Upload date:
- Size: 643.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
97de8cfe93b9403a54cb135156fcd8443a29e03313cf6b971e3269f4876d2c5c
|
|
| MD5 |
e0f1b980beb4bdb455535b994861db48
|
|
| BLAKE2b-256 |
6b74f83ad8a897e5b1685deadf6524af487b8617f485ed35ace102f55e4631d7
|
Provenance
The following attestation bundles were made for crespo-1.0.15.tar.gz:
Publisher:
publish.yml on hrudulmmn/crespo
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
crespo-1.0.15.tar.gz -
Subject digest:
97de8cfe93b9403a54cb135156fcd8443a29e03313cf6b971e3269f4876d2c5c - Sigstore transparency entry: 1883147359
- Sigstore integration time:
-
Permalink:
hrudulmmn/crespo@35c9f1fdc388b294e6d7f3025544c20fdd391ad8 -
Branch / Tag:
- Owner: https://github.com/hrudulmmn
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@35c9f1fdc388b294e6d7f3025544c20fdd391ad8 -
Trigger Event:
release
-
Statement type:
File details
Details for the file crespo-1.0.15-py3-none-any.whl.
File metadata
- Download URL: crespo-1.0.15-py3-none-any.whl
- Upload date:
- Size: 639.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
868ce9e205478fa492b1dd398a83d9fc564df52bf97b29f6d76be6ec4d267496
|
|
| MD5 |
e58f991b1eefc5a6467d8f37bc64da6b
|
|
| BLAKE2b-256 |
29215e6496e9db6617825480a80d7339b2a92dc203c1dd0b1da2cc8eb32c4b9b
|
Provenance
The following attestation bundles were made for crespo-1.0.15-py3-none-any.whl:
Publisher:
publish.yml on hrudulmmn/crespo
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
crespo-1.0.15-py3-none-any.whl -
Subject digest:
868ce9e205478fa492b1dd398a83d9fc564df52bf97b29f6d76be6ec4d267496 - Sigstore transparency entry: 1883147805
- Sigstore integration time:
-
Permalink:
hrudulmmn/crespo@35c9f1fdc388b294e6d7f3025544c20fdd391ad8 -
Branch / Tag:
- Owner: https://github.com/hrudulmmn
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@35c9f1fdc388b294e6d7f3025544c20fdd391ad8 -
Trigger Event:
release
-
Statement type: