Skip to main content

The Context Optimization Layer for LLM Applications - Cut costs by 50-90%

Project description

  ██╗  ██╗███████╗ █████╗ ██████╗ ██████╗  ██████╗  ██████╗ ███╗   ███╗
  ██║  ██║██╔════╝██╔══██╗██╔══██╗██╔══██╗██╔═══██╗██╔═══██╗████╗ ████║
  ███████║█████╗  ███████║██║  ██║██████╔╝██║   ██║██║   ██║██╔████╔██║
  ██╔══██║██╔══╝  ██╔══██║██║  ██║██╔══██╗██║   ██║██║   ██║██║╚██╔╝██║
  ██║  ██║███████╗██║  ██║██████╔╝██║  ██║╚██████╔╝╚██████╔╝██║ ╚═╝ ██║
  ╚═╝  ╚═╝╚══════╝╚═╝  ╚═╝╚═════╝ ╚═╝  ╚═╝ ╚═════╝  ╚═════╝ ╚═╝     ╚═╝
                  The context compression layer for AI agents

60–95% fewer tokens · library · proxy · MCP · 6 algorithms · local-first · reversible

CI codecov PyPI npm Model: Kompress-base Tokens saved: 60B+ License: Apache 2.0 Docs

Docs · Install · Proof · Agents · Discord · llms.txt

AI agents / LLMs: read /llms.txt here, or fetch the live index / full docs blob.


Headroom compresses everything your AI agent reads — tool outputs, logs, RAG chunks, files, and conversation history — before it reaches the LLM. Same answers, fraction of the tokens.

Headroom in action
Live: 10,144 → 1,260 tokens — same FATAL found.

What it does

  • Librarycompress(messages) in Python or TypeScript, inline in any app
  • Proxyheadroom proxy --port 8787, zero code changes, any language
  • Agent wrapheadroom wrap claude|codex|cursor|aider|copilot in one command
  • MCP serverheadroom_compress, headroom_retrieve, headroom_stats for any MCP client
  • Cross-agent memory — shared store across Claude, Codex, Gemini, auto-dedup
  • headroom learn — mines failed sessions, writes corrections to CLAUDE.md / AGENTS.md
  • Reversible (CCR) — originals never deleted; LLM retrieves on demand

How it works (30 seconds)

 Your agent / app
   (Claude Code, Cursor, Codex, LangChain, Agno, Strands, your own code…)
        │   prompts · tool outputs · logs · RAG results · files
        ▼
    ┌────────────────────────────────────────────────────┐
    │  Headroom   (runs locally — your data stays here)  │
    │  ───────────────────────────────────────────────   │
    │  CacheAligner  →  ContentRouter  →  CCR             │
    │                    ├─ SmartCrusher   (JSON)         │
    │                    ├─ CodeCompressor (AST)          │
    │                    └─ Kompress-base  (text, HF)     │
    │                                                     │
    │  Cross-agent memory  ·  headroom learn  ·  MCP      │
    └────────────────────────────────────────────────────┘
        │   compressed prompt  +  retrieval tool
        ▼
 LLM provider  (Anthropic · OpenAI · Bedrock · …)
  • ContentRouter — detects content type, selects the right compressor
  • SmartCrusher / CodeCompressor / Kompress-base — compress JSON, AST, or prose
  • CacheAligner — stabilizes prefixes so provider KV caches actually hit
  • CCR — stores originals locally; LLM calls headroom_retrieve if it needs them

Architecture · CCR reversible compression · Kompress-base model card

Get started (60 seconds)

# 1 — Install
pip install "headroom-ai[all]"          # Python
npm install headroom-ai                 # Node / TypeScript

# 2 — Pick your mode
headroom wrap claude                    # wrap a coding agent
headroom proxy --port 8787              # drop-in proxy, zero code changes
# or: from headroom import compress      # inline library

# 3 — See the savings
headroom stats

Granular extras: [proxy], [mcp], [ml], [agno], [langchain], [evals]. Requires Python 3.10+.

Proof

Savings on real agent workloads:

Workload Before After Savings
Code search (100 results) 17,765 1,408 92%
SRE incident debugging 65,694 5,118 92%
GitHub issue triage 54,174 14,761 73%
Codebase exploration 78,502 41,254 47%

Accuracy preserved on standard benchmarks:

Benchmark Category N Baseline Headroom Delta
GSM8K Math 100 0.870 0.870 ±0.000
TruthfulQA Factual 100 0.530 0.560 +0.030
SQuAD v2 QA 100 97% 19% compression
BFCL Tools 100 97% 32% compression

Reproduce: python -m headroom.evals suite --tier 1 · Full benchmarks & methodology

60B+ tokens saved — community leaderboard
60B+ tokens saved by the community — live leaderboard →

Agent compatibility matrix

Agent headroom wrap Notes
Claude Code --memory · --code-graph
Codex shares memory with Claude
Cursor prints config — paste once
Aider starts proxy + launches
Copilot CLI starts proxy + launches
OpenClaw installs as ContextEngine plugin

Any OpenAI-compatible client works via headroom proxy. MCP-native: headroom mcp install.

When to use · When to skip

Great fit if you…

  • run AI coding agents daily and want savings without changing your code
  • work across multiple agents and want shared memory
  • need reversible compression — originals always retrievable via CCR

Skip it if you…

  • only use a single provider's native compaction and don't need cross-agent memory
  • work in a sandboxed environment where local processes can't run
Integrations — drop Headroom into any stack
Your setup Hook in with
Any Python app compress(messages, model=…)
Any TypeScript app await compress(messages, { model })
Anthropic / OpenAI SDK withHeadroom(new Anthropic()) · withHeadroom(new OpenAI())
Vercel AI SDK wrapLanguageModel({ model, middleware: headroomMiddleware() })
LiteLLM litellm.callbacks = [HeadroomCallback()]
LangChain HeadroomChatModel(your_llm)
Agno HeadroomAgnoModel(your_model)
Strands Strands guide
ASGI apps app.add_middleware(CompressionMiddleware)
Multi-agent SharedContext().put / .get
MCP clients headroom mcp install
What's inside
  • SmartCrusher — universal JSON: arrays of dicts, nested objects, mixed types.
  • CodeCompressor — AST-aware for Python, JS, Go, Rust, Java, C++.
  • Kompress-base — our HuggingFace model, trained on agentic traces.
  • Image compression — 40–90% reduction via trained ML router.
  • CacheAligner — stabilizes prefixes so Anthropic/OpenAI KV caches actually hit.
  • IntelligentContext — score-based context fitting with learned importance.
  • CCR — reversible compression; LLM retrieves originals on demand.
  • Cross-agent memory — shared store, agent provenance, auto-dedup.
  • SharedContext — compressed context passing across multi-agent workflows.
  • headroom learn — plugin-based failure mining for Claude, Codex, Gemini.
Pipeline internals

Headroom exposes one stable request lifecycle across compress(), the SDK, and the proxy:

SetupPre-StartPost-StartInput ReceivedInput CachedInput RoutedInput CompressedInput RememberedPre-SendPost-SendResponse Received

  • Transforms do the work: CacheAligner, ContentRouter, SmartCrusher, CodeCompressor, Kompress-base, IntelligentContext / RollingWindow.
  • Pipeline extensions observe or customize lifecycle stages via on_pipeline_event(...).
  • Compression hooks sit alongside the canonical lifecycle as an additional extension seam.
  • Proxy extensions remain the server/app integration seam for ASGI middleware, routes, and startup policy.

Provider and tool-specific behavior lives under headroom/providers/ so core orchestration stays focused on lifecycle, sequencing, and policy.

  • CLI/tool slices: headroom/providers/claude, copilot, codex, openclaw
  • Provider runtime slices: headroom/providers/claude, gemini, plus shared backend/runtime dispatch in headroom/providers/registry.py
  • Core files stay orchestration-first: wrap.py, client.py, cli/proxy.py, and proxy/server.py delegate provider-specific env shaping, API target normalization, backend selection, and transport dispatch.

Install

pip install "headroom-ai[all]"          # Python, everything
npm install headroom-ai                 # TypeScript / Node
docker pull ghcr.io/chopratejas/headroom:latest

Granular extras: [proxy], [mcp], [ml] (Kompress-base), [agno], [langchain], [evals]. Requires Python 3.10+.

Using pipx? Choose a supported interpreter explicitly:

pipx install --python python3.13 "headroom-ai[all]"

Installation guide — Docker tags, persistent service, PowerShell, devcontainers.

headroom learn

headroom learn in action

headroom learn — mines failed sessions, writes corrections to CLAUDE.md / AGENTS.md / GEMINI.md.

Documentation

Start here Go deeper
Quickstart Architecture
Proxy How compression works
MCP tools CCR — reversible compression
Memory Cache optimization
Failure learning Benchmarks
Configuration Limitations

Compared to

Headroom runs locally, covers every content type, works with every major framework, and is reversible.

Scope Deploy Local Reversible
Headroom All context — tools, RAG, logs, files, history Proxy · library · middleware · MCP Yes Yes
RTK CLI command outputs CLI wrapper Yes No
lean-ctx CLI commands, MCP tools, editor rules CLI wrapper · MCP Yes No
Compresr, Token Co. Text sent to their API Hosted API call No No
OpenAI Compaction Conversation history Provider-native No No

Attribution. Headroom ships with the excellent RTK binary for shell-output rewriting — git show --short, scoped ls, summarized installers. Huge thanks to the RTK team; their tool is a first-class part of our stack, and Headroom compresses everything downstream of it. Headroom can also use lean-ctx as the selected CLI context tool; set HEADROOM_CONTEXT_TOOL=lean-ctx before running headroom wrap ....

Contributing

git clone https://github.com/chopratejas/headroom.git && cd headroom
pip install -e ".[dev]" && pytest

Devcontainers in .devcontainer/ (default + memory-stack with Qdrant & Neo4j). See CONTRIBUTING.md.

Community

License

Apache 2.0 — see LICENSE.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

headroom_ai-0.22.4.tar.gz (1.6 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

headroom_ai-0.22.4-cp313-cp313-manylinux_2_28_x86_64.whl (17.3 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

headroom_ai-0.22.4-cp313-cp313-manylinux_2_28_aarch64.whl (18.1 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

headroom_ai-0.22.4-cp313-cp313-macosx_11_0_arm64.whl (16.1 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

headroom_ai-0.22.4-cp312-cp312-manylinux_2_28_x86_64.whl (17.3 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

headroom_ai-0.22.4-cp312-cp312-manylinux_2_28_aarch64.whl (18.1 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

headroom_ai-0.22.4-cp312-cp312-macosx_11_0_arm64.whl (16.1 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

headroom_ai-0.22.4-cp311-cp311-manylinux_2_28_x86_64.whl (17.3 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

headroom_ai-0.22.4-cp311-cp311-manylinux_2_28_aarch64.whl (18.1 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

headroom_ai-0.22.4-cp311-cp311-macosx_11_0_arm64.whl (16.1 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

headroom_ai-0.22.4-cp310-cp310-manylinux_2_28_x86_64.whl (17.3 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

headroom_ai-0.22.4-cp310-cp310-manylinux_2_28_aarch64.whl (18.1 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

headroom_ai-0.22.4-cp310-cp310-macosx_11_0_arm64.whl (16.1 MB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

File details

Details for the file headroom_ai-0.22.4.tar.gz.

File metadata

  • Download URL: headroom_ai-0.22.4.tar.gz
  • Upload date:
  • Size: 1.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for headroom_ai-0.22.4.tar.gz
Algorithm Hash digest
SHA256 1e81e2438083da226b57db6c1a747a0dd6317bee9dbaf6a231663bd8eaebf97b
MD5 8d692291722a70ab3b21b89b1041a336
BLAKE2b-256 2e4aa34939c58f7f2067687497a8befeb6b9bb859deb8a27f2bc632f2020ada9

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4.tar.gz:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 bd507859464b87f4db95fb106c80d44ba7b8a7562964eece5d3fd87cd633fbda
MD5 e7c9305643fd5f9e3001914db753155f
BLAKE2b-256 4dbe884eae8653143d5a2dab5d7da6eaefef654fe473dcdf86b263df95923d39

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp313-cp313-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 c6ce5250a9f800d30196b2185871a96355d72e0d10c3648361fd344a5c86d27f
MD5 f22e0a1277786b26e4a8032ec556cf3e
BLAKE2b-256 ae7e795766211f9140deb873aa03a3be7855a5b4edd1027a9505512e2a91bf18

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp313-cp313-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 3238051a3f8fab9a951e72c9ebf5b53307c907e675d98563273b37554bf1e04e
MD5 9592a3074fdb8dc422e5496ca8c84440
BLAKE2b-256 88912d1303836bfc54d66482edefb1b4842b79afe67f837bc25754cf3c448519

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp313-cp313-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 14dd28ff4306ac82065dc578a00e49db80ad8e9991f32544909490b060e856d9
MD5 371b5534c6b74f20eccb7fb7431bda2d
BLAKE2b-256 ef41de501ebabcb4d9596a5880351be88b8f801998a34b81fde6f3e2ea8fe33d

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp312-cp312-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 b5a2f2d869a4b7966d70a04a2dd9317c1367830501b122b7b96cf49cfe3e7950
MD5 ee33d1a2ae49b6c47e6d15a1168b86d8
BLAKE2b-256 002116eff2a2826e5d11d20856e1f3d4e170c096c63812c79ee43bc31e86deb9

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp312-cp312-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 20d4f330918af2316a3124fcf9f8f1858a868fd2d90a75bf8bba1265700af5a6
MD5 a17a9e5d3f61b648393a800cba9db99c
BLAKE2b-256 ca60b2bcb81a933d3d613ca4b80174a8ccba0dd7f8ad06af346f3a4751b997b9

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp312-cp312-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0a3cd1af7610b4358db6fcfedb5a61a2c922cb5aac5e3e2954ed5f9be42ee336
MD5 14718a74afbf753f192dd838e279ddf9
BLAKE2b-256 304ea76b245100c7d188a3ab8514d3f5edf5b3a56a1a5d52e444eefcf124125d

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp311-cp311-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 4cdea8eb42a731bf36affb52eaf51a2e539f4f5970db1246d3990c03617c9647
MD5 07559a076717167baaf8e3c39bc4b71c
BLAKE2b-256 b749ef4091ee98a6fa50125014b20a74c894644f098f6acdeb409e231a8714ae

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp311-cp311-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 04db357f873a6ec1b247cb22734b55dc89076b100e8b0fe0bc2aaa69f6bb5153
MD5 406fcac439b0bc5c518101d34f085789
BLAKE2b-256 6939422dcbd081b5133c0cbf9da2bbe48fa8b9706420ebe02e8abf798b5505b2

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp311-cp311-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0942c5d45c6f94bb2cc7b094a323175bae262fdc3d38ab088e27aea6cc5186f3
MD5 d8b6a0fedaa81a676c659782409ff649
BLAKE2b-256 6b3062e09b7c92c703a5da0583d84333992ab154a5f2ab0d05879c9959327a42

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp310-cp310-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 9de6371e4df8134f91d7aee9ee7b72ec75026445c84ac2be3b88c16b4e4c1ea9
MD5 af4e7b96f444bce3a29e0e1081e7cd79
BLAKE2b-256 6b29401539f625fc42fac45e6e896de2175074f8f2da25a185ce0e557796cd05

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp310-cp310-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.22.4-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.22.4-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 e420cea10a8267cfcf0cf3fdd950d6239f1cba38e0b9b196854e417e68a54b04
MD5 cff8c7c3a7c647bb1db12b5f0e16491c
BLAKE2b-256 66456ed6c1772d33d49b2fdba1fa35fe941751e7ed58e6ecc9c732ce0f9fb705

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.22.4-cp310-cp310-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page