Skip to main content

The Context Optimization Layer for LLM Applications - Cut costs by 50-90%

Project description

  ██╗  ██╗███████╗ █████╗ ██████╗ ██████╗  ██████╗  ██████╗ ███╗   ███╗
  ██║  ██║██╔════╝██╔══██╗██╔══██╗██╔══██╗██╔═══██╗██╔═══██╗████╗ ████║
  ███████║█████╗  ███████║██║  ██║██████╔╝██║   ██║██║   ██║██╔████╔██║
  ██╔══██║██╔══╝  ██╔══██║██║  ██║██╔══██╗██║   ██║██║   ██║██║╚██╔╝██║
  ██║  ██║███████╗██║  ██║██████╔╝██║  ██║╚██████╔╝╚██████╔╝██║ ╚═╝ ██║
  ╚═╝  ╚═╝╚══════╝╚═╝  ╚═╝╚═════╝ ╚═╝  ╚═╝ ╚═════╝  ╚═════╝ ╚═╝     ╚═╝
                  The context compression layer for AI agents

60–95% fewer tokens · library · proxy · MCP · 6 algorithms · local-first · reversible

CI codecov PyPI npm Model: Kompress-base Tokens saved: 60B+ License: Apache 2.0 Docs

Docs · Install · Proof · Agents · Discord


Headroom compresses everything your AI agent reads — tool outputs, logs, RAG chunks, files, and conversation history — before it reaches the LLM. Same answers, fraction of the tokens.

Headroom in action
Live: 10,144 → 1,260 tokens — same FATAL found.

What it does

  • Librarycompress(messages) in Python or TypeScript, inline in any app
  • Proxyheadroom proxy --port 8787, zero code changes, any language
  • Agent wrapheadroom wrap claude|codex|cursor|aider|copilot in one command
  • MCP serverheadroom_compress, headroom_retrieve, headroom_stats for any MCP client
  • Cross-agent memory — shared store across Claude, Codex, Gemini, auto-dedup
  • headroom learn — mines failed sessions, writes corrections to CLAUDE.md / AGENTS.md
  • Reversible (CCR) — originals never deleted; LLM retrieves on demand

How it works (30 seconds)

 Your agent / app
   (Claude Code, Cursor, Codex, LangChain, Agno, Strands, your own code…)
        │   prompts · tool outputs · logs · RAG results · files
        ▼
    ┌────────────────────────────────────────────────────┐
    │  Headroom   (runs locally — your data stays here)  │
    │  ───────────────────────────────────────────────   │
    │  CacheAligner  →  ContentRouter  →  CCR             │
    │                    ├─ SmartCrusher   (JSON)         │
    │                    ├─ CodeCompressor (AST)          │
    │                    └─ Kompress-base  (text, HF)     │
    │                                                     │
    │  Cross-agent memory  ·  headroom learn  ·  MCP      │
    └────────────────────────────────────────────────────┘
        │   compressed prompt  +  retrieval tool
        ▼
 LLM provider  (Anthropic · OpenAI · Bedrock · …)
  • ContentRouter — detects content type, selects the right compressor
  • SmartCrusher / CodeCompressor / Kompress-base — compress JSON, AST, or prose
  • CacheAligner — stabilizes prefixes so provider KV caches actually hit
  • CCR — stores originals locally; LLM calls headroom_retrieve if it needs them

Architecture · CCR reversible compression · Kompress-base model card

Get started (60 seconds)

# 1 — Install
pip install "headroom-ai[all]"          # Python
npm install headroom-ai                 # Node / TypeScript

# 2 — Pick your mode
headroom wrap claude                    # wrap a coding agent
headroom proxy --port 8787              # drop-in proxy, zero code changes
# or: from headroom import compress      # inline library

# 3 — See the savings
headroom stats

Granular extras: [proxy], [mcp], [ml], [agno], [langchain], [evals]. Requires Python 3.10+.

Proof

Savings on real agent workloads:

Workload Before After Savings
Code search (100 results) 17,765 1,408 92%
SRE incident debugging 65,694 5,118 92%
GitHub issue triage 54,174 14,761 73%
Codebase exploration 78,502 41,254 47%

Accuracy preserved on standard benchmarks:

Benchmark Category N Baseline Headroom Delta
GSM8K Math 100 0.870 0.870 ±0.000
TruthfulQA Factual 100 0.530 0.560 +0.030
SQuAD v2 QA 100 97% 19% compression
BFCL Tools 100 97% 32% compression

Reproduce: python -m headroom.evals suite --tier 1 · Full benchmarks & methodology

60B+ tokens saved — community leaderboard
60B+ tokens saved by the community — live leaderboard →

Agent compatibility matrix

Agent headroom wrap Notes
Claude Code --memory · --code-graph
Codex shares memory with Claude
Cursor prints config — paste once
Aider starts proxy + launches
Copilot CLI starts proxy + launches
OpenClaw installs as ContextEngine plugin

Any OpenAI-compatible client works via headroom proxy. MCP-native: headroom mcp install.

When to use · When to skip

Great fit if you…

  • run AI coding agents daily and want savings without changing your code
  • work across multiple agents and want shared memory
  • need reversible compression — originals always retrievable via CCR

Skip it if you…

  • only use a single provider's native compaction and don't need cross-agent memory
  • work in a sandboxed environment where local processes can't run
Integrations — drop Headroom into any stack
Your setup Hook in with
Any Python app compress(messages, model=…)
Any TypeScript app await compress(messages, { model })
Anthropic / OpenAI SDK withHeadroom(new Anthropic()) · withHeadroom(new OpenAI())
Vercel AI SDK wrapLanguageModel({ model, middleware: headroomMiddleware() })
LiteLLM litellm.callbacks = [HeadroomCallback()]
LangChain HeadroomChatModel(your_llm)
Agno HeadroomAgnoModel(your_model)
Strands Strands guide
ASGI apps app.add_middleware(CompressionMiddleware)
Multi-agent SharedContext().put / .get
MCP clients headroom mcp install
What's inside
  • SmartCrusher — universal JSON: arrays of dicts, nested objects, mixed types.
  • CodeCompressor — AST-aware for Python, JS, Go, Rust, Java, C++.
  • Kompress-base — our HuggingFace model, trained on agentic traces.
  • Image compression — 40–90% reduction via trained ML router.
  • CacheAligner — stabilizes prefixes so Anthropic/OpenAI KV caches actually hit.
  • IntelligentContext — score-based context fitting with learned importance.
  • CCR — reversible compression; LLM retrieves originals on demand.
  • Cross-agent memory — shared store, agent provenance, auto-dedup.
  • SharedContext — compressed context passing across multi-agent workflows.
  • headroom learn — plugin-based failure mining for Claude, Codex, Gemini.
Pipeline internals

Headroom exposes one stable request lifecycle across compress(), the SDK, and the proxy:

SetupPre-StartPost-StartInput ReceivedInput CachedInput RoutedInput CompressedInput RememberedPre-SendPost-SendResponse Received

  • Transforms do the work: CacheAligner, ContentRouter, SmartCrusher, CodeCompressor, Kompress-base, IntelligentContext / RollingWindow.
  • Pipeline extensions observe or customize lifecycle stages via on_pipeline_event(...).
  • Compression hooks sit alongside the canonical lifecycle as an additional extension seam.
  • Proxy extensions remain the server/app integration seam for ASGI middleware, routes, and startup policy.

Provider and tool-specific behavior lives under headroom/providers/ so core orchestration stays focused on lifecycle, sequencing, and policy.

  • CLI/tool slices: headroom/providers/claude, copilot, codex, openclaw
  • Provider runtime slices: headroom/providers/claude, gemini, plus shared backend/runtime dispatch in headroom/providers/registry.py
  • Core files stay orchestration-first: wrap.py, client.py, cli/proxy.py, and proxy/server.py delegate provider-specific env shaping, API target normalization, backend selection, and transport dispatch.

Install

pip install "headroom-ai[all]"          # Python, everything
npm install headroom-ai                 # TypeScript / Node
docker pull ghcr.io/chopratejas/headroom:latest

Granular extras: [proxy], [mcp], [ml] (Kompress-base), [agno], [langchain], [evals]. Requires Python 3.10+.

Using pipx? Choose a supported interpreter explicitly:

pipx install --python python3.13 "headroom-ai[all]"

Installation guide — Docker tags, persistent service, PowerShell, devcontainers.

headroom learn

headroom learn in action

headroom learn — mines failed sessions, writes corrections to CLAUDE.md / AGENTS.md / GEMINI.md.

Documentation

Start here Go deeper
Quickstart Architecture
Proxy How compression works
MCP tools CCR — reversible compression
Memory Cache optimization
Failure learning Benchmarks
Configuration Limitations

Compared to

Headroom runs locally, covers every content type, works with every major framework, and is reversible.

Scope Deploy Local Reversible
Headroom All context — tools, RAG, logs, files, history Proxy · library · middleware · MCP Yes Yes
RTK CLI command outputs CLI wrapper Yes No
lean-ctx CLI commands, MCP tools, editor rules CLI wrapper · MCP Yes No
Compresr, Token Co. Text sent to their API Hosted API call No No
OpenAI Compaction Conversation history Provider-native No No

Attribution. Headroom ships with the excellent RTK binary for shell-output rewriting — git show --short, scoped ls, summarized installers. Huge thanks to the RTK team; their tool is a first-class part of our stack, and Headroom compresses everything downstream of it. Headroom can also use lean-ctx as the selected CLI context tool; set HEADROOM_CONTEXT_TOOL=lean-ctx before running headroom wrap ....

Contributing

git clone https://github.com/chopratejas/headroom.git && cd headroom
pip install -e ".[dev]" && pytest

Devcontainers in .devcontainer/ (default + memory-stack with Qdrant & Neo4j). See CONTRIBUTING.md.

Community

License

Apache 2.0 — see LICENSE.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

headroom_ai-0.21.34.tar.gz (1.6 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

headroom_ai-0.21.34-cp313-cp313-manylinux_2_28_x86_64.whl (18.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.34-cp313-cp313-manylinux_2_28_aarch64.whl (19.7 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.34-cp313-cp313-macosx_11_0_arm64.whl (17.4 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

headroom_ai-0.21.34-cp312-cp312-manylinux_2_28_x86_64.whl (18.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.34-cp312-cp312-manylinux_2_28_aarch64.whl (19.7 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.34-cp312-cp312-macosx_11_0_arm64.whl (17.4 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

headroom_ai-0.21.34-cp311-cp311-manylinux_2_28_x86_64.whl (18.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.34-cp311-cp311-manylinux_2_28_aarch64.whl (19.7 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.34-cp311-cp311-macosx_11_0_arm64.whl (17.4 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

headroom_ai-0.21.34-cp310-cp310-manylinux_2_28_x86_64.whl (18.6 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.34-cp310-cp310-manylinux_2_28_aarch64.whl (19.7 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.34-cp310-cp310-macosx_11_0_arm64.whl (17.4 MB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

File details

Details for the file headroom_ai-0.21.34.tar.gz.

File metadata

  • Download URL: headroom_ai-0.21.34.tar.gz
  • Upload date:
  • Size: 1.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for headroom_ai-0.21.34.tar.gz
Algorithm Hash digest
SHA256 d316c017d6c0f6c46f44f90ae7f8e48860666193860e9b9ad4478cf59a45eaaa
MD5 7748c1353318968878afb90d0af46354
BLAKE2b-256 388485daa8bf0c44e7b8e8871e17f6104b3c62b3d5c5f2d7b5b96ab4142857c6

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34.tar.gz:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 4501bf239e915e152f65a9382c5614773b797ce5e2ec7b036739a37c0e620cd7
MD5 83b0098444999df560d49500416e920d
BLAKE2b-256 2fec8be48f0a35d1215c60bc9a62ca10cf1772ad4d276765c1b570a08ae4e375

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp313-cp313-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 878f98d39c9e74a4118009b9de3ed178570fcbd2d008d92e89347709628699d2
MD5 bf9283b8ceb178f46695982e5719e2d1
BLAKE2b-256 7ac0bf22e984db3044a044847ce16b61d15b54bb6e387b309328e372a3e96ef1

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp313-cp313-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 0c2eed8fb394eaec0cbe4605d8ee6b119d12c7a745a52a32c4047c95c9c60b02
MD5 5ce42a231e87da071ae7cd799f873a2b
BLAKE2b-256 d6798bc98de1a862459653c9ec5d27e1418106852b4bc2e54086673f88d3de43

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp313-cp313-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 135b645b900cab15e2cece2d44504d4cf9ffa9b783d190ad5c8b3f3bd4e1e069
MD5 9f80c353d8ad1e03ddee4aa4e3cdc9ce
BLAKE2b-256 bef07e3f6b033a54675c39d373f8eefb2a07bf41a848a118fc1dff17155cd0c4

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp312-cp312-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 1281d7efcbb7d8743f5db06e952d1a200db09717ab244c9a5dd6e676a90f4b32
MD5 254934abf866df2de3a5e3868a6a3230
BLAKE2b-256 ca0f1279adaf2f19bb54a57f41d5f952b486b44746c7d6948777b8a24eec30e4

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp312-cp312-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 6f3c54b1699e6532e4b0fce2c4b5063ad65bd198373e46b42fd59fd4da2d2176
MD5 b400bc6c592374b9fe038f00b8945f05
BLAKE2b-256 69274a281f6dbb6cb495956d620b9ada0ed87aedde030b1c48661f3a71217703

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp312-cp312-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 fdf1b82d2cb7a972c393a8b51cbe116502816e36d38ae0206ce6516e96d70607
MD5 041156f88adefd0664a165362e750018
BLAKE2b-256 30b08fcc254a8e6040c30f4f941f3e6475858f4071c1b04c9f05418a898f0185

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp311-cp311-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 a0eb3b9393e4d2a5388188cbc76afdded0954afbdbe1b641b11fc82edc57bbf6
MD5 6c6293e4a2549e1e2d39605a020e8210
BLAKE2b-256 6dbaf18eff671b775fb296ce5cd02b9f1bf330bfca6ddf0469967845f2288c70

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp311-cp311-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 0e81a54a51cce41d15ca31bc606efb7574e52f9292e372eea36a8ac4a944002b
MD5 889158007065e76e1b4b181ce249f1a6
BLAKE2b-256 bb28894fb0711e90c26288f7efed5614e0978089d09ab1a29453df7fd6db88b8

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp311-cp311-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0472377c878bc0255b9b233fc17fedba88efc612418c6d482d253dccff08a214
MD5 f9281614822bcb39a8bffac46f09bad8
BLAKE2b-256 e4d3cc98e579ada22439db5a55d59817a3f2f100a63bbf744a2b8706ede4e598

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp310-cp310-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 10e24af04833579aceda749a785481c855880e0d3d8c64d542bd5da27ceb61dd
MD5 923930ab7b957a17c095928ddf3f0fc7
BLAKE2b-256 8be217c164e1167e6f571e7cac0ed00188c63493c16d930cb8b923837e858845

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp310-cp310-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.34-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.34-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 77e6e3a7c390ce18e3fc65beb0ed19666d25f55814c9600de4fee492fb0c5493
MD5 668bd9b6cd530af2a8f6f979041930a5
BLAKE2b-256 45918f250567590f511b2540f813f636e3ccb532349b99abcd298cc325d8eb9f

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.34-cp310-cp310-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page