Skip to main content

The Context Optimization Layer for LLM Applications - Cut costs by 50-90%

Project description

  ██╗  ██╗███████╗ █████╗ ██████╗ ██████╗  ██████╗  ██████╗ ███╗   ███╗
  ██║  ██║██╔════╝██╔══██╗██╔══██╗██╔══██╗██╔═══██╗██╔═══██╗████╗ ████║
  ███████║█████╗  ███████║██║  ██║██████╔╝██║   ██║██║   ██║██╔████╔██║
  ██╔══██║██╔══╝  ██╔══██║██║  ██║██╔══██╗██║   ██║██║   ██║██║╚██╔╝██║
  ██║  ██║███████╗██║  ██║██████╔╝██║  ██║╚██████╔╝╚██████╔╝██║ ╚═╝ ██║
  ╚═╝  ╚═╝╚══════╝╚═╝  ╚═╝╚═════╝ ╚═╝  ╚═╝ ╚═════╝  ╚═════╝ ╚═╝     ╚═╝
                  The context compression layer for AI agents

60–95% fewer tokens · library · proxy · MCP · 6 algorithms · local-first · reversible

CI codecov PyPI npm Model: Kompress-base Tokens saved: 60B+ License: Apache 2.0 Docs

Docs · Install · Proof · Agents · Discord · llms.txt

AI agents / LLMs: read /llms.txt here, or fetch the live index / full docs blob.


Headroom compresses everything your AI agent reads — tool outputs, logs, RAG chunks, files, and conversation history — before it reaches the LLM. Same answers, fraction of the tokens.

Headroom in action
Live: 10,144 → 1,260 tokens — same FATAL found.

What it does

  • Librarycompress(messages) in Python or TypeScript, inline in any app
  • Proxyheadroom proxy --port 8787, zero code changes, any language
  • Agent wrapheadroom wrap claude|codex|cursor|aider|copilot in one command
  • MCP serverheadroom_compress, headroom_retrieve, headroom_stats for any MCP client
  • Cross-agent memory — shared store across Claude, Codex, Gemini, auto-dedup
  • headroom learn — mines failed sessions, writes corrections to CLAUDE.md / AGENTS.md
  • Reversible (CCR) — originals never deleted; LLM retrieves on demand

How it works (30 seconds)

 Your agent / app
   (Claude Code, Cursor, Codex, LangChain, Agno, Strands, your own code…)
        │   prompts · tool outputs · logs · RAG results · files
        ▼
    ┌────────────────────────────────────────────────────┐
    │  Headroom   (runs locally — your data stays here)  │
    │  ───────────────────────────────────────────────   │
    │  CacheAligner  →  ContentRouter  →  CCR             │
    │                    ├─ SmartCrusher   (JSON)         │
    │                    ├─ CodeCompressor (AST)          │
    │                    └─ Kompress-base  (text, HF)     │
    │                                                     │
    │  Cross-agent memory  ·  headroom learn  ·  MCP      │
    └────────────────────────────────────────────────────┘
        │   compressed prompt  +  retrieval tool
        ▼
 LLM provider  (Anthropic · OpenAI · Bedrock · …)
  • ContentRouter — detects content type, selects the right compressor
  • SmartCrusher / CodeCompressor / Kompress-base — compress JSON, AST, or prose
  • CacheAligner — stabilizes prefixes so provider KV caches actually hit
  • CCR — stores originals locally; LLM calls headroom_retrieve if it needs them

Architecture · CCR reversible compression · Kompress-base model card

Get started (60 seconds)

# 1 — Install
pip install "headroom-ai[all]"          # Python
npm install headroom-ai                 # Node / TypeScript

# 2 — Pick your mode
headroom wrap claude                    # wrap a coding agent
headroom proxy --port 8787              # drop-in proxy, zero code changes
# or: from headroom import compress      # inline library

# 3 — See the savings
headroom stats

Granular extras: [proxy], [mcp], [ml], [agno], [langchain], [evals]. Requires Python 3.10+.

Proof

Savings on real agent workloads:

Workload Before After Savings
Code search (100 results) 17,765 1,408 92%
SRE incident debugging 65,694 5,118 92%
GitHub issue triage 54,174 14,761 73%
Codebase exploration 78,502 41,254 47%

Accuracy preserved on standard benchmarks:

Benchmark Category N Baseline Headroom Delta
GSM8K Math 100 0.870 0.870 ±0.000
TruthfulQA Factual 100 0.530 0.560 +0.030
SQuAD v2 QA 100 97% 19% compression
BFCL Tools 100 97% 32% compression

Reproduce: python -m headroom.evals suite --tier 1 · Full benchmarks & methodology

60B+ tokens saved — community leaderboard
60B+ tokens saved by the community — live leaderboard →

Agent compatibility matrix

Agent headroom wrap Notes
Claude Code --memory · --code-graph
Codex shares memory with Claude
Cursor prints config — paste once
Aider starts proxy + launches
Copilot CLI starts proxy + launches
OpenClaw installs as ContextEngine plugin

Any OpenAI-compatible client works via headroom proxy. MCP-native: headroom mcp install.

When to use · When to skip

Great fit if you…

  • run AI coding agents daily and want savings without changing your code
  • work across multiple agents and want shared memory
  • need reversible compression — originals always retrievable via CCR

Skip it if you…

  • only use a single provider's native compaction and don't need cross-agent memory
  • work in a sandboxed environment where local processes can't run
Integrations — drop Headroom into any stack
Your setup Hook in with
Any Python app compress(messages, model=…)
Any TypeScript app await compress(messages, { model })
Anthropic / OpenAI SDK withHeadroom(new Anthropic()) · withHeadroom(new OpenAI())
Vercel AI SDK wrapLanguageModel({ model, middleware: headroomMiddleware() })
LiteLLM litellm.callbacks = [HeadroomCallback()]
LangChain HeadroomChatModel(your_llm)
Agno HeadroomAgnoModel(your_model)
Strands Strands guide
ASGI apps app.add_middleware(CompressionMiddleware)
Multi-agent SharedContext().put / .get
MCP clients headroom mcp install
What's inside
  • SmartCrusher — universal JSON: arrays of dicts, nested objects, mixed types.
  • CodeCompressor — AST-aware for Python, JS, Go, Rust, Java, C++.
  • Kompress-base — our HuggingFace model, trained on agentic traces.
  • Image compression — 40–90% reduction via trained ML router.
  • CacheAligner — stabilizes prefixes so Anthropic/OpenAI KV caches actually hit.
  • IntelligentContext — score-based context fitting with learned importance.
  • CCR — reversible compression; LLM retrieves originals on demand.
  • Cross-agent memory — shared store, agent provenance, auto-dedup.
  • SharedContext — compressed context passing across multi-agent workflows.
  • headroom learn — plugin-based failure mining for Claude, Codex, Gemini.
Pipeline internals

Headroom exposes one stable request lifecycle across compress(), the SDK, and the proxy:

SetupPre-StartPost-StartInput ReceivedInput CachedInput RoutedInput CompressedInput RememberedPre-SendPost-SendResponse Received

  • Transforms do the work: CacheAligner, ContentRouter, SmartCrusher, CodeCompressor, Kompress-base, IntelligentContext / RollingWindow.
  • Pipeline extensions observe or customize lifecycle stages via on_pipeline_event(...).
  • Compression hooks sit alongside the canonical lifecycle as an additional extension seam.
  • Proxy extensions remain the server/app integration seam for ASGI middleware, routes, and startup policy.

Provider and tool-specific behavior lives under headroom/providers/ so core orchestration stays focused on lifecycle, sequencing, and policy.

  • CLI/tool slices: headroom/providers/claude, copilot, codex, openclaw
  • Provider runtime slices: headroom/providers/claude, gemini, plus shared backend/runtime dispatch in headroom/providers/registry.py
  • Core files stay orchestration-first: wrap.py, client.py, cli/proxy.py, and proxy/server.py delegate provider-specific env shaping, API target normalization, backend selection, and transport dispatch.

Install

pip install "headroom-ai[all]"          # Python, everything
npm install headroom-ai                 # TypeScript / Node
docker pull ghcr.io/chopratejas/headroom:latest

Granular extras: [proxy], [mcp], [ml] (Kompress-base), [agno], [langchain], [evals]. Requires Python 3.10+.

Using pipx? Choose a supported interpreter explicitly:

pipx install --python python3.13 "headroom-ai[all]"

Installation guide — Docker tags, persistent service, PowerShell, devcontainers.

headroom learn

headroom learn in action

headroom learn — mines failed sessions, writes corrections to CLAUDE.md / AGENTS.md / GEMINI.md.

Documentation

Start here Go deeper
Quickstart Architecture
Proxy How compression works
MCP tools CCR — reversible compression
Memory Cache optimization
Failure learning Benchmarks
Configuration Limitations

Compared to

Headroom runs locally, covers every content type, works with every major framework, and is reversible.

Scope Deploy Local Reversible
Headroom All context — tools, RAG, logs, files, history Proxy · library · middleware · MCP Yes Yes
RTK CLI command outputs CLI wrapper Yes No
lean-ctx CLI commands, MCP tools, editor rules CLI wrapper · MCP Yes No
Compresr, Token Co. Text sent to their API Hosted API call No No
OpenAI Compaction Conversation history Provider-native No No

Attribution. Headroom ships with the excellent RTK binary for shell-output rewriting — git show --short, scoped ls, summarized installers. Huge thanks to the RTK team; their tool is a first-class part of our stack, and Headroom compresses everything downstream of it. Headroom can also use lean-ctx as the selected CLI context tool; set HEADROOM_CONTEXT_TOOL=lean-ctx before running headroom wrap ....

Contributing

git clone https://github.com/chopratejas/headroom.git && cd headroom
pip install -e ".[dev]" && pytest

Devcontainers in .devcontainer/ (default + memory-stack with Qdrant & Neo4j). See CONTRIBUTING.md.

Community

License

Apache 2.0 — see LICENSE.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

headroom_ai-0.21.37.tar.gz (1.6 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

headroom_ai-0.21.37-cp313-cp313-manylinux_2_28_x86_64.whl (18.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.37-cp313-cp313-manylinux_2_28_aarch64.whl (19.7 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.37-cp313-cp313-macosx_11_0_arm64.whl (17.4 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

headroom_ai-0.21.37-cp312-cp312-manylinux_2_28_x86_64.whl (18.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.37-cp312-cp312-manylinux_2_28_aarch64.whl (19.7 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.37-cp312-cp312-macosx_11_0_arm64.whl (17.4 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

headroom_ai-0.21.37-cp311-cp311-manylinux_2_28_x86_64.whl (18.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.37-cp311-cp311-manylinux_2_28_aarch64.whl (19.7 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.37-cp311-cp311-macosx_11_0_arm64.whl (17.4 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

headroom_ai-0.21.37-cp310-cp310-manylinux_2_28_x86_64.whl (18.6 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.37-cp310-cp310-manylinux_2_28_aarch64.whl (19.7 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.37-cp310-cp310-macosx_11_0_arm64.whl (17.4 MB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

File details

Details for the file headroom_ai-0.21.37.tar.gz.

File metadata

  • Download URL: headroom_ai-0.21.37.tar.gz
  • Upload date:
  • Size: 1.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for headroom_ai-0.21.37.tar.gz
Algorithm Hash digest
SHA256 03a46f62ad42413ac6652900a3dfd3debd2e6710e7fe03d6b60e7530c6875b9a
MD5 acafcb9c36503080c74ea791da8d2309
BLAKE2b-256 f67022a2196644f72231373d4357d77310ea9b16aee2ca3232ba6e2348929b6d

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37.tar.gz:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 259bfbecacd637f6e582535d80d744c75e3da52e1c9932480561d0b65133f7b2
MD5 60dbc59f6d5371de83feb75ab44bbe2a
BLAKE2b-256 89dd3594cc4ef70177c9d63d61875502636a9fbde704318359ed1618842a10fb

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp313-cp313-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 2b98e87365ff7a367453c7b4878d49773f90fcb95131c166bb27a2d553ccc1d9
MD5 818272cd4c8e9db3a4358d18009ee995
BLAKE2b-256 e2cf4c04a9f6fd30215ca17ed81ee1b597d6fbc1653543005191a7cb7f0557d9

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp313-cp313-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 d3ccaa3ccf56c3fcb3127136538ca74fd7f1d6e9b213b42725316ede8699df3a
MD5 cbe3438d8de282b86eccf5cb97214988
BLAKE2b-256 afeb1aa801d1e9d5a96f41e2465e8402c5f37333456e6162b2d883947eae6bd7

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp313-cp313-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0d3e177f2bbc63f48eece42497186230112d15d2d12594427c460d8a1ab50c95
MD5 3ec59d25987cee23ef3c8e0c8f13353c
BLAKE2b-256 2e1db47f4c27df910337e79ce0943b7fbb7d27468b11d931d525730e876832fc

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp312-cp312-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 3781d203406bb707d81071b9faf53895dc78d6fece1c1501f825f1c2227b8b4b
MD5 8d8883caf163a6dda6c429d4a80446b6
BLAKE2b-256 43f57a9baebf10cb0a70a2579d0e35ae6f55c00187e7fb78490e037577a39920

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp312-cp312-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 cade59c9ad31f24757de16317413489dfa2844629fdfa34f1b4f34c70e502269
MD5 2cd705497ade3a47639bcbb5381ac563
BLAKE2b-256 d0995a5a4b05e640d67eae86f2d9c9886a36952d5d7d582af1baefda012f529c

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp312-cp312-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 a85daf87d4b0b0c76263a74c9cdcc1eee029340c83077fc1ae399ae8403a353f
MD5 6ebb070a1e6747f1cfd73e1f9a011159
BLAKE2b-256 246a50f88c1c3a1329f55f6cebe064641f9f44a95d70281f474f6c925c044813

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp311-cp311-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 c8b5cf1db2231f5a6e9f701403f3a6a849f5c90932697ac7c25c138e3e2ded2c
MD5 d4f8172921db8b522df2c1e4551382f0
BLAKE2b-256 7b3d40ca71b58a701db183a1b83011995bd4e28ec6a8a1054dde50684a27a368

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp311-cp311-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 58fa9e03dc795203e9e1715021b881f7b2197a20f7d2843c7c6b1f2e2aeb75db
MD5 6c9adbadff2c8911fdb9ef2f19640008
BLAKE2b-256 9208d595dab9840702c50371ec4b138087f254e1a27fda731d035901c9a54b26

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp311-cp311-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 0f92524bf4f4f921fa8b228c0daa4c7a4ef692d5cf292f0e2d4f3b3371936305
MD5 3222e78b446aeabac31c8f3f51262838
BLAKE2b-256 a87cf6d10e441ea517b0867b28ad1a60de957e82adb4610426673ebab80110ec

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp310-cp310-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 7ae6bbca7bfdb29a12d39bf975b58aa6343d1e7a84380cda94fb67cbf27e4bd1
MD5 6b152b4535b8094e132f2fcd7d8eee56
BLAKE2b-256 176cba0bde9274e8190f30e21c88d35928ee01376da009deb455c39635a0786d

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp310-cp310-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.37-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.37-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 3766c58e754fd8e9c5bbfbced902c930fbb6c0e2a61025499dcfb4d8d590eafc
MD5 2604f28fa45d19d842bb7c3510a6cc1f
BLAKE2b-256 ed08d04b7701ec8f961ca0d069e0cdb18def8acedc334f8b4933e99a2c4bd62a

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.37-cp310-cp310-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page