Skip to main content

The Context Optimization Layer for LLM Applications - Cut costs by 50-90%

Project description

Headroom

Compress everything your AI agent reads. Same answers, fraction of the tokens.

CI codecov PyPI npm Model: Kompress-base Tokens saved: 60B+ License: Apache 2.0 Docs

Headroom in action

Every tool call, log line, DB read, RAG chunk, and file your agent injects into a prompt is mostly boilerplate. Headroom strips the noise and keeps the signal — losslessly, locally, and without touching accuracy.

100 logs. One FATAL error buried at position 67. Both runs found it. Baseline 10,144 tokens → Headroom 1,260 tokens87% fewer, identical answer. python examples/needle_in_haystack_test.py


Quick start

Works with Anthropic, OpenAI, Google, Bedrock, Vertex, Azure, OpenRouter, and 100+ models via LiteLLM.

Wrap your coding agent — one command:

pip install "headroom-ai[all]"

headroom wrap claude      # Claude Code
headroom wrap codex       # Codex
headroom wrap cursor      # Cursor
headroom wrap aider       # Aider
headroom wrap copilot     # GitHub Copilot CLI

Using pipx? Current release wheels are built for Python 3.10 through 3.13, so choose a supported interpreter explicitly:

pipx install --python python3.13 "headroom-ai[all]"

Drop it into your own code — Python or TypeScript:

from headroom import compress

result = compress(messages, model="claude-sonnet-4-5")
response = client.messages.create(model="claude-sonnet-4-5", messages=result.messages)
print(f"Saved {result.tokens_saved} tokens ({result.compression_ratio:.0%})")
import { compress } from 'headroom-ai';
const result = await compress(messages, { model: 'gpt-4o' });

Or run it as a proxy — zero code changes, any language:

headroom proxy --port 8787
ANTHROPIC_BASE_URL=http://localhost:8787 your-app
OPENAI_BASE_URL=http://localhost:8787/v1 your-app

Why Headroom

  • Accuracy-preserving. GSM8K 0.870 → 0.870 (±0.000). TruthfulQA +0.030. SQuAD v2 and BFCL both 97% accuracy after compression. Validated on public OSS benchmarks you can rerun yourself.
  • Runs on your machine. No cloud API, no data egress. Compression latency is milliseconds — faster end-to-end for Sonnet / Opus / GPT-4 class models than a hosted service round-trip.
  • Kompress-base on HuggingFace. Our open-source text compressor, fine-tuned on real agentic traces — tool outputs, logs, RAG chunks, code. Install with pip install "headroom-ai[ml]".
  • Cross-agent memory and learning. Claude Code saves a fact, Codex reads it back. headroom learn mines failed sessions and writes corrections straight to CLAUDE.md / AGENTS.md / GEMINI.md — reliability compounds over time.
  • Reversible (CCR). Compression is not deletion. The model can always call headroom_retrieve to pull the original bytes. Nothing is thrown away.

Bundles the RTK binary for shell-output rewriting — full attribution below.


How it fits

 Your agent / app
   (Claude Code, Cursor, Codex, LangChain, Agno, Strands, your own code…)
        │   prompts · tool outputs · logs · RAG results · files
        ▼
    ┌────────────────────────────────────────────────────┐
    │  Headroom   (runs locally — your data stays here)  │
    │  ───────────────────────────────────────────────   │
    │  CacheAligner  →  ContentRouter  →  CCR             │
    │                    ├─ SmartCrusher   (JSON)         │
    │                    ├─ CodeCompressor (AST)          │
    │                    └─ Kompress-base  (text, HF)     │
    │                                                     │
    │  Cross-agent memory  ·  headroom learn  ·  MCP      │
    └────────────────────────────────────────────────────┘
        │   compressed prompt  +  retrieval tool
        ▼
 LLM provider  (Anthropic · OpenAI · Bedrock · …)

Architecture · CCR reversible compression · Kompress-base model card

Canonical pipeline lifecycle

Headroom now exposes one stable request lifecycle across compress(), the SDK, and the proxy:

SetupPre-StartPost-StartInput ReceivedInput CachedInput RoutedInput CompressedInput RememberedPre-SendPost-SendResponse Received

  • Transforms still do the work: CacheAligner, ContentRouter, SmartCrusher, CodeCompressor, Kompress-base, IntelligentContext / RollingWindow.
  • Pipeline extensions observe or customize those lifecycle stages via on_pipeline_event(...).
  • Compression hooks still work and now sit alongside the canonical lifecycle instead of being the only extension seam.
  • Proxy extensions remain the server/app integration seam for ASGI middleware, routes, and startup policy.

Provider slices

Provider and tool-specific behavior is being moved behind dedicated modules under headroom/providers/ so core orchestration stays focused on lifecycle, sequencing, and policy.

  • CLI/tool slices: headroom/providers/claude, copilot, codex, openclaw
  • Provider runtime slices: headroom/providers/claude, gemini, plus shared backend/runtime dispatch in headroom/providers/registry.py
  • Core files stay orchestration-first: wrap.py, client.py, cli/proxy.py, and proxy/server.py now delegate provider-specific env shaping, API target normalization, backend selection, and transport dispatch instead of inlining those rules.

Proof

Savings on real agent workloads:

Workload Before After Savings
Code search (100 results) 17,765 1,408 92%
SRE incident debugging 65,694 5,118 92%
GitHub issue triage 54,174 14,761 73%
Codebase exploration 78,502 41,254 47%

Accuracy preserved on standard benchmarks:

Benchmark Category N Baseline Headroom Delta
GSM8K Math 100 0.870 0.870 ±0.000
TruthfulQA Factual 100 0.530 0.560 +0.030
SQuAD v2 QA 100 97% 19% compression
BFCL Tools 100 97% 32% compression

Reproduce:

python -m headroom.evals suite --tier 1

Community, live:

Full benchmarks & methodology


Built for coding agents

Agent One-command wrap Notes
Claude Code headroom wrap claude --memory for cross-agent memory, --code-graph for codebase intel
Codex headroom wrap codex --memory Shares the same memory store as Claude
Cursor headroom wrap cursor Prints Cursor config — paste once, done
Aider headroom wrap aider Starts proxy, launches Aider
Copilot CLI headroom wrap copilot Starts proxy, launches Copilot
OpenClaw headroom wrap openclaw Installs Headroom as ContextEngine plugin

MCP-native too — headroom mcp install exposes headroom_compress, headroom_retrieve, and headroom_stats to any MCP client.

headroom learn in action

Integrations

Drop Headroom into any stack
Your setup Hook in with
Any Python app compress(messages, model=…)
Any TypeScript app await compress(messages, { model })
Anthropic / OpenAI SDK withHeadroom(new Anthropic()) · withHeadroom(new OpenAI())
Vercel AI SDK wrapLanguageModel({ model, middleware: headroomMiddleware() })
LiteLLM litellm.callbacks = [HeadroomCallback()]
LangChain HeadroomChatModel(your_llm)
Agno HeadroomAgnoModel(your_model)
Strands Strands guide
ASGI apps app.add_middleware(CompressionMiddleware)
Multi-agent SharedContext().put / .get
MCP clients headroom mcp install
What's inside
  • SmartCrusher — universal JSON: arrays of dicts, nested objects, mixed types.
  • CodeCompressor — AST-aware for Python, JS, Go, Rust, Java, C++.
  • Kompress-base — our HuggingFace model, trained on agentic traces.
  • Image compression — 40–90% reduction via trained ML router.
  • CacheAligner — stabilizes prefixes so Anthropic/OpenAI KV caches actually hit.
  • IntelligentContext — score-based context fitting with learned importance.
  • CCR — reversible compression; LLM retrieves originals on demand.
  • Cross-agent memory — shared store, agent provenance, auto-dedup.
  • SharedContext — compressed context passing across multi-agent workflows.
  • headroom learn — plugin-based failure mining for Claude, Codex, Gemini.

Install

pip install "headroom-ai[all]"          # Python, everything
npm  install headroom-ai                # TypeScript / Node
docker pull ghcr.io/chopratejas/headroom:latest

Granular extras: [proxy], [mcp], [ml] (Kompress-base), [agno], [langchain], [evals]. Requires Python 3.10+.

Installation guide — Docker tags, persistent service, PowerShell, devcontainers.


Documentation

Start here Go deeper
Quickstart Architecture
Proxy How compression works
MCP tools CCR — reversible compression
Memory Cache optimization
Failure learning Benchmarks
Configuration Limitations

Compared to

Headroom runs locally, covers every content type (not just CLI or text), works with every major framework, and is reversible.

Scope Deploy Local Reversible
Headroom All context — tools, RAG, logs, files, history Proxy · library · middleware · MCP Yes Yes
RTK CLI command outputs CLI wrapper Yes No
Compresr, Token Co. Text sent to their API Hosted API call No No
OpenAI Compaction Conversation history Provider-native No No

Attribution. Headroom ships with the excellent RTK binary for shell-output rewriting — git showgit show --short, noisy ls → scoped, chatty installers → summarized. Huge thanks to the RTK team; their tool is a first-class part of our stack, and Headroom compresses everything downstream of it.


Contributing

git clone https://github.com/chopratejas/headroom.git && cd headroom
pip install -e ".[dev]" && pytest

Devcontainers in .devcontainer/ (default + memory-stack with Qdrant & Neo4j). See CONTRIBUTING.md.


Community

License

Apache 2.0 — see LICENSE.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

headroom_ai-0.21.15.tar.gz (1.5 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

headroom_ai-0.21.15-cp313-cp313-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.15-cp313-cp313-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.15-cp313-cp313-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

headroom_ai-0.21.15-cp312-cp312-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.15-cp312-cp312-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.15-cp312-cp312-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

headroom_ai-0.21.15-cp311-cp311-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.15-cp311-cp311-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.15-cp311-cp311-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

headroom_ai-0.21.15-cp310-cp310-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.15-cp310-cp310-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.15-cp310-cp310-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

File details

Details for the file headroom_ai-0.21.15.tar.gz.

File metadata

  • Download URL: headroom_ai-0.21.15.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for headroom_ai-0.21.15.tar.gz
Algorithm Hash digest
SHA256 7bc670be58d5d13e1b5065f505661033c64dfea359423be8e39776c82eb3cf25
MD5 d5cdd4dbf11b5f211465bc1bea4c7d68
BLAKE2b-256 678ae12795f88ed546fc2dcab331b78dbcc1cbb3061fb5667ed69e17ea5dc906

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15.tar.gz:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 c9fdd04b6f1a1f0b7106599a080d1e73eed756b08a69355b0465c5a21fe24d6b
MD5 a3371e21031555abea9e3d6283fb46aa
BLAKE2b-256 8b3e0fbdb56a2f0007d3747fddf723c5be2f9e45434f4632ce03ee92954303e8

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp313-cp313-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 824c57b80ebf6bae3122dabe086b1257343bfd690679291257c449527eb8713b
MD5 f6a2e6eec774129f3b07d78d6ba6319d
BLAKE2b-256 bf95854661e0ba2787593264a8b903933e4b31285869bbcdb03aafba692c6010

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp313-cp313-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 f0b804e832f72558ec07157c783d6d38023c1d2a46f04c4473ee8a1108514b7b
MD5 1a6064a1bf4ca267799e9fce5a8d0a61
BLAKE2b-256 31f20a8837c9dca4578818720d80d3953e983ad234ff24e457a4015a507b6786

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp313-cp313-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 4eeb64a7cfc967076ebcfcd6d63670d57824cbc5240a084468f0fee2d71378e1
MD5 4eefd8b3cca6c1ccd2b765f9b3b6e59e
BLAKE2b-256 f1e1319b26842a7d54749e0ca64cad437fea36ac1b559915457929d5d3b56673

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp312-cp312-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 4b7cce51b7978790c23d240ca21144102310b5164d526fa9cec96ec40dbea7a9
MD5 aad24fe909b997ba7c486ae1cf1a8a2e
BLAKE2b-256 742f08907d7e374edf4e5c057460fb04aab0cc7deef9f57cd50d4398903845ae

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp312-cp312-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 f01d30f0813120b7a176de3a81a494940808cbaafff22bf8078e813179dcb12d
MD5 a5586dcb7f13aaec59ab33c0386914ee
BLAKE2b-256 126a5eeb211d2f08dc4bc82ee08d23bc4cafb4cc46246c70a877af86fc7f9c69

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp312-cp312-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 819fdd84cf4936a9ee9e2c7a9d6c9fd4d380a7b504a5a3a07fccf2be950a888d
MD5 4b1bf9823f827f7d638282b46dd0377d
BLAKE2b-256 3eee84182bbdf46ef3689262d510f50a2da5f837f51e216b8397ce8d89d4446c

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp311-cp311-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 373fb296909abed96acb01035d1500bed0d91e5278138094ef49ffffba82e811
MD5 ec86fb81ea92ca15f4a1147b06cd9be4
BLAKE2b-256 09eda4a570247da6c6c7569281a0d605aca6a269169e27f59fe32f00b1c67ef9

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp311-cp311-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 126d59c0f79996240699270c4affa36054155768d981928c976c37c3ad08d2d2
MD5 73040a26488c67bae2b866b4c45bfc8c
BLAKE2b-256 12a23784b00a6829bbcb7affd187cf685fcef5ddad2eb945df377cb06a80e3be

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp311-cp311-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 8d488916e0a29fc90c570f51127b1f3ced0290184745730a8050e3a33a2277e5
MD5 2dc059ccc1a5cefeaebbdb4837ecf88b
BLAKE2b-256 b56dbd3198aecbda945bba93e7e03983de0a119bda0e528a8b830724363e9edf

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp310-cp310-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 2752b57ace62fd5806488ead649ec4c4cd36f1760d66341bfac4b751cb4a2638
MD5 cfa5bb43abf341e164ad37536937e3b9
BLAKE2b-256 051fbb170eee8b03dc5f8017fc995f58039fad186cead99f840da7be59cc3eb1

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp310-cp310-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.15-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.15-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 6cb99184d7decbeec30238c3ea63bb261254fe80ee63d0d4c39d1984a132e458
MD5 14b462ebd93de601255a0b30f0855f7c
BLAKE2b-256 020525f4f7ea0a8cf7174f48ead764d8b60a997e68f306a26df9ae12f2053f70

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.15-cp310-cp310-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page