Skip to main content

The Context Optimization Layer for LLM Applications - Cut costs by 50-90%

Project description

Headroom

Compress everything your AI agent reads. Same answers, fraction of the tokens.

CI codecov PyPI npm Model: Kompress-base Tokens saved: 60B+ License: Apache 2.0 Docs

Headroom in action

Every tool call, log line, DB read, RAG chunk, and file your agent injects into a prompt is mostly boilerplate. Headroom strips the noise and keeps the signal — losslessly, locally, and without touching accuracy.

100 logs. One FATAL error buried at position 67. Both runs found it. Baseline 10,144 tokens → Headroom 1,260 tokens87% fewer, identical answer. python examples/needle_in_haystack_test.py


Quick start

Works with Anthropic, OpenAI, Google, Bedrock, Vertex, Azure, OpenRouter, and 100+ models via LiteLLM.

Wrap your coding agent — one command:

pip install "headroom-ai[all]"

headroom wrap claude      # Claude Code
headroom wrap codex       # Codex
headroom wrap cursor      # Cursor
headroom wrap aider       # Aider
headroom wrap copilot     # GitHub Copilot CLI

Using pipx? Current release wheels are built for Python 3.10 through 3.13, so choose a supported interpreter explicitly:

pipx install --python python3.13 "headroom-ai[all]"

Drop it into your own code — Python or TypeScript:

from headroom import compress

result = compress(messages, model="claude-sonnet-4-5")
response = client.messages.create(model="claude-sonnet-4-5", messages=result.messages)
print(f"Saved {result.tokens_saved} tokens ({result.compression_ratio:.0%})")
import { compress } from 'headroom-ai';
const result = await compress(messages, { model: 'gpt-4o' });

Or run it as a proxy — zero code changes, any language:

headroom proxy --port 8787
ANTHROPIC_BASE_URL=http://localhost:8787 your-app
OPENAI_BASE_URL=http://localhost:8787/v1 your-app

Why Headroom

  • Accuracy-preserving. GSM8K 0.870 → 0.870 (±0.000). TruthfulQA +0.030. SQuAD v2 and BFCL both 97% accuracy after compression. Validated on public OSS benchmarks you can rerun yourself.
  • Runs on your machine. No cloud API, no data egress. Compression latency is milliseconds — faster end-to-end for Sonnet / Opus / GPT-4 class models than a hosted service round-trip.
  • Kompress-base on HuggingFace. Our open-source text compressor, fine-tuned on real agentic traces — tool outputs, logs, RAG chunks, code. Install with pip install "headroom-ai[ml]".
  • Cross-agent memory and learning. Claude Code saves a fact, Codex reads it back. headroom learn mines failed sessions and writes corrections straight to CLAUDE.md / AGENTS.md / GEMINI.md — reliability compounds over time.
  • Reversible (CCR). Compression is not deletion. The model can always call headroom_retrieve to pull the original bytes. Nothing is thrown away.

Bundles the RTK binary for shell-output rewriting — full attribution below.


How it fits

 Your agent / app
   (Claude Code, Cursor, Codex, LangChain, Agno, Strands, your own code…)
        │   prompts · tool outputs · logs · RAG results · files
        ▼
    ┌────────────────────────────────────────────────────┐
    │  Headroom   (runs locally — your data stays here)  │
    │  ───────────────────────────────────────────────   │
    │  CacheAligner  →  ContentRouter  →  CCR             │
    │                    ├─ SmartCrusher   (JSON)         │
    │                    ├─ CodeCompressor (AST)          │
    │                    └─ Kompress-base  (text, HF)     │
    │                                                     │
    │  Cross-agent memory  ·  headroom learn  ·  MCP      │
    └────────────────────────────────────────────────────┘
        │   compressed prompt  +  retrieval tool
        ▼
 LLM provider  (Anthropic · OpenAI · Bedrock · …)

Architecture · CCR reversible compression · Kompress-base model card

Canonical pipeline lifecycle

Headroom now exposes one stable request lifecycle across compress(), the SDK, and the proxy:

SetupPre-StartPost-StartInput ReceivedInput CachedInput RoutedInput CompressedInput RememberedPre-SendPost-SendResponse Received

  • Transforms still do the work: CacheAligner, ContentRouter, SmartCrusher, CodeCompressor, Kompress-base, IntelligentContext / RollingWindow.
  • Pipeline extensions observe or customize those lifecycle stages via on_pipeline_event(...).
  • Compression hooks still work and now sit alongside the canonical lifecycle instead of being the only extension seam.
  • Proxy extensions remain the server/app integration seam for ASGI middleware, routes, and startup policy.

Provider slices

Provider and tool-specific behavior is being moved behind dedicated modules under headroom/providers/ so core orchestration stays focused on lifecycle, sequencing, and policy.

  • CLI/tool slices: headroom/providers/claude, copilot, codex, openclaw
  • Provider runtime slices: headroom/providers/claude, gemini, plus shared backend/runtime dispatch in headroom/providers/registry.py
  • Core files stay orchestration-first: wrap.py, client.py, cli/proxy.py, and proxy/server.py now delegate provider-specific env shaping, API target normalization, backend selection, and transport dispatch instead of inlining those rules.

Proof

Savings on real agent workloads:

Workload Before After Savings
Code search (100 results) 17,765 1,408 92%
SRE incident debugging 65,694 5,118 92%
GitHub issue triage 54,174 14,761 73%
Codebase exploration 78,502 41,254 47%

Accuracy preserved on standard benchmarks:

Benchmark Category N Baseline Headroom Delta
GSM8K Math 100 0.870 0.870 ±0.000
TruthfulQA Factual 100 0.530 0.560 +0.030
SQuAD v2 QA 100 97% 19% compression
BFCL Tools 100 97% 32% compression

Reproduce:

python -m headroom.evals suite --tier 1

Community, live:

Full benchmarks & methodology


Built for coding agents

Agent One-command wrap Notes
Claude Code headroom wrap claude --memory for cross-agent memory, --code-graph for codebase intel
Codex headroom wrap codex --memory Shares the same memory store as Claude
Cursor headroom wrap cursor Prints Cursor config — paste once, done
Aider headroom wrap aider Starts proxy, launches Aider
Copilot CLI headroom wrap copilot Starts proxy, launches Copilot
OpenClaw headroom wrap openclaw Installs Headroom as ContextEngine plugin

MCP-native too — headroom mcp install exposes headroom_compress, headroom_retrieve, and headroom_stats to any MCP client.

headroom learn in action

Integrations

Drop Headroom into any stack
Your setup Hook in with
Any Python app compress(messages, model=…)
Any TypeScript app await compress(messages, { model })
Anthropic / OpenAI SDK withHeadroom(new Anthropic()) · withHeadroom(new OpenAI())
Vercel AI SDK wrapLanguageModel({ model, middleware: headroomMiddleware() })
LiteLLM litellm.callbacks = [HeadroomCallback()]
LangChain HeadroomChatModel(your_llm)
Agno HeadroomAgnoModel(your_model)
Strands Strands guide
ASGI apps app.add_middleware(CompressionMiddleware)
Multi-agent SharedContext().put / .get
MCP clients headroom mcp install
What's inside
  • SmartCrusher — universal JSON: arrays of dicts, nested objects, mixed types.
  • CodeCompressor — AST-aware for Python, JS, Go, Rust, Java, C++.
  • Kompress-base — our HuggingFace model, trained on agentic traces.
  • Image compression — 40–90% reduction via trained ML router.
  • CacheAligner — stabilizes prefixes so Anthropic/OpenAI KV caches actually hit.
  • IntelligentContext — score-based context fitting with learned importance.
  • CCR — reversible compression; LLM retrieves originals on demand.
  • Cross-agent memory — shared store, agent provenance, auto-dedup.
  • SharedContext — compressed context passing across multi-agent workflows.
  • headroom learn — plugin-based failure mining for Claude, Codex, Gemini.

Install

pip install "headroom-ai[all]"          # Python, everything
npm  install headroom-ai                # TypeScript / Node
docker pull ghcr.io/chopratejas/headroom:latest

Granular extras: [proxy], [mcp], [ml] (Kompress-base), [agno], [langchain], [evals]. Requires Python 3.10+.

Installation guide — Docker tags, persistent service, PowerShell, devcontainers.


Documentation

Start here Go deeper
Quickstart Architecture
Proxy How compression works
MCP tools CCR — reversible compression
Memory Cache optimization
Failure learning Benchmarks
Configuration Limitations

Compared to

Headroom runs locally, covers every content type (not just CLI or text), works with every major framework, and is reversible.

Scope Deploy Local Reversible
Headroom All context — tools, RAG, logs, files, history Proxy · library · middleware · MCP Yes Yes
RTK CLI command outputs CLI wrapper Yes No
Compresr, Token Co. Text sent to their API Hosted API call No No
OpenAI Compaction Conversation history Provider-native No No

Attribution. Headroom ships with the excellent RTK binary for shell-output rewriting — git showgit show --short, noisy ls → scoped, chatty installers → summarized. Huge thanks to the RTK team; their tool is a first-class part of our stack, and Headroom compresses everything downstream of it.


Contributing

git clone https://github.com/chopratejas/headroom.git && cd headroom
pip install -e ".[dev]" && pytest

Devcontainers in .devcontainer/ (default + memory-stack with Qdrant & Neo4j). See CONTRIBUTING.md.


Community

License

Apache 2.0 — see LICENSE.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

headroom_ai-0.21.12.tar.gz (1.5 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

headroom_ai-0.21.12-cp313-cp313-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.12-cp313-cp313-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.12-cp313-cp313-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

headroom_ai-0.21.12-cp312-cp312-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.12-cp312-cp312-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.12-cp312-cp312-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

headroom_ai-0.21.12-cp311-cp311-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.12-cp311-cp311-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.12-cp311-cp311-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

headroom_ai-0.21.12-cp310-cp310-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.12-cp310-cp310-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.12-cp310-cp310-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

File details

Details for the file headroom_ai-0.21.12.tar.gz.

File metadata

  • Download URL: headroom_ai-0.21.12.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for headroom_ai-0.21.12.tar.gz
Algorithm Hash digest
SHA256 545ca62cae91076756b0720e336793254d4fdd4c7a3f559a8bb5dbbd3136af93
MD5 fcbc7062b8eec3648278bdd283393ba0
BLAKE2b-256 9828f3c36e4709a57fedf4991933792593900f19f10a6e77ff9bcaa4371b9f18

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12.tar.gz:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 19f95cb8a5a5a28889ae484cdc493905be7fc000112f8e93addec2797b8551dd
MD5 03d0df37b332b19c2d6cf9ddb38b8864
BLAKE2b-256 77346594bab063766c535e704d510a20cc2a24ae8727e8c9b884273768bc9288

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp313-cp313-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 0db252571c75baa353c1990b7e966d56121f0063f121bb6c5b634be20ab9dcb7
MD5 01de21077418d37446dceb632e4591ce
BLAKE2b-256 d5b2600dcda20c07b33c3715b5077ec930571efb8c98c5b949ba0e68a108ef83

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp313-cp313-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 7dc3cfc48d522c43ddfe5f0ffdb794b5c50d721d6e97e18a80aa9977977895d5
MD5 06e79efdc88868f555b8ed68ba47d978
BLAKE2b-256 e0f181f114c42b79625b5c9614f698275791372fa51db76925d4aed666396b2e

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp313-cp313-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 4bd12fc272d39e929338a4f836fc4143d479b4b012f76228d079f276dcd1de18
MD5 9f0bfe273420860528b9364c4e0c12b8
BLAKE2b-256 9265451f0757c7697e5d3fe6dd8f81e660ca95a1dec97ac31477e57ee0d22056

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp312-cp312-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 f0077fe0cbbb73e8425baa58183c834b29cdc2d16dc3ccb49cb910998c8287cf
MD5 f233da93a6ac8c5b0a18c834c50edbdb
BLAKE2b-256 0c73de35f3c2549177df774a34aa5e503e050265687d8319e5265298b63eba44

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp312-cp312-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 1cc35d2ff120d0ba42417e6e3d47917059a6780c2047f993374a4b95e9a24acd
MD5 8920049d929eb3af3a4adbc9aecc088d
BLAKE2b-256 4cb9ddb9196cea89fabd2245f751afe7ed4ae1aa775b448c0f55dbe004a30e14

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp312-cp312-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 1fe28daa6391e86ea85127a5b6afa46d80ae217e9e32f5714ce07a0daf6ce26e
MD5 dde96e88dc30cdc59bd113d339eb90d2
BLAKE2b-256 f941d576c2ece353cef83742a3aa961b345988cce017f9b8c907b1873494369e

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp311-cp311-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 3bd937f5cca8eb6be4edcbe288c7bf80456c3b76d0475c468d1a165c358344e3
MD5 22f0020521176d9226ed9f238c57babd
BLAKE2b-256 a8ef0efeaf7b98afaa05eb576a59a59c1c57caf5549ef7599e168492a4ef6931

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp311-cp311-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 b141ae5882eac5ae6af9392565df4059f4230b180002baa6bb325d5c8aeca912
MD5 92b7c805a478b4486502d5a60b8e5c5d
BLAKE2b-256 de667b113ec38e8f369012b184e7996c4d3f007eb3cab726c650e051d5a0caa5

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp311-cp311-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 f1a6f80d53aee64bbb6579f5bed6d142251740a31324966063319b93a8bc7f4a
MD5 b7322cb075b975a7e6bd17f6aa54fda1
BLAKE2b-256 540216873a67e32bb6937e59e36d20550585751d742d65a72d6cf243c4cabf06

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp310-cp310-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 70d13addb8898992c6547d35914c5d829971e28e4ada330233675803872d9f38
MD5 85acfabe5eb2c67a04d80b43209fab13
BLAKE2b-256 4b391b80a3fbafb39a038c0759d24d54fcf3c823d2b6d37d5a99d14fb44e3b89

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp310-cp310-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.12-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.12-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 85aa7293b01c74aad132d29f84435c20561ea92d60bd18c43a9fa69fb4548284
MD5 e7cdb5a73e631dcec7d0536f2793b9e4
BLAKE2b-256 302b748270c69766630004df0f87b911fa338e6971d6937bfdeee469e188ca79

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.12-cp310-cp310-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page