Skip to main content

The Context Optimization Layer for LLM Applications - Cut costs by 50-90%

Project description

Headroom

Compress everything your AI agent reads. Same answers, fraction of the tokens.

CI codecov PyPI npm Model: Kompress-base Tokens saved: 60B+ License: Apache 2.0 Docs

Headroom in action

Every tool call, log line, DB read, RAG chunk, and file your agent injects into a prompt is mostly boilerplate. Headroom strips the noise and keeps the signal — losslessly, locally, and without touching accuracy.

100 logs. One FATAL error buried at position 67. Both runs found it. Baseline 10,144 tokens → Headroom 1,260 tokens87% fewer, identical answer. python examples/needle_in_haystack_test.py


Quick start

Works with Anthropic, OpenAI, Google, Bedrock, Vertex, Azure, OpenRouter, and 100+ models via LiteLLM.

Wrap your coding agent — one command:

pip install "headroom-ai[all]"

headroom wrap claude      # Claude Code
headroom wrap codex       # Codex
headroom wrap cursor      # Cursor
headroom wrap aider       # Aider
headroom wrap copilot     # GitHub Copilot CLI

Using pipx? Current release wheels are built for Python 3.10 through 3.13, so choose a supported interpreter explicitly:

pipx install --python python3.13 "headroom-ai[all]"

Drop it into your own code — Python or TypeScript:

from headroom import compress

result = compress(messages, model="claude-sonnet-4-5")
response = client.messages.create(model="claude-sonnet-4-5", messages=result.messages)
print(f"Saved {result.tokens_saved} tokens ({result.compression_ratio:.0%})")
import { compress } from 'headroom-ai';
const result = await compress(messages, { model: 'gpt-4o' });

Or run it as a proxy — zero code changes, any language:

headroom proxy --port 8787
ANTHROPIC_BASE_URL=http://localhost:8787 your-app
OPENAI_BASE_URL=http://localhost:8787/v1 your-app

Why Headroom

  • Accuracy-preserving. GSM8K 0.870 → 0.870 (±0.000). TruthfulQA +0.030. SQuAD v2 and BFCL both 97% accuracy after compression. Validated on public OSS benchmarks you can rerun yourself.
  • Runs on your machine. No cloud API, no data egress. Compression latency is milliseconds — faster end-to-end for Sonnet / Opus / GPT-4 class models than a hosted service round-trip.
  • Kompress-base on HuggingFace. Our open-source text compressor, fine-tuned on real agentic traces — tool outputs, logs, RAG chunks, code. Install with pip install "headroom-ai[ml]".
  • Cross-agent memory and learning. Claude Code saves a fact, Codex reads it back. headroom learn mines failed sessions and writes corrections straight to CLAUDE.md / AGENTS.md / GEMINI.md — reliability compounds over time.
  • Reversible (CCR). Compression is not deletion. The model can always call headroom_retrieve to pull the original bytes. Nothing is thrown away.

Bundles the RTK binary for shell-output rewriting — full attribution below.


How it fits

 Your agent / app
   (Claude Code, Cursor, Codex, LangChain, Agno, Strands, your own code…)
        │   prompts · tool outputs · logs · RAG results · files
        ▼
    ┌────────────────────────────────────────────────────┐
    │  Headroom   (runs locally — your data stays here)  │
    │  ───────────────────────────────────────────────   │
    │  CacheAligner  →  ContentRouter  →  CCR             │
    │                    ├─ SmartCrusher   (JSON)         │
    │                    ├─ CodeCompressor (AST)          │
    │                    └─ Kompress-base  (text, HF)     │
    │                                                     │
    │  Cross-agent memory  ·  headroom learn  ·  MCP      │
    └────────────────────────────────────────────────────┘
        │   compressed prompt  +  retrieval tool
        ▼
 LLM provider  (Anthropic · OpenAI · Bedrock · …)

Architecture · CCR reversible compression · Kompress-base model card

Canonical pipeline lifecycle

Headroom now exposes one stable request lifecycle across compress(), the SDK, and the proxy:

SetupPre-StartPost-StartInput ReceivedInput CachedInput RoutedInput CompressedInput RememberedPre-SendPost-SendResponse Received

  • Transforms still do the work: CacheAligner, ContentRouter, SmartCrusher, CodeCompressor, Kompress-base, IntelligentContext / RollingWindow.
  • Pipeline extensions observe or customize those lifecycle stages via on_pipeline_event(...).
  • Compression hooks still work and now sit alongside the canonical lifecycle instead of being the only extension seam.
  • Proxy extensions remain the server/app integration seam for ASGI middleware, routes, and startup policy.

Provider slices

Provider and tool-specific behavior is being moved behind dedicated modules under headroom/providers/ so core orchestration stays focused on lifecycle, sequencing, and policy.

  • CLI/tool slices: headroom/providers/claude, copilot, codex, openclaw
  • Provider runtime slices: headroom/providers/claude, gemini, plus shared backend/runtime dispatch in headroom/providers/registry.py
  • Core files stay orchestration-first: wrap.py, client.py, cli/proxy.py, and proxy/server.py now delegate provider-specific env shaping, API target normalization, backend selection, and transport dispatch instead of inlining those rules.

Proof

Savings on real agent workloads:

Workload Before After Savings
Code search (100 results) 17,765 1,408 92%
SRE incident debugging 65,694 5,118 92%
GitHub issue triage 54,174 14,761 73%
Codebase exploration 78,502 41,254 47%

Accuracy preserved on standard benchmarks:

Benchmark Category N Baseline Headroom Delta
GSM8K Math 100 0.870 0.870 ±0.000
TruthfulQA Factual 100 0.530 0.560 +0.030
SQuAD v2 QA 100 97% 19% compression
BFCL Tools 100 97% 32% compression

Reproduce:

python -m headroom.evals suite --tier 1

Community, live:

Full benchmarks & methodology


Built for coding agents

Agent One-command wrap Notes
Claude Code headroom wrap claude --memory for cross-agent memory, --code-graph for codebase intel
Codex headroom wrap codex --memory Shares the same memory store as Claude
Cursor headroom wrap cursor Prints Cursor config — paste once, done
Aider headroom wrap aider Starts proxy, launches Aider
Copilot CLI headroom wrap copilot Starts proxy, launches Copilot
OpenClaw headroom wrap openclaw Installs Headroom as ContextEngine plugin

MCP-native too — headroom mcp install exposes headroom_compress, headroom_retrieve, and headroom_stats to any MCP client.

headroom learn in action

Integrations

Drop Headroom into any stack
Your setup Hook in with
Any Python app compress(messages, model=…)
Any TypeScript app await compress(messages, { model })
Anthropic / OpenAI SDK withHeadroom(new Anthropic()) · withHeadroom(new OpenAI())
Vercel AI SDK wrapLanguageModel({ model, middleware: headroomMiddleware() })
LiteLLM litellm.callbacks = [HeadroomCallback()]
LangChain HeadroomChatModel(your_llm)
Agno HeadroomAgnoModel(your_model)
Strands Strands guide
ASGI apps app.add_middleware(CompressionMiddleware)
Multi-agent SharedContext().put / .get
MCP clients headroom mcp install
What's inside
  • SmartCrusher — universal JSON: arrays of dicts, nested objects, mixed types.
  • CodeCompressor — AST-aware for Python, JS, Go, Rust, Java, C++.
  • Kompress-base — our HuggingFace model, trained on agentic traces.
  • Image compression — 40–90% reduction via trained ML router.
  • CacheAligner — stabilizes prefixes so Anthropic/OpenAI KV caches actually hit.
  • IntelligentContext — score-based context fitting with learned importance.
  • CCR — reversible compression; LLM retrieves originals on demand.
  • Cross-agent memory — shared store, agent provenance, auto-dedup.
  • SharedContext — compressed context passing across multi-agent workflows.
  • headroom learn — plugin-based failure mining for Claude, Codex, Gemini.

Install

pip install "headroom-ai[all]"          # Python, everything
npm  install headroom-ai                # TypeScript / Node
docker pull ghcr.io/chopratejas/headroom:latest

Granular extras: [proxy], [mcp], [ml] (Kompress-base), [agno], [langchain], [evals]. Requires Python 3.10+.

Installation guide — Docker tags, persistent service, PowerShell, devcontainers.


Documentation

Start here Go deeper
Quickstart Architecture
Proxy How compression works
MCP tools CCR — reversible compression
Memory Cache optimization
Failure learning Benchmarks
Configuration Limitations

Compared to

Headroom runs locally, covers every content type (not just CLI or text), works with every major framework, and is reversible.

Scope Deploy Local Reversible
Headroom All context — tools, RAG, logs, files, history Proxy · library · middleware · MCP Yes Yes
RTK CLI command outputs CLI wrapper Yes No
Compresr, Token Co. Text sent to their API Hosted API call No No
OpenAI Compaction Conversation history Provider-native No No

Attribution. Headroom ships with the excellent RTK binary for shell-output rewriting — git showgit show --short, noisy ls → scoped, chatty installers → summarized. Huge thanks to the RTK team; their tool is a first-class part of our stack, and Headroom compresses everything downstream of it.


Contributing

git clone https://github.com/chopratejas/headroom.git && cd headroom
pip install -e ".[dev]" && pytest

Devcontainers in .devcontainer/ (default + memory-stack with Qdrant & Neo4j). See CONTRIBUTING.md.


Community

License

Apache 2.0 — see LICENSE.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

headroom_ai-0.21.9.tar.gz (1.5 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

headroom_ai-0.21.9-cp313-cp313-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.9-cp313-cp313-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.9-cp313-cp313-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

headroom_ai-0.21.9-cp312-cp312-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.9-cp312-cp312-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.9-cp312-cp312-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

headroom_ai-0.21.9-cp311-cp311-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.9-cp311-cp311-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.9-cp311-cp311-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

headroom_ai-0.21.9-cp310-cp310-manylinux_2_28_x86_64.whl (18.5 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.9-cp310-cp310-manylinux_2_28_aarch64.whl (19.6 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.9-cp310-cp310-macosx_11_0_arm64.whl (17.3 MB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

File details

Details for the file headroom_ai-0.21.9.tar.gz.

File metadata

  • Download URL: headroom_ai-0.21.9.tar.gz
  • Upload date:
  • Size: 1.5 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for headroom_ai-0.21.9.tar.gz
Algorithm Hash digest
SHA256 42df2ed4580e87ca87015a33ccba01feb7d9550d705094520d713c29517a8c20
MD5 1c6d92d1dd1da6f0ec8a34bb9a2ddee8
BLAKE2b-256 ca84cd70baf30ef4a86f2a5b8c9578f80eb3a18761c75baa45280756610766b9

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9.tar.gz:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 ce88a68839270c219e4fb1e53905c1e131ee828472b38a8c77e9605430c3a657
MD5 c637af674d4a4457133242c32dfb59c8
BLAKE2b-256 9f1c4ae442fe18ad91860a5e46b1c85f86f4af8c81ac677aec7c82fc5f3fe5ba

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp313-cp313-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 8777c7cbbd138b2fc433ba7ed2501c777f4e9c98c613a707ee36246e0bbab5e5
MD5 f089bf3c07ea754e568b3641ab44114e
BLAKE2b-256 4bd7e329d490679dab10a5af434dde3c99cb50c72900c418177d1db6c6009ce1

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp313-cp313-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 1fac55d543f2008c99d6b402cf6ce5dfc3af525110f227759c2873dc99c7c142
MD5 ca2d953954b5e385ae81d10b9b9a00c1
BLAKE2b-256 6c13981e1032e9720aa8dd13b535b527ef6c2c357b70e77caeb38e9d4de66bfa

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp313-cp313-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 77bd5401a3ff1c7639b3369fe6e51bd5008d2e8e59f77dcc1a6b6483477a25c0
MD5 07c543b2597454115c2dba2a48a08a5a
BLAKE2b-256 40f24e7c52f20eaf05903ab1a3e44adcf4bfaadb8c7d483250fbe698b0ba0fae

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp312-cp312-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 a38956ec6df848177f9259a292ad3fc76bbd300043d122b70532c3d968140034
MD5 ed085986d09185b53fc43c6e9ba02216
BLAKE2b-256 95b83bdee8c59c845d244e0a22feb43831b1a6898115a9e5d7cb00846314e891

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp312-cp312-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 3c9420b3e5794eb1c059f54d386735afbf88ad2e0f40d4048ac7926164768077
MD5 6d9149e3f76a0e9519a35adffc0ca749
BLAKE2b-256 01deaf63792fb18606d1228b0607e26838bf05a5bd27df87e6dd875adfbdefec

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp312-cp312-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 6e5125e9b7ec6196f9466dc93a2807a82f22faa373c50e8d9577caa33e279e06
MD5 ee19f5c1e8a3a7e0f0c6a037564c51d0
BLAKE2b-256 e4d7e367d0aeeb1c6f92777bdab63743d5cad8a03dcabfb7a2d1f825fc0e6125

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp311-cp311-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 4bb4a4426422c2c4cc79bad2d1385ff9a9e63dbd87c9625f9678bd21853e5297
MD5 ce8ffc9eca7832957579be15f68dbc45
BLAKE2b-256 5901ea58d58822b548f7035be686fb27afd2e9b2e98f85608c05e8f46d50d705

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp311-cp311-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 799cc60a200e48d749ee709ab17bd420590b84923bf3f755c487d6f66d9c3692
MD5 f52aa37440a5aa672534ba99a9b634db
BLAKE2b-256 762e37b14f30f8a4be4eb973b7f9a10abee1c75f1e0d1a29345dc74a221aec73

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp311-cp311-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 6971a716c579f53c3052a8117b637571b3336bffa96df345ef579545de7cfc8b
MD5 66e551e58e73fdbdcf714eda9520cfcb
BLAKE2b-256 2a0fab991e54610ac53fba6885e291128e8f91a4651b98cdf01fa9ae599cf4fe

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp310-cp310-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 cb980410862b791564fd2e440230e18a7f132ba5d6e00bc8c935fdb789b48c12
MD5 8cbbec55bc99d2c1c647b09be22b38f4
BLAKE2b-256 7440bf62c0787a2789e887fd1d1f8cf19deaf0b95cd9cad957085141d3fa383f

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp310-cp310-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.9-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.9-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 e49da969095415f60aa451cc5888d75585c66416dd46e10ffe1d56f957b0f61e
MD5 5d1bb04f6ebdd1d3f2bd46617785b880
BLAKE2b-256 21b88bd93559f20fc487abb02773018fc1b0d2f96b7f5203c4ec60fd6be75cd9

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.9-cp310-cp310-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page