Skip to main content

The Context Optimization Layer for LLM Applications - Cut costs by 50-90%

Project description

  ██╗  ██╗███████╗ █████╗ ██████╗ ██████╗  ██████╗  ██████╗ ███╗   ███╗
  ██║  ██║██╔════╝██╔══██╗██╔══██╗██╔══██╗██╔═══██╗██╔═══██╗████╗ ████║
  ███████║█████╗  ███████║██║  ██║██████╔╝██║   ██║██║   ██║██╔████╔██║
  ██╔══██║██╔══╝  ██╔══██║██║  ██║██╔══██╗██║   ██║██║   ██║██║╚██╔╝██║
  ██║  ██║███████╗██║  ██║██████╔╝██║  ██║╚██████╔╝╚██████╔╝██║ ╚═╝ ██║
  ╚═╝  ╚═╝╚══════╝╚═╝  ╚═╝╚═════╝ ╚═╝  ╚═╝ ╚═════╝  ╚═════╝ ╚═╝     ╚═╝
                  The context compression layer for AI agents

60–95% fewer tokens · library · proxy · MCP · 6 algorithms · local-first · reversible

CI codecov PyPI npm Model: Kompress-base Tokens saved: 60B+ License: Apache 2.0 Docs

Docs · Install · Proof · Agents · Discord · llms.txt

AI agents / LLMs: read /llms.txt here, or fetch the live index / full docs blob.


Headroom compresses everything your AI agent reads — tool outputs, logs, RAG chunks, files, and conversation history — before it reaches the LLM. Same answers, fraction of the tokens.

Headroom in action
Live: 10,144 → 1,260 tokens — same FATAL found.

What it does

  • Librarycompress(messages) in Python or TypeScript, inline in any app
  • Proxyheadroom proxy --port 8787, zero code changes, any language
  • Agent wrapheadroom wrap claude|codex|cursor|aider|copilot in one command
  • MCP serverheadroom_compress, headroom_retrieve, headroom_stats for any MCP client
  • Cross-agent memory — shared store across Claude, Codex, Gemini, auto-dedup
  • headroom learn — mines failed sessions, writes corrections to CLAUDE.md / AGENTS.md
  • Reversible (CCR) — originals never deleted; LLM retrieves on demand

How it works (30 seconds)

 Your agent / app
   (Claude Code, Cursor, Codex, LangChain, Agno, Strands, your own code…)
        │   prompts · tool outputs · logs · RAG results · files
        ▼
    ┌────────────────────────────────────────────────────┐
    │  Headroom   (runs locally — your data stays here)  │
    │  ───────────────────────────────────────────────   │
    │  CacheAligner  →  ContentRouter  →  CCR             │
    │                    ├─ SmartCrusher   (JSON)         │
    │                    ├─ CodeCompressor (AST)          │
    │                    └─ Kompress-base  (text, HF)     │
    │                                                     │
    │  Cross-agent memory  ·  headroom learn  ·  MCP      │
    └────────────────────────────────────────────────────┘
        │   compressed prompt  +  retrieval tool
        ▼
 LLM provider  (Anthropic · OpenAI · Bedrock · …)
  • ContentRouter — detects content type, selects the right compressor
  • SmartCrusher / CodeCompressor / Kompress-base — compress JSON, AST, or prose
  • CacheAligner — stabilizes prefixes so provider KV caches actually hit
  • CCR — stores originals locally; LLM calls headroom_retrieve if it needs them

Architecture · CCR reversible compression · Kompress-base model card

Get started (60 seconds)

# 1 — Install
pip install "headroom-ai[all]"          # Python
npm install headroom-ai                 # Node / TypeScript

# 2 — Pick your mode
headroom wrap claude                    # wrap a coding agent
headroom proxy --port 8787              # drop-in proxy, zero code changes
# or: from headroom import compress      # inline library

# 3 — See the savings
headroom stats

Granular extras: [proxy], [mcp], [ml], [agno], [langchain], [evals]. Requires Python 3.10+.

Proof

Savings on real agent workloads:

Workload Before After Savings
Code search (100 results) 17,765 1,408 92%
SRE incident debugging 65,694 5,118 92%
GitHub issue triage 54,174 14,761 73%
Codebase exploration 78,502 41,254 47%

Accuracy preserved on standard benchmarks:

Benchmark Category N Baseline Headroom Delta
GSM8K Math 100 0.870 0.870 ±0.000
TruthfulQA Factual 100 0.530 0.560 +0.030
SQuAD v2 QA 100 97% 19% compression
BFCL Tools 100 97% 32% compression

Reproduce: python -m headroom.evals suite --tier 1 · Full benchmarks & methodology

60B+ tokens saved — community leaderboard
60B+ tokens saved by the community — live leaderboard →

Agent compatibility matrix

Agent headroom wrap Notes
Claude Code --memory · --code-graph
Codex shares memory with Claude
Cursor prints config — paste once
Aider starts proxy + launches
Copilot CLI starts proxy + launches
OpenClaw installs as ContextEngine plugin

Any OpenAI-compatible client works via headroom proxy. MCP-native: headroom mcp install.

When to use · When to skip

Great fit if you…

  • run AI coding agents daily and want savings without changing your code
  • work across multiple agents and want shared memory
  • need reversible compression — originals always retrievable via CCR

Skip it if you…

  • only use a single provider's native compaction and don't need cross-agent memory
  • work in a sandboxed environment where local processes can't run
Integrations — drop Headroom into any stack
Your setup Hook in with
Any Python app compress(messages, model=…)
Any TypeScript app await compress(messages, { model })
Anthropic / OpenAI SDK withHeadroom(new Anthropic()) · withHeadroom(new OpenAI())
Vercel AI SDK wrapLanguageModel({ model, middleware: headroomMiddleware() })
LiteLLM litellm.callbacks = [HeadroomCallback()]
LangChain HeadroomChatModel(your_llm)
Agno HeadroomAgnoModel(your_model)
Strands Strands guide
ASGI apps app.add_middleware(CompressionMiddleware)
Multi-agent SharedContext().put / .get
MCP clients headroom mcp install
What's inside
  • SmartCrusher — universal JSON: arrays of dicts, nested objects, mixed types.
  • CodeCompressor — AST-aware for Python, JS, Go, Rust, Java, C++.
  • Kompress-base — our HuggingFace model, trained on agentic traces.
  • Image compression — 40–90% reduction via trained ML router.
  • CacheAligner — stabilizes prefixes so Anthropic/OpenAI KV caches actually hit.
  • IntelligentContext — score-based context fitting with learned importance.
  • CCR — reversible compression; LLM retrieves originals on demand.
  • Cross-agent memory — shared store, agent provenance, auto-dedup.
  • SharedContext — compressed context passing across multi-agent workflows.
  • headroom learn — plugin-based failure mining for Claude, Codex, Gemini.
Pipeline internals

Headroom exposes one stable request lifecycle across compress(), the SDK, and the proxy:

SetupPre-StartPost-StartInput ReceivedInput CachedInput RoutedInput CompressedInput RememberedPre-SendPost-SendResponse Received

  • Transforms do the work: CacheAligner, ContentRouter, SmartCrusher, CodeCompressor, Kompress-base, IntelligentContext / RollingWindow.
  • Pipeline extensions observe or customize lifecycle stages via on_pipeline_event(...).
  • Compression hooks sit alongside the canonical lifecycle as an additional extension seam.
  • Proxy extensions remain the server/app integration seam for ASGI middleware, routes, and startup policy.

Provider and tool-specific behavior lives under headroom/providers/ so core orchestration stays focused on lifecycle, sequencing, and policy.

  • CLI/tool slices: headroom/providers/claude, copilot, codex, openclaw
  • Provider runtime slices: headroom/providers/claude, gemini, plus shared backend/runtime dispatch in headroom/providers/registry.py
  • Core files stay orchestration-first: wrap.py, client.py, cli/proxy.py, and proxy/server.py delegate provider-specific env shaping, API target normalization, backend selection, and transport dispatch.

Install

pip install "headroom-ai[all]"          # Python, everything
npm install headroom-ai                 # TypeScript / Node
docker pull ghcr.io/chopratejas/headroom:latest

Granular extras: [proxy], [mcp], [ml] (Kompress-base), [agno], [langchain], [evals]. Requires Python 3.10+.

Using pipx? Choose a supported interpreter explicitly:

pipx install --python python3.13 "headroom-ai[all]"

Installation guide — Docker tags, persistent service, PowerShell, devcontainers.

headroom learn

headroom learn in action

headroom learn — mines failed sessions, writes corrections to CLAUDE.md / AGENTS.md / GEMINI.md.

Documentation

Start here Go deeper
Quickstart Architecture
Proxy How compression works
MCP tools CCR — reversible compression
Memory Cache optimization
Failure learning Benchmarks
Configuration Limitations

Compared to

Headroom runs locally, covers every content type, works with every major framework, and is reversible.

Scope Deploy Local Reversible
Headroom All context — tools, RAG, logs, files, history Proxy · library · middleware · MCP Yes Yes
RTK CLI command outputs CLI wrapper Yes No
lean-ctx CLI commands, MCP tools, editor rules CLI wrapper · MCP Yes No
Compresr, Token Co. Text sent to their API Hosted API call No No
OpenAI Compaction Conversation history Provider-native No No

Attribution. Headroom ships with the excellent RTK binary for shell-output rewriting — git show --short, scoped ls, summarized installers. Huge thanks to the RTK team; their tool is a first-class part of our stack, and Headroom compresses everything downstream of it. Headroom can also use lean-ctx as the selected CLI context tool; set HEADROOM_CONTEXT_TOOL=lean-ctx before running headroom wrap ....

Contributing

git clone https://github.com/chopratejas/headroom.git && cd headroom
pip install -e ".[dev]" && pytest

Devcontainers in .devcontainer/ (default + memory-stack with Qdrant & Neo4j). See CONTRIBUTING.md.

Community

License

Apache 2.0 — see LICENSE.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

headroom_ai-0.21.38.tar.gz (1.6 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

headroom_ai-0.21.38-cp313-cp313-manylinux_2_28_x86_64.whl (17.2 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.38-cp313-cp313-manylinux_2_28_aarch64.whl (18.1 MB view details)

Uploaded CPython 3.13manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.38-cp313-cp313-macosx_11_0_arm64.whl (16.0 MB view details)

Uploaded CPython 3.13macOS 11.0+ ARM64

headroom_ai-0.21.38-cp312-cp312-manylinux_2_28_x86_64.whl (17.2 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.38-cp312-cp312-manylinux_2_28_aarch64.whl (18.1 MB view details)

Uploaded CPython 3.12manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.38-cp312-cp312-macosx_11_0_arm64.whl (16.0 MB view details)

Uploaded CPython 3.12macOS 11.0+ ARM64

headroom_ai-0.21.38-cp311-cp311-manylinux_2_28_x86_64.whl (17.2 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.38-cp311-cp311-manylinux_2_28_aarch64.whl (18.1 MB view details)

Uploaded CPython 3.11manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.38-cp311-cp311-macosx_11_0_arm64.whl (16.0 MB view details)

Uploaded CPython 3.11macOS 11.0+ ARM64

headroom_ai-0.21.38-cp310-cp310-manylinux_2_28_x86_64.whl (17.2 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ x86-64

headroom_ai-0.21.38-cp310-cp310-manylinux_2_28_aarch64.whl (18.1 MB view details)

Uploaded CPython 3.10manylinux: glibc 2.28+ ARM64

headroom_ai-0.21.38-cp310-cp310-macosx_11_0_arm64.whl (16.0 MB view details)

Uploaded CPython 3.10macOS 11.0+ ARM64

File details

Details for the file headroom_ai-0.21.38.tar.gz.

File metadata

  • Download URL: headroom_ai-0.21.38.tar.gz
  • Upload date:
  • Size: 1.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for headroom_ai-0.21.38.tar.gz
Algorithm Hash digest
SHA256 6494bff7ffa212616f176eb3ea80273879560d2e9646fb098955cb927448e46a
MD5 799470b71acc04c043e1b05824c85064
BLAKE2b-256 ff8ec6b1124cda8343e2edeb5fab5a849645bba551b45db2200311fd7361cb8a

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38.tar.gz:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp313-cp313-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp313-cp313-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 162a42e8e5f51655712faadefee47cce177efd40c487c1a45d489335b3e9e78c
MD5 77197584f397764c6aaf017d57c2025c
BLAKE2b-256 92e95d3d909b4a98a88ea1a336747fbabb9e0a12cd75ba136f694b6c2dec1f24

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp313-cp313-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp313-cp313-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp313-cp313-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 b4fb7771c9693943f2eb987e25b37690fd3a7cc01b4c3c87b65af07b8887d61d
MD5 0cdd53996df1dd2a6d3cbaec2bccf927
BLAKE2b-256 9c2e403bbe72e1e13cf1e423403340c507093e221711755a9b8d159aeb2f3252

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp313-cp313-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp313-cp313-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 52d9f988c7702a47357a561281499249f64c6158cb27b6e4b581c429eec05f94
MD5 54785023668ad9ec223a9ccf463dd75a
BLAKE2b-256 9947ac49b9aed606d1ac38779cb572fd1892297b63a7844ed3a4910b453999d6

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp313-cp313-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp312-cp312-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp312-cp312-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 b313853aeefa3b5ee2abb4a8c96cdcca34aa2cca61b6ad4de60b459a8f77d728
MD5 e920931bf7cef389bbdb4caf1764b6a1
BLAKE2b-256 3634120619fa19a14ea0141a0dc58911684985a2e9483e636da84cac47ea1f1d

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp312-cp312-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp312-cp312-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp312-cp312-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 e3df48dad4c199ce6377f9e7c9e44fca6d023d625c0daf160739b00ae50f53c9
MD5 3005d5740e7276d9b9fc612a2969058a
BLAKE2b-256 7b52b43e12e34396e7992a5b97863200c6d84718c54e4beadab3b0bf9859319f

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp312-cp312-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp312-cp312-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp312-cp312-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 87d5a88b00ba962f4069f2cbba8aa5b71ebc2f531435db2884f8c62d334ab483
MD5 b8f6ab9aef4a76e74b99c8401c6fbc97
BLAKE2b-256 7358cbc0ba56c948e6c7f81f09768ef0404eeb1d92c5dcc8139752eb14906dc4

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp312-cp312-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp311-cp311-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp311-cp311-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 dc78828aa48b326237025434cb01cb7c51ccacff87e38af3fa53da2457082ac7
MD5 01e3fb2131f3a2bf2a1a1aeefd13e124
BLAKE2b-256 8004491a216dc7d7234bffa358f4e258c3103afa1de680cda33cfe2e727c70a1

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp311-cp311-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp311-cp311-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp311-cp311-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 ff9161e6d4e34f3c05defaedf3ccdc5cda22bfbf0dfa3e94766022c440ff0817
MD5 303714cdf3f47f0a36f853ab5c04dcba
BLAKE2b-256 65073a36a57119fada3c82a0b8ff0da6eb02d51b22fc3106c9adb7bf0567dbf3

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp311-cp311-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp311-cp311-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp311-cp311-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 78e021cea18e67d86d837da5884db157bfae2e81bebcaeef9ed1bc4506bf6783
MD5 11aaf10dcaad60aeb9d3b5f3b6296801
BLAKE2b-256 b2cf2e544dda0321c324d4dbf22cef3c6f780595cbda8a6ca01cea90dd898efe

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp311-cp311-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp310-cp310-manylinux_2_28_x86_64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp310-cp310-manylinux_2_28_x86_64.whl
Algorithm Hash digest
SHA256 8442e82723a0501cbd1232996d949799c3a1e23da2cc22efc668247acc0ce845
MD5 5af3623d75ffe3b2c778ac607de4bbf3
BLAKE2b-256 e07ef27e1008feb91d4f32cc8ec42907997c0cb3f9a2b96da5753112f938550d

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp310-cp310-manylinux_2_28_x86_64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp310-cp310-manylinux_2_28_aarch64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp310-cp310-manylinux_2_28_aarch64.whl
Algorithm Hash digest
SHA256 14e5c0a247f08850cab1b120df9ea2653cb1c667b2e3dc8509ab7ccfc00e8f50
MD5 1afd2a5aaab332ab5a7b6541e4602f44
BLAKE2b-256 861987168e21e550baf73a3bc73689e0939fc5eb3a7b7c7fc496c216365ff006

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp310-cp310-manylinux_2_28_aarch64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file headroom_ai-0.21.38-cp310-cp310-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for headroom_ai-0.21.38-cp310-cp310-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 b319e491e7fe6ca7813b34067c1f178e89e5286341b7e12d4ac6769e75965412
MD5 1e3f420b0598efc9c77019c32a766e2e
BLAKE2b-256 7cf0beaa3c9cdb70a67f0b7f7414c183e1283f3dcf76d5736bfb8839502de4d9

See more details on using hashes here.

Provenance

The following attestation bundles were made for headroom_ai-0.21.38-cp310-cp310-macosx_11_0_arm64.whl:

Publisher: release.yml on chopratejas/headroom

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page