Coordinate index layer for LLM context — Helix weighs, doesn't retrieve

These details have not been verified by PyPI

Project links

Project description

Helix Context

A context-index engine for LLM agents. Helix retrieves, weighs, and compresses your codebase into a context window — without a single LLM call on the retrieval path.

28.7× token savings on production workloads · GPQA diamond +4 pp accuracy with context on · 5.4× median across 15 query types

Benchmarks

⚙️ Hardware: Ryzen 7 5800x · 48 GB DDR4 · RTX 3080 Ti 12 GB VRAM · 2× 1 TB NVMe · open case, reactive fan curves · model: gemma4:e4b (Ollama) · genome: 18,547 genes

Query	Type	Helix tokens	RAG baseline	Savings
"How does helix handle WAL checkpoints?"	mechanism-internal	279	8,000	28.7×
"What does the access-rate tiebreaker do?"	operational rule	394	8,000	20.3×
"What port does the helix proxy listen on?"	point-fact lookup	399	8,000	20.1×
"What is the role of the harmonic_links table?"	data structure purpose	753	8,000	10.6×
"What does path_key_index store?"	data structure purpose	1,023	8,000	7.8×
"How does the density gate work?"	conceptual system	2,971	8,000	2.7×

RAG baseline: top-5 chunks × 1,500 tokens + 500 overhead = 8,000 tokens/query (Pinecone/LangChain defaults). Full bench: benchmarks/bench_rag_vs_sike_tokens.py — rerun yourself.

GPQA diamond accuracy — gemma4:e4b, N=100: OFF baseline: 22% → Helix ON: 26% (+4 pp) · Source: benchmarks/bench_aa_suite.py

Pipeline

%%{init: {"theme": "dark"}}%%
flowchart TD
    classDef cpu   fill:#1e3a5f,stroke:#3b82f6,color:#93c5fd
    classDef llm   fill:#2d1b69,stroke:#7c3aed,color:#c4b5fd
    classDef store fill:#14532d,stroke:#16a34a,color:#86efac
    classDef gate  fill:#431407,stroke:#c2410c,color:#fdba74
    classDef surf  fill:#0c4a6e,stroke:#0284c7,color:#7dd3fc

    subgraph INGEST["⬛  INGEST  —  LLM-free"]
        direction LR
        SRC["📁 Source\ngit · S3 · local\nfiles / conversations"]:::cpu
        TAG["🏷️ CpuTagger\nspaCy NER · key-value\nchromosome codons\nintent classification"]:::cpu
        GATE["🔬 Density Gate\nchromatin tier\nOPEN → EUCHROMATIN → HETERO"]:::gate
        DB[("🧬 Genome\nSQLite · 18.5 K genes\npromoter index · FTS5\nentity graph · harmonic links")]:::store
        SRC --> TAG --> GATE --> DB
    end

    subgraph RETRIEVAL["⬛  RETRIEVAL  —  LLM-free · 9-tier fusion scorer"]
        direction LR
        QRY["❓ Query\n/context  /context/packet"]:::cpu
        CLF["🎯 Classifier\nfactual · procedural\nmulti_hop · default\nassembly cap 2–8 genes"]:::cpu
        BM["🔍 BM25 pre-filter\nFTS5 top-200\nIDF-discriminated pool"]:::cpu
        TIERS["📊 9-Tier Fusion\n① PKI path-key IDF\n② filename anchor\n③ exact promoter tag\n④ prefix tag\n⑤ FTS5 BM25\n⑥ SPLADE sparse\n⑦ ΣĒMA 20-dim cosine\n⑧ harmonic co-activation\n⑨ SR future-occupancy"]:::cpu
        CONF{"📐 Confidence tier\nratio = top ÷ mean"}:::gate
        T3["TIGHT · 3 genes\nratio ≥ 3.0"]:::cpu
        F6["FOCUSED · 6 genes\nratio ≥ 1.8"]:::cpu
        B12["BROAD · 12 genes\nweak signal"]:::cpu
        QRY --> CLF --> BM --> TIERS --> CONF
        CONF --> T3 & F6 & B12
    end

    subgraph EXPRESSION["⬛  EXPRESSION  —  LLM-free"]
        direction LR
        SPL["✂️ Splice\nKompress 5× avg\nfoveated rank-proportional\nreorder: lost-in-middle fix"]:::cpu
        ASM["📦 Assemble\ndecoder prompt +\n&lt;GENE&gt; blocks +\nanswer slate"]:::cpu
        SPL --> ASM
    end

    subgraph OUT["OUTPUT SURFACES"]
        direction LR
        CTX["📄 /context\nexpressed_context\nContextHealth metadata"]:::surf
        PKT["📬 /context/packet\ngene_id + source_id\nverdict: verified · stale_risk\n· needs_refresh + refresh_targets"]:::surf
        ASM --> CTX
        ASM --> PKT
    end

    subgraph LLM["⚡  SINGLE LLM CALL"]
        ANS["🤖 /v1/chat/completions\nClaude · Gemini · local"]:::llm
        CTX --> ANS
    end

    DB --> RETRIEVAL
    T3 & F6 & B12 --> EXPRESSION

Dark-shipped features (entity graph Tier 5b, sub-query decomposition, BGE-M3 ANN) are omitted. See docs/architecture/DIMENSIONS.md for the full dimension inventory.

▶ Terminal walkthrough — launcher startup + first query

Part 1 — Launch

$ helix-launcher

[00:00.1]  helix-launcher v0.5.0
[00:00.2]  config: helix.toml
[00:00.3]  genome: genomes/main/genome.db  (18,547 genes · 5.0× compression)
[00:00.8]  starting helix server on :11437 ...
[00:01.4]  ✓  helix server      http://127.0.0.1:11437
[00:01.6]  starting observability stack ...
[00:02.1]  ✓  otel collector    :4317
[00:02.4]  ✓  prometheus        :9090
[00:02.9]  ✓  loki              :3100
[00:03.3]  ✓  grafana           :3000  →  http://localhost:3000
[00:03.4]  tray icon ready — right-click for controls

Part 2 — First retrieval query

curl -s http://127.0.0.1:11437/context \
  -H "Content-Type: application/json" \
  -d '{"query": "what port does helix use"}' \
  | python -m json.tool

{
  "expressed_context": "<GENE src=\"helix.toml\" facts=\"port=11437\">\nThe helix proxy server listens on port 11437.\n</GENE>\n...",
  "genes_expressed": 5,
  "budget_tier": "focused",
  "context_health": {
    "retrieval_rate": 1.0,
    "top_score": 8.3,
    "score_ratio": 4.1
  }
}

Token cost: 399 tokens delivered to the LLM vs 8,000 for a naive RAG top-5 pass — 20.1× savings.

Quick Start

# 1 — Install
pip install "helix-context[all]"

# 2 — Launch  (Windows · Linux/macOS: use helix-launcher)
start-helix-tray.bat
helix-status            # confirm :11437 is responding

# 3 — Seed your project
helix ingest ./my-project

# 4 — Test retrieval
curl http://localhost:11437/context \
  -H "Content-Type: application/json" \
  -d '{"query": "what is the main entry point?"}'

Native observability (default)

Canonical path: the tray (start-helix-tray.bat) manages the native OpenTelemetry binaries in tools/native-otel/ automatically. A balloon notification confirms the sidecar is running. To opt out: HELIX_OBSERVABILITY=0 start-helix-tray.bat.

Advanced — Docker stack: if you prefer a full Docker-compose observability stack (Prometheus, Tempo, Loki, Grafana), see deploy/otel/README.md.

MCP setup (Claude Code / Cursor / Continue)

Add to ~/.claude/settings.json (or your IDE's MCP config):

{
  "mcpServers": {
    "helix-context": {
      "command": "python",
      "args": ["-m", "helix_context.mcp_server"],
      "cwd": "/absolute/path/to/your/project",
      "env": { "HELIX_MCP_URL": "http://127.0.0.1:11437" }
    }
  }
}

OpenAI-compatible proxy (zero code changes)

ANTHROPIC_BASE_URL=http://localhost:11437 claude
OPENAI_BASE_URL=http://localhost:11437/v1 your-app

How It Works

The entire retrieval and weighing path is LLM-free — spaCy NER, Howard 2005 TCM, Stachenfeld SR, Werman W1, Hebbian co-activation. Pure CPU math from ingest to expressed context. The only LLM call in the whole system is at /v1/chat/completions. This matters for latency (sub-second retrieval), cost (no token spend on the retrieval path), and determinism.

Two surfaces for two caller types:

	`/context`	`/context/packet`
Returns	Assembled compressed window	Pointer + verdict + refresh plan
LLM reads?	Directly	No — agent fetches if needed
Verdict emitted?	Via `ContextHealth`	First-class: `verified / stale_risk / needs_refresh`
Use for	Chat clients, Continue	MCP agents, tool use, programmatic decisions

→ PIPELINE_LANES.md · DIMENSIONS.md · Agentome paper

Configuration

Genome path

Set path in [genome] to move the database to any drive or directory:

[genome]
path = "genomes/main/genome.db"   # relative to helix run directory
# Put this on your fastest NVMe for best ingest throughput
# Example: path = "D:/helix/genome.db"

Running multiple projects

One helix instance per genome — each reads its own helix.toml. Use the helix_context.hgt Python API to share genes across instances (Horizontal Gene Transfer).

Backup

SQLite WAL mode makes it safe to copy the .db file while helix is running:

# cron / Linux
cp genomes/main/genome.db backups/genome-$(date +%Y%m%d).db

# PowerShell / Windows
Copy-Item genomes\main\genome.db backups\genome-$(Get-Date -Format yyyyMMdd).db

A built-in backup manager with configurable paths and interval is on the roadmap.

DAL — source content fetching

/context/packet returns source_id pointers. Callers resolve them to bytes via the DAL:

from helix_context.adapters.dal import DAL

dal = DAL()                              # file + HTTP built-in
dal.register("s3", my_s3_fetcher)       # register additional schemes
text, meta = dal.fetch("s3://bucket/schema.json")

API Reference

Endpoint	Description
`POST /context`	Retrieve and assemble compressed context
`POST /context/packet`	Retrieve pointer + verdict (agent-safe)
`POST /ingest`	Add a document or exchange to the genome
`GET /stats`	Genome size, compression ratio, tier metrics
`GET /fingerprint`	Navigation-first retrieval (scores + metadata)
`POST /v1/chat/completions`	OpenAI-compatible proxy with automatic context injection

→ Full endpoint reference: docs/api/endpoints.md → MCP tool schemas: docs/api/mcp-tools.md

Architecture

Doc	What it covers
PIPELINE_LANES.md	Swim-lane reference: ingest, context, packet, fingerprint flows
DIMENSIONS.md	The 9 retrieval dimensions — schema, data, bench status
LAUNCHER.md	Supervisor, tray, observability stack lifecycle
SESSION_REGISTRY.md	Multi-agent session + party isolation
OBSERVABILITY.md	Prometheus metrics, Grafana dashboards, alert rules
KNOWLEDGE_GRAPH.md	Entity graph, harmonic links, co-activation

Acknowledgments

Built on: spaCy NER · Howard 2005 TCM · Stachenfeld 2017 SR · SQLite FTS5 BM25 · Kompress compression

License

Apache 2.0 — see LICENSE.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.6.2

May 30, 2026

0.6.1

May 30, 2026

0.6.0

May 28, 2026

This version

0.5.0

May 9, 2026

0.4.0b1 pre-release

Apr 18, 2026

0.3.0b5 pre-release

Apr 10, 2026

0.3.0b3 pre-release

Apr 10, 2026

0.3.0b2 pre-release

Apr 10, 2026

0.3.0b1 pre-release

Apr 10, 2026

0.2.0b2 pre-release

Apr 10, 2026

0.2.0b1 pre-release

Apr 10, 2026

0.1.0b2 pre-release

Apr 10, 2026

0.1.0b1 pre-release

Apr 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

helix_context-0.5.0.tar.gz (2.7 MB view details)

Uploaded May 9, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

helix_context-0.5.0-py3-none-any.whl (463.2 kB view details)

Uploaded May 9, 2026 Python 3

File details

Details for the file helix_context-0.5.0.tar.gz.

File metadata

Download URL: helix_context-0.5.0.tar.gz
Upload date: May 9, 2026
Size: 2.7 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for helix_context-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`6f8c6f8f153717def2f05a88facc01943b0dd7c6a8df617f9ba7effe9736be8b`
MD5	`c380f5613854a2c376b3839a887df91b`
BLAKE2b-256	`345a564d117ab827fb072a0b62da45302cd3e21fb97530c65e193084e5217d00`

See more details on using hashes here.

File details

Details for the file helix_context-0.5.0-py3-none-any.whl.

File metadata

Download URL: helix_context-0.5.0-py3-none-any.whl
Upload date: May 9, 2026
Size: 463.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for helix_context-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7d4596d2132bff48c3e1351952a46588952d39b2f1f6bd90914af4fb3ba741a4`
MD5	`4ef152c25eb1199b73570fbfde1d187a`
BLAKE2b-256	`8589a80caab0d2d7d1bc095c4e8a22c57ce4d2003432a62eeb52239dcb802d8d`

See more details on using hashes here.

helix-context 0.5.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Helix Context

Benchmarks

Pipeline

Quick Start

Native observability (default)

MCP setup (Claude Code / Cursor / Continue)

OpenAI-compatible proxy (zero code changes)

How It Works

Configuration

Genome path

Running multiple projects

Backup

DAL — source content fetching

API Reference

Architecture

Acknowledgments

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes