Context Operating System for LLM Applications — local, zero-cost context intelligence

These details have not been verified by PyPI

Project links

Project description

ctxintel — Context Operating System for LLM Applications

ctxintel is a fully local, zero-cost context intelligence engine for LLM applications. Instead of blindly summarizing or truncating conversations, it runs a structured 6-stage deterministic pipeline that scores, extracts, remembers, compresses, and optimizes your context window — all without a single external AI API call.

🔒 Fully offline · 🎯 Deterministic & debuggable · 📊 Data-driven rules · ⚡ Drop-in for any LLM framework

Install

pip install ctxintel

# Optional: enable NER and dependency parsing
pip install spacy
python -m spacy download en_core_web_sm

Quick Start

from ctxintel import ContextIntel

sdk = ContextIntel(preset="coding_assistant")

messages = [
    {"role": "user",      "content": "Hi, I'm John. Building a REST API."},
    {"role": "assistant", "content": "What stack are you using?"},
    {"role": "user",      "content": "FastAPI and Python. Deploying on AWS."},
    {"role": "user",      "content": "Don't use synchronous code."},
    {"role": "user",      "content": "ok"},
    {"role": "user",      "content": "cool"},
    {"role": "user",      "content": "Now add JWT authentication."},
]

result = sdk.process(messages)
print(f"Tokens: {result.original_token_count} → {result.token_count}")
print(f"Memories: {result.extracted_memories}")
print(sdk.memory_summary())

How It Works

Raw Messages
     │
     ▼
┌─────────────────┐
│  1. RANKING     │  TF-IDF + recency + signal patterns + semantic uniqueness
└────────┬────────┘
         ▼
┌─────────────────┐
│  2. EXTRACTION  │  patterns.yaml rules + spaCy NER + dependency parsing
└────────┬────────┘
         ▼
┌─────────────────┐
│  3. MEMORY      │  Per-memory scoring + frequency tracking + JSON persistence
└────────┬────────┘
         ▼
┌─────────────────┐
│  4. COMPRESSION │  Extractive summarization via sumy LSA (zero AI calls)
└────────┬────────┘
         ▼
┌─────────────────┐
│  5. OPTIMIZATION│  Fit within token budget via tiktoken
└────────┬────────┘
         ▼
   Final Context
   (minimal tokens, maximum relevance)

Presets

Preset	Use Case	Token Budget	Threshold
`coding_assistant`	Code generation & programming	8,000	0.35
`customer_support`	Support bots & helpdesk	6,000	0.40
`ai_tutor`	Educational assistants	10,000	0.30
`agent_system`	Autonomous agents	12,000	0.45
`general`	General chat apps	8,000	0.40

from ctxintel import list_presets
print(list_presets())

Extending Patterns

The core innovation of ctxintel is its data-driven rule engine at ctxintel/data/patterns.yaml. You can add custom categories without changing any code:

# Add to the categories section in patterns.yaml
categories:
  database:
    patterns:
      - "using {X} database"
      - "store data in {X}"
      - "migrate to {X}"
    keywords:
      - postgresql
      - mysql
      - mongodb
      - redis
    priority: high

No code changes needed — ctxintel loads the YAML at runtime.

Why Not Just Summarize?

Approach	What You Lose
Truncation	Early context, user preferences, constraints
Blind summarization	Structured facts, decisions, priorities
ctxintel	Nothing important — it scores, extracts, and remembers

ctxintel doesn't just shorten your context. It understands what matters: user preferences are preserved, tasks are tracked, decisions are remembered, and filler is compressed — all deterministically.

Zero API Calls

⚠️ ctxintel makes zero external API calls. Ever.

No OpenAI. No Anthropic. No Cohere. No network requests. Everything runs locally using scikit-learn, sumy, spaCy, and tiktoken. Your conversations never leave your machine.

API Reference

`ContextIntel`

sdk = ContextIntel(
    preset="coding_assistant",  # or None for manual config
    preserve=["task", "constraint"],
    threshold=0.4,
    token_budget=8000,
    memory_path=".ctxintel_memory.json",
)

# Full pipeline
result = sdk.process(messages)

# Incremental workflow
sdk.add_message("user", "Hello!")
sdk.add_message("assistant", "Hi there!")
result = sdk.flush()

# Utilities
sdk.preview_compression(messages)
sdk.memory_summary()
sdk.reset_memory()
sdk.supported_categories()

`ContextResult`

result.messages             # Final optimized messages
result.memories             # Extracted memories
result.token_count          # Final token count
result.original_token_count # Original token count
result.compression_ratio    # Reduction ratio (0.0–1.0)
result.extracted_memories   # Number of memories found

Roadmap

Sentence-transformers reranker (opt-in)
CLI: ctxintel compress chat.json --budget 4000
Streaming message support
Custom patterns.yaml path support
LangChain / LlamaIndex integration wrappers
Multi-conversation memory aggregation

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

Jun 3, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ctxintel-0.1.0.tar.gz (24.7 kB view details)

Uploaded Jun 3, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

ctxintel-0.1.0-py3-none-any.whl (21.8 kB view details)

Uploaded Jun 3, 2026 Python 3

File details

Details for the file ctxintel-0.1.0.tar.gz.

File metadata

Download URL: ctxintel-0.1.0.tar.gz
Upload date: Jun 3, 2026
Size: 24.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.7

File hashes

Hashes for ctxintel-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`62e215d8f9b7f0010935f0941948f5e762b4ed4bdce10537fa519fa19f45a8a1`
MD5	`9b68dc10b37158a8df76a3ec4ceafdc4`
BLAKE2b-256	`d7d3ba1424de53531340e1d8ae96c5d3e3f95edd07a595c9a4c46c4ff562115f`

See more details on using hashes here.

File details

Details for the file ctxintel-0.1.0-py3-none-any.whl.

File metadata

Download URL: ctxintel-0.1.0-py3-none-any.whl
Upload date: Jun 3, 2026
Size: 21.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.11.7

File hashes

Hashes for ctxintel-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`321e9a46a531c583e9d401ce666520f2bf296a32f7c6bc18958c71c82a95e645`
MD5	`62450b687831e329d71cd48184459299`
BLAKE2b-256	`db1c2166ee2f9c478825262fc43a9cb0714de6b69ba52c51ca0123f69c641675`

See more details on using hashes here.

ctxintel 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

ctxintel — Context Operating System for LLM Applications

Install

Quick Start

How It Works

Presets

Extending Patterns

Why Not Just Summarize?

Zero API Calls

API Reference

`ContextIntel`

`ContextResult`

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes