Context orchestration primitives for LLM applications.

These details have not been verified by PyPI

Project links

Project description

kontxt

Context engineering for production AI systems

Most AI projects fail not because of bad models, but because of bad context. kontxt solves this.

The Problem

85% of AI projects fail to deliver on their promises (Gartner). The root cause isn't model quality—it's context engineering:

❌ Context overload - Passing too much irrelevant data → hallucinations, high costs
❌ Missing context - Omitting critical information → poor decisions
❌ No memory - Long conversations overflow context windows → agent amnesia
❌ Poor formatting - Unstructured data → model confusion
❌ No observability - Can't debug production failures

"Context engineering is the new skill in AI. It is about providing the right information and tools, in the right format, at the right time." — Philipp Schmid

The Solution

kontxt is a lightweight library that gives you production-grade context control:

🎯 Budget control - Set token limits, auto-trim intelligently
🧠 Memory primitives - Scratchpads, vector stores, persistence
🔄 Multi-phase flows - Coordinate complex agent workflows
📊 Token observability - Track usage, debug context issues
🔌 Vendor-agnostic - Works with OpenAI, Anthropic, Gemini, or any LLM
🔒 Type-safe - Full type hints, IDE autocomplete, zero magic

Key Features

Context composition with ordered sections, lazy evaluation, and multiple render formats (OpenAI, Anthropic, Gemini)
Memory primitives including scratchpads, vector stores, and configurable backends
Phase templates for multi-stage flows with transition validation
Token budgeting with automatic trimming and priority management
State management for session tracking and workflow coordination
Production-ready with comprehensive tests and typed APIs

Installation

⚠️ Alpha Release: This is an alpha version (0.1.0a1) for early testing. APIs may change before the stable 0.1.0 release.

pip install kontxt

Or install from source:

uv pip install -e .

Development tooling:

uv pip install -e '.[dev]'

Quick Start

from kontxt import Context, SystemPrompt

context = Context()

# Type-safe section references (recommended)
context.add(SystemPrompt, "You are a dental triage assistant.")

# Or use strings (also works)
context.add("instructions", "Answer using the provided chart.")
context.add("patient", {"name": "Alex", "age": 41})
context.add("messages", {"role": "user", "content": "My tooth aches."})

prompt = context.render()
# -> XML-style prompt that preserves section boundaries

Memory Integration

from kontxt import Memory

memory = Memory()
memory.store("patient:123", {"allergy": "penicillin"}, meta={"patient_id": "123"})
memory.scratchpad.write("plan", ["Collect symptoms", "Check red flags"])

plan = memory.scratchpad.read("plan")
allergies = memory.retrieve("penicillin", filters={"patient_id": "123"})

Gemini Integration

from kontxt import Context, Memory, SystemPrompt, Format

# Create context with memory
memory = Memory()
ctx = Context(memory=memory)

# Type-safe section references
ctx.add(SystemPrompt, "You are a helpful AI assistant")

# Convenient helper for user messages
ctx.add_user_message("Explain quantum computing")

# Render for Gemini
payload = ctx.render(
    format=Format.GEMINI,  # Type-safe enum with IDE autocomplete
    generation_config={"temperature": 0.7}
)

# Call Gemini API (you control the API call)
from google import genai
client = genai.Client(api_key="...")
response = client.models.generate_content(model="gemini-2.0-flash-exp", **payload)

# Add response back to context
ctx.add_response(response.text)

Multi-Phase Workflows with State

from enum import Enum
from kontxt import Context, State, SystemPrompt, ChatMessages, Format

# Define workflow phases
class Phases(str, Enum):
    INTAKE = "intake"
    ASSESSMENT = "assessment"
    COMPLETE = "complete"

# Initialize state with phase validation
state = State(
    initial={"session": {"phase": "intake"}},
    phases=Phases  # Validates phase values at runtime
)

# Initialize context with state
ctx = Context(state=state)
ctx.add(SystemPrompt, "You are a medical triage assistant")

# Configure phases with type-safe section references
ctx.phase(Phases.INTAKE).configure(
    instructions="Gather patient information",
    includes=[SystemPrompt, ChatMessages],  # Type-safe!
    transitions_to=["assessment"],  # Only assessment allowed from intake
    max_history=10
)

ctx.phase(Phases.ASSESSMENT).configure(
    instructions="Assess patient condition",
    includes=[SystemPrompt, ChatMessages],
    transitions_to=["complete"],
    max_history=5
)

# Use in workflow
ctx.add_user_message("I have a headache")
payload = ctx.render(phase=state.phase(), format=Format.GEMINI)

# ... call LLM, get response ...

ctx.add_response(response.text)

# Advance phase with validation
ctx.advance_phase(Phases.ASSESSMENT)  # ✅ Validates transition is allowed

Import Patterns

# ✅ Recommended: Import from kontxt
from kontxt import Context, Memory, State, Format, SystemPrompt, ChatMessages

# ✅ Or explicit from types (for organization)
from kontxt import Context, Memory, State
from kontxt.types import Format, SystemPrompt, ChatMessages

Available Render Formats

Format.TEXT       # Plain text with XML-like tags
Format.OPENAI     # OpenAI chat completion format
Format.ANTHROPIC  # Anthropic messages API format
Format.GEMINI     # Google Gemini API format

Built-in Section Types

from kontxt import SystemPrompt, ChatMessages, Instructions, Tools

# Use for type safety and IDE autocomplete
ctx.add(SystemPrompt, "You are helpful")
ctx.add(ChatMessages, {"role": "user", "content": "Hello"})

# Or create custom section types
from kontxt import SectionType
PatientData = SectionType("patient")
ctx.add(PatientData, {"name": "John", "age": 30})

See examples/ for complete examples:

simple_rag.py - Basic RAG workflow
multi_phase_workflow.py - Multi-phase workflow with state management

Why kontxt vs LangChain/LlamaIndex?

Most frameworks abstract the wrong things.

They abstract the LLM (doesn't matter—all models work similarly). They don't abstract context (matters most—it's complex and error-prone).

kontxt inverts this:

✅ LLM is your responsibility (use any vendor, local models, whatever)
✅ Context is our responsibility (we make it production-grade)

Feature	kontxt	LangChain	LlamaIndex
Learning curve	5 minutes	Hours	Hours
Dependencies	2 (pydantic, tiktoken)	20+	15+
Token budgets	✅ Built-in	❌ Manual	❌ Manual
Multi-phase flows	✅ Native	⚠️ Custom	⚠️ Custom
Memory operations	✅ 4 primitives	⚠️ Complex	⚠️ Complex
Vendor lock-in	❌ None	⚠️ High	⚠️ High
Type safety	✅ Full	⚠️ Partial	⚠️ Partial

TL;DR: We do one thing (context engineering) and do it perfectly. They try to do everything, and context becomes an afterthought.

Built for Production

kontxt is built on research-backed context engineering principles:

The Four Operations (Lance Martin, 2025)

WRITE - Externalize context beyond the window
SELECT - Retrieve relevant context intelligently
COMPRESS - Reduce tokens while preserving signal
ISOLATE - Partition context for clarity

mem.scratchpad.write("plan", data)           # WRITE
notes = mem.retrieve("plan", filters={...})  # SELECT
ctx.set_budget(max_tokens=4000, priority=[]) # COMPRESS
sub = ctx.fork(include=["system"])           # ISOLATE

Why This Matters

Research shows:

Context position matters: LLMs exhibit attention bias—details in the middle get lost
More ≠ better: A model given 46 tools fails; given 19 tools succeeds (same context window)
Format matters: How you structure data affects model performance as much as what data you include

kontxt handles these nuances so you don't have to.

Who This Is For

Choose kontxt if you're building:

🏥 Multi-phase agents (medical triage, customer support, legal analysis)
💬 Long conversations (therapy bots, tutoring, extended troubleshooting)
💰 Cost-sensitive systems (token budgets matter, can't blow $500 on one session)
🔍 Observable AI (need to debug why agents fail in production)
🔌 Vendor-agnostic apps (might switch from GPT-4 to Claude to Gemini)

If your AI needs to work in production, not just demos, use kontxt.

Documentation

Documentation scaffolding lives under docs/. We plan to publish the first version once the API stabilises. Contributions are welcome—open an issue if you spot gaps or inconsistencies.

Development

uv sync
uv run pytest
uv run ruff check .

See CONTRIBUTING for detailed guidance.

Roadmap

Additional storage backends (Qdrant, Pinecone, etc.)
Built-in compression helpers powered by user-supplied LLMs
Observability hooks for prompt debugging and token telemetry
Async APIs once ergonomics questions are resolved

License

Licensed under the Apache 2.0 License. See LICENSE for details.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.0a8 pre-release

Dec 4, 2025

0.1.0a7 pre-release

Dec 4, 2025

0.1.0a6 pre-release

Nov 15, 2025

0.1.0a5 pre-release

Nov 15, 2025

0.1.0a4 pre-release

Nov 14, 2025

This version

0.1.0a3 pre-release

Nov 12, 2025

0.1.0a2 pre-release

Nov 12, 2025

0.1.0a1 pre-release

Nov 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kontxt-0.1.0a3.tar.gz (23.3 kB view details)

Uploaded Nov 12, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kontxt-0.1.0a3-py3-none-any.whl (24.4 kB view details)

Uploaded Nov 12, 2025 Python 3

File details

Details for the file kontxt-0.1.0a3.tar.gz.

File metadata

Download URL: kontxt-0.1.0a3.tar.gz
Upload date: Nov 12, 2025
Size: 23.3 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kontxt-0.1.0a3.tar.gz
Algorithm	Hash digest
SHA256	`81f342c0936673be4d0eda62fdf5fad2652218881664e9a527454b81bcb46bce`
MD5	`55d9c243c6ebf86ecc78399908283a7c`
BLAKE2b-256	`bf1e3fd8074e3ec3a981ae28f7801f71bae5e4d8e9b86c0218b9f1a81624c7dc`

See more details on using hashes here.

Provenance

The following attestation bundles were made for kontxt-0.1.0a3.tar.gz:

Publisher: publish.yml on raiselab-ai/kontxt

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kontxt-0.1.0a3.tar.gz
- Subject digest: 81f342c0936673be4d0eda62fdf5fad2652218881664e9a527454b81bcb46bce
- Sigstore transparency entry: 696499135
- Sigstore integration time: Nov 12, 2025
Source repository:
- Permalink: raiselab-ai/kontxt@2c014d6ab62f25fbabf808d3b2c49025e56b9b83
- Branch / Tag: refs/tags/v0.1.0a3
- Owner: https://github.com/raiselab-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@2c014d6ab62f25fbabf808d3b2c49025e56b9b83
- Trigger Event: release

File details

Details for the file kontxt-0.1.0a3-py3-none-any.whl.

File metadata

Download URL: kontxt-0.1.0a3-py3-none-any.whl
Upload date: Nov 12, 2025
Size: 24.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kontxt-0.1.0a3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`cd247789e1540fda712a40d7da01a36599fbf97bf3218ab9e8832e5eeac0b0ae`
MD5	`4d0ee7ba871586fa4158653998e0784a`
BLAKE2b-256	`17893682200b447fa759b82cea57147b610a8983a2746f35c912493337c67155`

See more details on using hashes here.

Provenance

The following attestation bundles were made for kontxt-0.1.0a3-py3-none-any.whl:

Publisher: publish.yml on raiselab-ai/kontxt

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: kontxt-0.1.0a3-py3-none-any.whl
- Subject digest: cd247789e1540fda712a40d7da01a36599fbf97bf3218ab9e8832e5eeac0b0ae
- Sigstore transparency entry: 696499167
- Sigstore integration time: Nov 12, 2025
Source repository:
- Permalink: raiselab-ai/kontxt@2c014d6ab62f25fbabf808d3b2c49025e56b9b83
- Branch / Tag: refs/tags/v0.1.0a3
- Owner: https://github.com/raiselab-ai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@2c014d6ab62f25fbabf808d3b2c49025e56b9b83
- Trigger Event: release

kontxt 0.1.0a3

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

kontxt

The Problem

The Solution

Key Features

Installation

Quick Start

Memory Integration

Gemini Integration

Multi-Phase Workflows with State

Import Patterns

Available Render Formats

Built-in Section Types

Why kontxt vs LangChain/LlamaIndex?

Built for Production

The Four Operations (Lance Martin, 2025)

Why This Matters

Who This Is For

Documentation

Development

Roadmap

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance