Kytchen RLM - Recursive Language Models with BYOLLM pantry storage + sandboxed prep + sauce (audit trail)

These details have not been verified by PyPI

Project links

Project description

Kytchen

Too many cooks in the kitchen. Yes chef.

BYOLLM MCP servers for recursive reasoning over documents. Kytchen is your organized prep space: pantry (storage) + prep (tools) + sauce (audit trail), so your LLM can cook the final answer.

Quick Start

pip install 'kytchen[mcp]'
kytchen-rlm install        # auto-detects Claude Desktop, Cursor, Windsurf, VS Code
kytchen-rlm doctor         # verify installation

What This Repo Contains

Kytchen is a monorepo that includes:

kytchen/: the core Python package
- RLM core (recursive reasoning loop)
- Sandboxed REPL and built-in analysis helpers
- MCP servers (kytchen-mcp, kytchen-local, kytchen-mcp-cloud)
- Cloud API (FastAPI) for hosted datasets + runs + streaming
- Exporters ("sauce" / audit trail → Markdown/PDF, etc.)
kytchen-sdk/: a separate Python SDK package (kytchen_sdk) for the Cloud API
kytchen-web/: the Next.js dashboard + TypeScript client (UI for datasets/runs/streaming)
docs/: product + architecture docs

If you’re handing this to another AI for advice, the key is: Kytchen is BYOLLM. It provides context infrastructure + tools + audit trail; it does not provide model inference.

High-Level Architecture

There are three related but distinct ways to use Kytchen:

Local MCP server (kytchen-local)
- No DB required
- Stores context in a sandboxed Python REPL session and exposes tools via MCP
Cloud API (kytchen.api.app)
- FastAPI backend for datasets/runs/tool sessions
- Supports streaming query progress (SSE)
- Intended to back the web dashboard + SDKs
SDKs / Web UI
- kytchen-sdk/ (Python) and kytchen-web/ (Next.js + TS client)
- Call the Cloud API

Conceptually:

┌──────────────────────────────────────────────────────────┐
│ Your LLM client (Claude/Cursor/Windsurf/...)             │
│ - Has the model subscription / provider key (BYOLLM)     │
└──────────────────────────────────────────────────────────┘
                       │
                       ▼
┌──────────────────────────────────────────────────────────┐
│ Kytchen (tools + context + evidence)                     │
│ - Pantry: datasets/context stored once                   │
│ - Prep: tools to search/peek/compute                     │
│ - Sauce: citations/evidence + metrics + trajectory       │
└──────────────────────────────────────────────────────────┘

Repository Layout

.
├── kytchen/                      # core python package
│   ├── core.py                   # recursive loop (RLM runner)
│   ├── repl/                      # sandboxed REPL + helpers + citations
│   ├── mcp/                       # MCP servers (cloud + local)
│   ├── api/                       # FastAPI backend (Cloud)
│   ├── exporters/                 # sauce/audit exports
│   └── converters/                # pdf/docx/xlsx → text
├── kytchen-sdk/                  # Python SDK (httpx client)
├── kytchen-web/                  # Next.js dashboard + TS client
└── docs/                         # product + architecture docs

File Converters (PDF / DOCX / XLSX)

Install optional converter dependencies:

pip install 'kytchen[mcp,converters]'

Then use these MCP tools (in kytchen-local) to load large documents without context stuffing:

convert_pdf(file_path="...", pages="1-5", context_id="doc")
convert_docx(file_path="...", context_id="doc")
convert_xlsx(file_path="...", formulas="evaluated", context_id="sheet")

Optional OCR:

pip install 'kytchen[ocr]'

Manual configuration

Add to your MCP client config:

{
  "mcpServers": {
    "kytchen": {
      "command": "kytchen",
      "env": { "KYTCHEN_API_KEY": "kyt_sk_..." }
    },
    "kytchen-local": {
      "command": "kytchen-local"
    }
  }
}

How It Works

┌──────────────────────────────────────────────────────────────────┐
│  CONTEXT  →  stored once as `ctx`                                │
└──────────────────────────────────────────────────────────────────┘
                                 │
                                 ▼
┌──────────────────────────────────────────────────────────────────┐
│  🔧 80+ TOOLS                                                    │
├──────────────────────────────────────────────────────────────────┤
│  extract_*  │ emails, IPs, money, dates, URLs, functions, TODOs │
│  grep/head  │ filter lines, sort, uniq, columns                 │
│  search     │ regex with context, contains, find_all            │
│  stats      │ word_count, frequency, ngrams, diff               │
│  transform  │ replace, split, before/after, normalize           │
│  validate   │ is_email, is_url, is_json, is_ip                  │
│  convert    │ to_json, to_snake_case, slugify                   │
└──────────────────────────────────────────────────────────────────┘
                                 │
                                 ▼
┌──────────────────────────────────────────────────────────────────┐
│  📋 EVIDENCE  →  cite() accumulates provenance with line numbers │
└──────────────────────────────────────────────────────────────────┘
                                 │
                        ┌────────┴────────┐
                        ▼                 ▼
                    Continue          Finalize
                  (loop back)    (answer + citations)

The model sees metadata, not full text. It writes Python to explore iteratively. Sauce (evidence) auto-accumulates.

Kytchen Cloud API (FastAPI)

The Cloud API lives in kytchen/api/app.py.

Core endpoints (high signal):

GET /healthz: health check
GET /v1/datasets: list datasets for workspace (workspace comes from API key)
POST /v1/datasets: upload dataset (multipart/form-data)
POST /v1/query: run a query and return a full QueryResult
POST /v1/query/stream: SSE streaming query progress ("Glass Kitchen")
POST /v1/tool_call: tool execution endpoint (powers the sandbox + citations)

BYOLLM detail: /v1/query and /v1/query/stream accept a provider and provider_api_key so the user can bring their own Anthropic/OpenAI key. Kytchen is not an inference service.

QueryResult shape

The backend returns:

id: run id
answer: string
evidence: list of citations (snippet + optional line_range + note)
metrics: token/cost accounting (baseline estimate + actual)

Streaming (`/v1/query/stream`)

This endpoint is text/event-stream and emits JSON payloads in the form:

{"type": "started", ...}
{"type": "step", ...} (trajectory updates)
{"type": "completed", ...} or {"type": "error", ...}

If you’re integrating a UI, the step events are what power real-time progress rendering.

Python SDK (Cloud)

The Python SDK lives in kytchen-sdk/ and is intended for calling the Cloud API from Python apps.

Entry point: kytchen_sdk.client.KytchenClient
Dataset operations: client.datasets.*
Query:
- await client.query(...) → QueryResult
- async for event in client.query_stream(...) → RunEvent stream (SSE)

See kytchen-sdk/README.md and kytchen-sdk/examples/basic.py.

Web Dashboard

The Next.js app is in kytchen-web/.

It is meant to:

manage workspaces/datasets
visualize query progress via streaming ("open kitchen")
show metrics + evidence ("sauce")

See kytchen-web/README.md for frontend-local setup.

Example

You: Load this contract and find all liability exclusions

[AI calls load_context, search_context, cite(), evaluate_progress, finalize]

AI: Found 3 liability exclusions:
    1. Section 4.2: Consequential damages excluded (lines 142-158)
    2. Section 7.1: Force majeure carve-out (lines 289-301)
    3. Section 9.3: Cap at contract value (lines 445-452)

    Sauce: [4 citations with line ranges]

When to Use

Use Kytchen	Skip Kytchen
Long documents (>10 pages)	Short docs (<30k tokens)
Need regex search	Simple lookups
Need computation on extracted data	Latency-critical apps
Want citations with line numbers
Iterative analysis across turns

MCP Tools Reference

Tool	Purpose
`load_context`	Store document in sandboxed REPL as `ctx`
`peek_context`	View character or line ranges
`search_context`	Regex search with evidence logging
`exec_python`	Run code against context (includes `cite()` helper)
`chunk_context`	Split into navigable chunks with metadata
`think`	Structure reasoning sub-steps
`evaluate_progress`	Check confidence and convergence
`get_evidence`	Retrieve citation trail with filtering
`get_status`	Session state and metrics
`summarize_so_far`	Compress history to manage context
`finalize`	Complete with answer and citations

REPL Helpers (80+ functions available in exec_python)

Core: peek, lines, search, chunk, cite

Extraction (auto-detect from context): extract_numbers, extract_money, extract_percentages, extract_dates, extract_times, extract_timestamps, extract_emails, extract_urls, extract_ips, extract_phones, extract_paths, extract_env_vars, extract_versions, extract_uuids, extract_hashes, extract_hex

Code analysis: extract_functions, extract_classes, extract_imports, extract_comments, extract_strings, extract_todos

Log analysis: extract_log_levels, extract_exceptions, extract_json_objects

Statistics: word_count, char_count, line_count, sentence_count, paragraph_count, unique_words, word_frequency, ngrams

Line operations (grep-like): head, tail, grep, grep_v, grep_c, uniq, sort_lines, number_lines, strip_lines, blank_lines, non_blank_lines, columns

Text manipulation: replace_all, split_by, between, before, after, truncate, wrap_text, indent_text, dedent_text, normalize_whitespace, remove_punctuation, to_lower, to_upper, to_title

Pattern matching: contains, contains_any, contains_all, count_matches, find_all, first_match

Comparison: diff, similarity, common_lines, diff_lines

Collections: dedupe, flatten, first, last, take, drop, partition, group_by, frequency, sample_items, shuffle_items

Validation: is_numeric, is_email, is_url, is_ip, is_uuid, is_json, is_blank

Conversion: to_json, from_json, to_csv_row, from_csv_row, to_int, to_float, to_snake_case, to_camel_case, to_pascal_case, to_kebab_case, slugify

# Examples
emails = extract_emails()  # Auto-extracts from ctx
money = extract_money()    # Finds $1,234.56 patterns
errors = grep("ERROR")     # Filter lines
word_frequency(top_n=10)   # Most common words

Sandbox Builtins

Types: bool, int, float, str, dict, list, set, tuple, type, frozenset, bytes, bytearray, complex, slice, object

Functions: len, range, enumerate, zip, map, filter, iter, next, callable, min, max, sum, sorted, reversed, any, all, abs, round, pow, divmod, repr, ascii, chr, ord, format, hex, oct, bin, print, isinstance, issubclass, hash, id

Exceptions: Exception, ValueError, TypeError, RuntimeError, KeyError, IndexError, ZeroDivisionError, NameError, AttributeError, StopIteration, AssertionError, LookupError, ArithmeticError, UnicodeError

Imports: re, json, csv, math, statistics, collections, itertools, functools, datetime, textwrap, difflib, random, string, hashlib, base64, urllib.parse, html

Configuration

Environment Variables:

Variable	Purpose
`KYTCHEN_MAX_ITERATIONS`	Iteration limit
`KYTCHEN_MAX_COST`	Cost limit in USD

CLI Commands:

kytchen-rlm install              # Interactive installer
kytchen-rlm install <client>     # Install to specific client
kytchen-rlm uninstall <client>   # Remove from client
kytchen-rlm doctor               # Verify installation

Supported clients: claude-desktop, cursor, windsurf, vscode, claude-code

Security

The sandbox is best-effort, not hardened.

Blocked: open, os, subprocess, socket, eval, exec, dunder access, imports outside allowlist

For production: Run in a container with resource limits. Do not expose to untrusted users without additional isolation.

Development

git clone https://github.com/shannon-labs/kytchen.git
cd kytchen
pip install -e '.[dev,mcp]'
pytest -q  # 230 tests

Backend API (dev mode)

pip install -e '.[dev,api]'
export KYTCHEN_DEV_MODE=1
uvicorn kytchen.api.app:app --reload

MCP server (local)

pip install -e '.[mcp]'
python -m kytchen.mcp.server

Frontend (Next.js)

cd kytchen-web
npm install
npm run dev

Self-Hosting

The repo includes Docker assets (see docker-compose.yml at the repo root if present in your checkout).

At minimum you’ll need:

a way to store dataset bytes (filesystem / S3-compatible storage / MinIO)
an API key bootstrap flow (for local dev, a static key is fine)

If you’re self-hosting and using BYOLLM:

Your client still supplies provider_api_key when calling /v1/query.

Configuration

Common environment variables:

KYTCHEN_DEV_MODE=1: run the API in in-memory mode (no DB)
KYTCHEN_PROVIDER: default provider name (e.g. anthropic, openai)
KYTCHEN_MODEL: default model name

The web app uses kytchen-web/.env.local for Supabase + API endpoints.

Glossary (Project Terms)

Pantry: stored datasets/context
Prep: tools and sandboxed REPL operations
Sauce: evidence/citations (provenance)
Ticket / Run: a query execution with a trajectory
Trajectory: step-by-step trace of the reasoning loop
Glass Kitchen: streaming progress UX (SSE)

If You’re Another AI Reading This

Useful entry points (high-signal files):

kytchen/core.py: RLM loop orchestration, trajectory creation, provider calls
kytchen/repl/sandbox.py + kytchen/repl/helpers.py: sandbox + evidence (cite) primitives
kytchen/api/app.py: Cloud API routes (/v1/query, /v1/query/stream, datasets)
kytchen-sdk/kytchen_sdk/client.py: Python SDK calling conventions + expected shapes
docs/WORKSPACE_ARCHITECTURE.md: product + API design intent

When giving advice, clarify:

Is the goal MCP local tooling or Cloud API + dashboard?
Is the issue about streaming, evidence/audit, dataset processing, or self-hosting?

Recent Changes

v0.2.0 (December 2025)

80+ new REPL helpers for document analysis:

16 extraction functions (emails, IPs, money, dates, phones, URLs, paths, versions, UUIDs, functions, classes, TODOs, log levels)
8 statistics (word/line/char count, word frequency, n-grams)
12 grep-like line operations (head, tail, grep, sort, uniq, columns)
15 text manipulation (replace, split, before/after, truncate, normalize)
6 pattern matching (contains, count_matches, find_all)
4 comparison (diff, similarity)
11 collection utilities (dedupe, flatten, group_by, frequency)
7 validators (is_email, is_url, is_ip, is_uuid, is_json)
11 converters (to_json, to_snake_case, slugify)

30+ new builtins: map, filter, iter, next, repr, chr, ord, pow, divmod, hash, id, callable, frozenset, bytes, slice...

6 new allowed imports: random, string, hashlib, base64, urllib.parse, html

v0.1.3 (December 2025)

Added type builtin to sandbox
Added NameError and AttributeError exceptions

Research

Inspired by Recursive Language Models by Alex Zhang and Omar Khattab.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

1.0.0 yanked

Dec 17, 2025

Reason this release was yanked:

old

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kytchen_rlm-1.0.0.tar.gz (684.1 kB view details)

Uploaded Dec 17, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

kytchen_rlm-1.0.0-py3-none-any.whl (215.8 kB view details)

Uploaded Dec 17, 2025 Python 3

File details

Details for the file kytchen_rlm-1.0.0.tar.gz.

File metadata

Download URL: kytchen_rlm-1.0.0.tar.gz
Upload date: Dec 17, 2025
Size: 684.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.8

File hashes

Hashes for kytchen_rlm-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`a521412b2e5a4997ec654d5e486f30b63043571f3aaf84d9760d9bcb8109bddd`
MD5	`dfab3d7cecc151fd1a014baa34ee3fd3`
BLAKE2b-256	`f52153832e4c0c643983960923a3efd772239c4626c691ab4c9d6cd91bc91924`

See more details on using hashes here.

File details

Details for the file kytchen_rlm-1.0.0-py3-none-any.whl.

File metadata

Download URL: kytchen_rlm-1.0.0-py3-none-any.whl
Upload date: Dec 17, 2025
Size: 215.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.8

File hashes

Hashes for kytchen_rlm-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f4d7f52fc30d6bbf9eccf17e23e308f1f18c817204db12d68710132f017697f1`
MD5	`b87f784e23277969125a92e3b911bb7a`
BLAKE2b-256	`a00ff6125cdb85a64e882a37fc0fbab84b30eac9cd734097e1f5fe65bba976dd`

See more details on using hashes here.

kytchen-rlm 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Kytchen

Quick Start

What This Repo Contains

High-Level Architecture

Repository Layout

File Converters (PDF / DOCX / XLSX)

How It Works

Kytchen Cloud API (FastAPI)

QueryResult shape

Streaming (/v1/query/stream)

Python SDK (Cloud)

Web Dashboard

Example

When to Use

Development

Backend API (dev mode)

MCP server (local)

Frontend (Next.js)

Self-Hosting

Configuration

Glossary (Project Terms)

If You’re Another AI Reading This

Recent Changes

v0.2.0 (December 2025)

v0.1.3 (December 2025)

Research

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Streaming (`/v1/query/stream`)