KV-cache native context database with reasoning-based tree retrieval

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Junyao

These details have not been verified by PyPI

Project description

ConDB: The KV-Cache Native Context Database

A new context database for reasoning-driven retrieval via tree search.
Fast, context-aware retrieval at scale with up to 70% less token cost.

🌲 What is ConDB?

ConDB (Context Database) is a tree-structured context database that uses LLM-powered reasoning-based retrieval via tree search instead of vector similarity — no vector DB, no chunking. It accepts PageIndex-compatible document trees, ChatIndex conversation trees, filesystem trees, and custom hierarchical JSON — with no runtime dependency on either. The LLM reasons over the tree, like a human expert using a table of contents, to locate relevant content.

Why not vector search?

Similarity ≠ relevance — vector search retrieves what looks similar, not what is truly relevant. Similar-looking chunks may differ in intent (low accuracy), while truly relevant information may be expressed in very different language and get missed entirely (low recall). True relevance requires reasoning
Chunking breaks semantic continuity — documents must be split into fixed-size segments to fit embedding models, causing context fragmentation that destroys their natural structure and cross-section relationships
Retrieval is blind to context — embedding models encode the query alone, ignoring conversational history, user intent, and other contextual signals

ConDB replaces this with reasoning-based tree search: the LLM performs node-level relevance classification over a hierarchical index, incorporating full context — making retrieval adaptive, explainable, and traceable.

What makes ConDB different

Fast tree search at scale — reasoning-driven tree search with block partitioning and parallel processing, supporting complex, context-aware retrieval over large hierarchical structures
KV-cache native — the first database designed around LLM KV-cache reuse. By caching intermediate results during tree search, ConDB reduces token usage by up to 70% with no loss in accuracy. The same efficiency gains extend to memory systems for long-context reasoning at scale
Unified long-context infrastructure — a single system for both static and dynamic long-context workloads

Static long context

Structured, persistent knowledge — documents (via PageIndex), file systems, and codebases. Scalable retrieval within large, organized hierarchies.

Dynamic long context

Evolving, runtime context — agent memory, long conversations (via ChatIndex), and autoresearch. Systems can continuously update, retrieve, and reason over newly generated information.

Key capabilities

Hierarchical storage — document trees, chat trees, and custom hierarchical JSON in SQLite
Multiple retrieval strategies — beam search for small trees, block retrieval for large documents
Multi-provider LLM support — Anthropic (Claude) and OpenAI (GPT) out of the box
Extensible — plug in custom storage backends, LLM providers, or retrieval strategies

🚀 Getting Started

Install

pip install -r requirements.txt

Basic Usage

import contextdb

# Open database
db = contextdb.open("my_docs.sqlite")

# Configure LLM
db.set_llm(provider="anthropic", model="claude-sonnet-4-6")

# Store a document tree
tree_id = db.store(document_tree_json, format="document")

# Query with LLM reasoning
result = db.query(tree_id, "What are the key findings?")
print(result.contents)

Index from files with an external tree builder

from contextdb import ContextTree

def build_markdown_tree(path: str) -> dict:
    ...

ct = ContextTree("context.sqlite")

tree_id = ct.index_markdown_file("doc.md", tree_builder=build_markdown_tree)

# You can also generate a tree out of process and call:
# tree_id = ct.index_document_tree(document_tree_json)

ct.close()

Configuration

Create a .env file with your API keys:

ANTHROPIC_API_KEY=sk-...
OPENAI_API_KEY=sk-...

Model and provider settings live in contextdb/config/config.yaml:

llm:
  provider: anthropic          # anthropic or openai
  model: claude-sonnet-4-6     # any model the provider supports
  context_limit: 100000
  max_concurrent: 10

retriever:
  beam_size: 3
  max_turns: 5

Override at runtime with environment variables:

LLM_MODEL=claude-opus-4-6 python your_script.py

🔍 Retrieval Strategies

ConDB automatically selects the best retrieval strategy based on tree size:

Strategy	Best for	How it works
Beam	Small trees (< 50 nodes)	LLM evaluates and selects promising branches at each depth level
Block	Large documents (50+ nodes)	Splits tree into token-bounded blocks, LLM reasons over each block. KV-cache native — caches intermediate block results to cut token usage by up to 70%

You can also specify a strategy explicitly:

result = db.query(tree_id, "question", strategy="block", beam_size=3)

📈 Benchmark Snapshot

Two benchmarks live under bench/.

Filesystem mode — SWEBench-FileTree

Runs on AmuroEita/SWEBench-FileTree, a path-only version of SWE-bench code retrieval:

500 GitHub issues as queries
475 (repo, commit) repository snapshots as independent retrieval universes
58,058 file paths; no source code, no file summaries

Given an issue and one snapshot's file tree, return the file(s) the fix touches. Specification: notes/condb_swebench_filetree_bench.md.

export ANTHROPIC_API_KEY=sk-ant-...
python bench/run_swebench_filetree.py --tier medium

Tiers (by retriever difficulty; lower difficulty = more path signal in query):

easy     107 queries   gold path appears in query text (sanity check)
medium   133 queries   gold filename appears in query    (main report)
hard     261 queries   gold module stem appears          (fuzzy matching)
all      500 queries   no filter, includes ~48% path-signal-less queries

Output goes to bench/runs/<timestamp>__<tier>/: report.md, summary.json, per_query.jsonl.

Block mode can optionally rerank only the cross-block merge candidates before the file/directory split:

python bench/run_swebench_filetree.py --tier medium --strategy block --ranker vector

Available rankers are none, bm25, and vector. The vector ranker uses LiteLLM embeddings (--embedding-provider, --embedding-model) and leaves the default ranker=none unchanged.

Latest Run

Claude Sonnet 4.6, --ranker none, --top-k 10, 500 queries, 0 failures. Block (ConDB) is compared against a Vertical baseline — a per-beam variant that expands each parent's children into separate subtree blocks (A→B, A→C), one LLM call per branch, losing the cross-branch view Block keeps.

variant	recall@gold	exact@gold	MRR	nDCG@10	avg returned	avg latency
Vertical (baseline)	0.382	0.366	0.466	0.481	3.00	~24 s
Block (ConDB)	0.711	0.672	0.805	0.813	7.20	~8 s

Block gains +0.33 recall@gold at ~3× lower latency.

Block per-gold-count breakdown — the cutoff is the query's gold-file count (6+ row contains one 6-gold query and one 21-gold outlier):

gold files	queries	cutoff	recall@gold	exact@gold	found@gold	avg returned
1	430	1	0.749	0.749	0.75	7.00
2	48	2	0.521	0.271	1.04	8.31
3	13	3	0.410	0.077	1.23	8.77
4	6	4	0.417	0.000	1.67	9.17
5	1	5	0.200	0.000	1.00	2.00
6+	2	gold	0.274	0.000	2.00	10.00

Reproduce:

python bench/run_swebench_filetree.py --tier all --strategy block    --ranker none --top-k 10
python bench/run_swebench_filetree.py --tier all --strategy vertical --ranker none --top-k 10

Document mode — single long document

Compares retriever algorithms (Block / Beam / Vertical / ...) on one hierarchical document. Reports time, LLM calls, token usage with prompt caching, and USD cost.

python bench/run_document_bench.py \
  --doc examples/large_doc.json \
  --config bench/queries.json

Queries live in the config JSON as {"queries": ["...", "..."]}. Swap in any --doc and any --config to benchmark a different document.

🧩 Learn More

Architecture

contextdb/
├── api/
│   ├── condb.py          # ConDB — main entry point
│   └── context_tree.py   # ContextTree — tree indexing + query API
├── core/
│   └── storage.py        # TreeDB (SQLite), StorageProtocol
├── adapter/
│   └── base.py           # DocumentTree, ChatIndex, Generic adapters
├── retriever/
│   ├── base.py           # Retriever protocols
│   └── algorithm/        # Beam, Block retrieval strategies
├── llm.py                # LLMClient (Anthropic, OpenAI)
├── config/               # YAML configs for retrievers
└── prompts/              # Jinja2 prompt templates

Extending

Custom Storage Backend

from contextdb import StorageProtocol

class MyStorage:
    def get_node(self, tree_id, node_id): ...
    def get_children(self, tree_id, node_id): ...
    # implement StorageProtocol methods

ct = ContextTree(storage=MyStorage())

Custom LLM Provider

from contextdb import LLMProtocol

class MyLLM:
    def chat(self, messages, system="", tools=None):
        return {"content": [...], "stop_reason": "..."}

ct = ContextTree("db.sqlite", llm=MyLLM())

Testing

./run_tests.sh all

💬 Community

Related Projects

PageIndex — vectorless, reasoning-based RAG that builds hierarchical tree indexes from long documents
ChatIndex — tree indexing for long conversations, enabling reasoning-based retrieval over chat histories
AgentFS — filesystem for AI agents

Connect with Us

Licensed under Apache 2.0.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

Junyao

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

1.0

May 27, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pageindex_condb-1.0.tar.gz (78.6 kB view details)

Uploaded May 27, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

pageindex_condb-1.0-py3-none-any.whl (88.3 kB view details)

Uploaded May 27, 2026 Python 3

File details

Details for the file pageindex_condb-1.0.tar.gz.

File metadata

Download URL: pageindex_condb-1.0.tar.gz
Upload date: May 27, 2026
Size: 78.6 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pageindex_condb-1.0.tar.gz
Algorithm	Hash digest
SHA256	`343bbd25fac3ff01a01b7e23d4f2657c36a8fa838a42be62fb1bab29638da9d4`
MD5	`419749aa6e70c4b3932b0cbe4f1530dd`
BLAKE2b-256	`27791e5b38a2da9184ba88964f065581e384f622c0f5ecfcff99dd6fa5ad2f33`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pageindex_condb-1.0.tar.gz:

Publisher: publish.yml on VectifyAI/ConDB

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pageindex_condb-1.0.tar.gz
- Subject digest: 343bbd25fac3ff01a01b7e23d4f2657c36a8fa838a42be62fb1bab29638da9d4
- Sigstore transparency entry: 1643738620
- Sigstore integration time: May 27, 2026
Source repository:
- Permalink: VectifyAI/ConDB@171f3727f8ac684368d261c52259574f6c207ff3
- Branch / Tag: refs/tags/v1.0
- Owner: https://github.com/VectifyAI
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@171f3727f8ac684368d261c52259574f6c207ff3
- Trigger Event: release

File details

Details for the file pageindex_condb-1.0-py3-none-any.whl.

File metadata

Download URL: pageindex_condb-1.0-py3-none-any.whl
Upload date: May 27, 2026
Size: 88.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for pageindex_condb-1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`2e9eae1496950ec365dede7cf991095e55c787099717888f7b0d07ab0ac41977`
MD5	`fe1c2c9d32a6f902447b8a2c736e642b`
BLAKE2b-256	`c053cbd761b9410e182d7b93bde8fded5046cf374b0117ad84147fe9403daa4d`

See more details on using hashes here.

Provenance

The following attestation bundles were made for pageindex_condb-1.0-py3-none-any.whl:

Publisher: publish.yml on VectifyAI/ConDB

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: pageindex_condb-1.0-py3-none-any.whl
- Subject digest: 2e9eae1496950ec365dede7cf991095e55c787099717888f7b0d07ab0ac41977
- Sigstore transparency entry: 1643738654
- Sigstore integration time: May 27, 2026
Source repository:
- Permalink: VectifyAI/ConDB@171f3727f8ac684368d261c52259574f6c207ff3
- Branch / Tag: refs/tags/v1.0
- Owner: https://github.com/VectifyAI
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@171f3727f8ac684368d261c52259574f6c207ff3
- Trigger Event: release

pageindex-condb 1.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

ConDB: The KV-Cache Native Context Database

🌲 What is ConDB?

Why not vector search?

What makes ConDB different

Static long context

Dynamic long context

Key capabilities

🚀 Getting Started

Install

Basic Usage

Index from files with an external tree builder

Configuration

🔍 Retrieval Strategies

📈 Benchmark Snapshot

Filesystem mode — SWEBench-FileTree

Latest Run

Document mode — single long document

🧩 Learn More

Architecture

Extending

Testing

💬 Community

Related Projects

Connect with Us

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance