Knowledge memory library for long-horizon AI agents — hybrid retrieval over documents, embeddings, and graph relationships

These details have not been verified by PyPI

Project description

Khora

"Khora is the receptacle, the space, the matrix in which all things come to be." - Plato, Timaeus

A Python library for creating knowledge repositories that ingest unstructured and structured multi-source data and expose a single query substrate, built for integrating into long-horizon AI agents.

Quickstart

Two supported stacks. Pick by deployment shape - the public API is identical for both.

uv add khora              # Docker: PostgreSQL + pgvector + Neo4j
uv add khora[embedded]    # Embedded: SQLite + LanceDB, no external services

import asyncio
from khora import Khora, context_text

async def main() -> None:
    async with Khora() as kb:  # reads KHORA_DATABASE_URL / KHORA_NEO4J_URL
        ns = await kb.create_namespace()
        await kb.remember(
            "Marie Curie won the Nobel Prize in Physics in 1903.",
            namespace=ns.namespace_id,
            entity_types=["PERSON", "AWARD"],
            relationship_types=["WON"],
        )
        result = await kb.recall("What did Curie win?", namespace=ns.namespace_id)
        print(context_text(result))

asyncio.run(main())

For the Docker stack, set KHORA_DATABASE_URL and KHORA_NEO4J_URL, run uv run alembic upgrade head, then run the snippet above. For the embedded stack, set KHORA_STORAGE_BACKEND=sqlite_lance and the corresponding db_path - no external services required.

See the docs for the full extras list and env-var reference.

Why khora?

Knowledge repositories for long-horizon agents - copilots, customer-support bots, research assistants - hit two problems that pure vector search doesn't solve:

Ingest is more than chunking. A useful repository needs entities, relationships, and temporal anchors extracted from the raw text. Khora runs a 3-phase ingest pipeline (stage → enrich → expand) with selective LLM extraction (default 70% of chunks, configurable) and cross-batch entity resolution.
Retrieval is more than cosine. Real queries mix semantic similarity, multi-hop entity reasoning, freshness, and keyword precision. Khora combines vector + Cypher graph traversal + BM25 + RRF fusion + temporal-anchored reranking, routed per query.

Storage stacks

Stack	Install	Use when
PostgreSQL + pgvector + Neo4j	`uv add khora`	Default. Docker-friendly, multi-tenant ready, scales horizontally.
SQLite + LanceDB (experimental)	`uv add khora[embedded]`	Demos, evaluation, tests, single-user CLIs. Documented ceiling: ~1M chunks, ~100k entities, ~500k edges, traversal depth ≤3. Known gaps: no point-in-time queries, partial atomicity in `coordinator.transaction()`, FTS on chunks only.

Retrieval engines

Khora's retrieval layer is pluggable. More than one engine can sit on the same storage substrate. The default is VectorCypher: it handles structured and unstructured data from multiple sources by combining a knowledge graph with a vector store, then dispatching each query to the best retrieval method via a query-aware router.

The router selects between vector similarity, graph traversal, BM25 keyword, or a fused blend. RRF fusion, optional PPR (Personalized PageRank), and optional cross-encoder reranking shape the result set. Selective per-chunk LLM extraction (KET-RAG style) bounds ingest cost.

Batch processing

submit_batch() stages documents as PENDING and returns a BatchHandle immediately. A background processor picks them up and calls on_result per document as each completes.

The processor is opt-in - call kb.start_pending_processor() after connect() on services that write documents. Read-only services do not need it.

async with Khora() as kb:
    kb.start_pending_processor()
    handle = await kb.submit_batch(
        [{"content": "doc 1"}, {"content": "doc 2"}],
        on_result=lambda completed, total, result: print(result),
        namespace=ns_id,
        entity_types=["PERSON", "ORG"],
        relationship_types=["WORKS_AT"],
    )
    await handle.wait()

Integrations

Opt-in adapters for the major agentic frameworks. Each adapter is in its own extra; the framework is imported lazily so import khora never pulls in a framework you don't use.

Framework	Install	Khora surface
CrewAI	`uv add khora[crewai]`	`KhoraMemory` - drop-in storage backend for CrewAI's unified `Memory`.
LangGraph	`uv add khora[langgraph]`	`KhoraStore` - `BaseStore` implementation for `StateGraph` semantic long-term memory.
Google ADK	`uv add khora[google-adk]`	`KhoraMemoryService` - `BaseMemoryService` drop-in for ADK `Runner`.
OpenAI Agents SDK	`uv add khora[openai-agents]`	`KhoraSession`, `khora_recall_tool`, `KhoraMemoryHooks`.
LlamaIndex	`uv add khora[llamaindex]`	`KhoraRetriever`, `KhoraMemoryBlock`.

See the docs for per-adapter guides and the "write your own" Protocol surface.

Examples

Runnable demos under examples/:

00_quickstart/ - remember + recall, grounded answers, forget, namespaces.
10_core_apis/ - batch ingest, recall filters, ontology config, entities + relationships, graph traversal.
20_integrations/ - LangGraph, OpenAI Agents SDK, CrewAI.
30_workloads/ - per-user preferences with temporal decay, document Q&A with multi-signal abstention, support-ticket knowledge graphs, agent conversation history, namespace versioning, resume search with cross-document entity resolution.

Run any demo from the repo root, e.g. uv run python examples/30_workloads/01_per_user_preferences.py. The embedded backend (examples/khora.embedded.yaml) needs no external services; pass --config examples/khora.standard.yaml to target PostgreSQL + Neo4j.

Rust acceleration (optional)

Khora ships an optional Rust extension (khora-accel) that speeds up MMR, cosine similarity, PageRank, entity resolution, community detection, and temporal operators. Pure-Python fallbacks ship in the base package; the Rust path is opt-in.

uv add khora[rust]

Prebuilt wheels cover the common platforms (macOS arm64/x86_64, Linux x86_64/aarch64, Windows x86_64), so most users won't need a toolchain. Building from source requires Rust 1.85+.

Observability

Khora emits OpenTelemetry spans and metrics through the OTel API. Export path is your choice: vanilla OTel SDK (uv add khora[otel]), Logfire (uv add khora[logfire]), or nothing (zero-cost no-op). Credential fields on KhoraConfig are pydantic.SecretStr; free-text never leaks into span attributes.

See the docs for the env-var contract, vendor recipes, and the telemetry surface reference.

Documentation

Full documentation at docs.deyta.ai/khora - API reference, configuration, architecture, migrations, integrations, and the downstream-consumer guide.

Development

make dev         # start PostgreSQL + Neo4j (Docker); alias: make db-up
make test        # pytest with coverage
make format      # ruff format + isort
make lint        # ruff + ty typecheck

See CHANGELOG.md for release history.

License

Licensed under the Apache License, Version 2.0. See LICENSE and NOTICE.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.20.0

Jun 15, 2026

0.19.0

Jun 8, 2026

0.18.5

Jun 5, 2026

0.18.4

Jun 5, 2026

0.18.3

Jun 2, 2026

0.18.2

Jun 2, 2026

0.18.1

Jun 1, 2026

0.18.0

Jun 1, 2026

0.17.4

May 29, 2026

0.17.3

May 28, 2026

0.17.2

May 26, 2026

0.17.1

May 26, 2026

0.17.0

May 24, 2026

0.16.4

May 22, 2026

0.16.3

May 21, 2026

0.16.2

May 21, 2026

0.16.1

May 20, 2026

0.16.0

May 19, 2026

0.15.3

May 19, 2026

0.15.2

May 19, 2026

0.15.0

May 17, 2026

0.14.0

May 16, 2026

0.13.0

May 15, 2026

0.12.1

May 15, 2026

0.12.0

May 14, 2026

0.11.2

May 14, 2026

0.11.1

May 14, 2026

0.11.0

May 14, 2026

0.10.8

May 13, 2026

0.10.7

May 13, 2026

0.10.6

May 12, 2026

0.10.5

May 12, 2026

0.10.4

May 12, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

khora-0.20.0.tar.gz (7.1 MB view details)

Uploaded Jun 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

khora-0.20.0-py3-none-any.whl (1.4 MB view details)

Uploaded Jun 15, 2026 Python 3

File details

Details for the file khora-0.20.0.tar.gz.

File metadata

Download URL: khora-0.20.0.tar.gz
Upload date: Jun 15, 2026
Size: 7.1 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for khora-0.20.0.tar.gz
Algorithm	Hash digest
SHA256	`b7a244bb03d17ba860f696415b569d9b8c297f0dd055bec8e8f07ee583c2d475`
MD5	`6345089c84d05da8c3171f92a36d0053`
BLAKE2b-256	`4db5dcd966aa8a0a494c47d50d6da010289ebead1d86e1b49835f57a061e5c73`

See more details on using hashes here.

Provenance

The following attestation bundles were made for khora-0.20.0.tar.gz:

Publisher: release.yml on DeytaHQ/khora

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: khora-0.20.0.tar.gz
- Subject digest: b7a244bb03d17ba860f696415b569d9b8c297f0dd055bec8e8f07ee583c2d475
- Sigstore transparency entry: 1822828596
- Sigstore integration time: Jun 15, 2026
Source repository:
- Permalink: DeytaHQ/khora@6f931ed44edc0698df7377fff38af22b9c741c58
- Branch / Tag: refs/tags/v0.20.0
- Owner: https://github.com/DeytaHQ
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@6f931ed44edc0698df7377fff38af22b9c741c58
- Trigger Event: push

File details

Details for the file khora-0.20.0-py3-none-any.whl.

File metadata

Download URL: khora-0.20.0-py3-none-any.whl
Upload date: Jun 15, 2026
Size: 1.4 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for khora-0.20.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`adda4333f28aa90ffe0e8a109c7863eb3e379685043433ebdc981d85d453b43c`
MD5	`6e01abc34c98ea873e37f8a45ac37652`
BLAKE2b-256	`fe0ab56ede68efad23fa5975052e89428df532743106e40fd772901690b39871`

See more details on using hashes here.

Provenance

The following attestation bundles were made for khora-0.20.0-py3-none-any.whl:

Publisher: release.yml on DeytaHQ/khora

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: khora-0.20.0-py3-none-any.whl
- Subject digest: adda4333f28aa90ffe0e8a109c7863eb3e379685043433ebdc981d85d453b43c
- Sigstore transparency entry: 1822828606
- Sigstore integration time: Jun 15, 2026
Source repository:
- Permalink: DeytaHQ/khora@6f931ed44edc0698df7377fff38af22b9c741c58
- Branch / Tag: refs/tags/v0.20.0
- Owner: https://github.com/DeytaHQ
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@6f931ed44edc0698df7377fff38af22b9c741c58
- Trigger Event: push

khora 0.20.0

Navigation

Verified details

Project links

Owner

GitHub Statistics

Unverified details

Meta

Project description

Khora

Quickstart

Why khora?

Storage stacks

Retrieval engines

Batch processing

Integrations

Examples

Rust acceleration (optional)

Observability

Documentation

Development

License

Project details

Verified details

Project links

Owner

GitHub Statistics

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance