An agentic memory engine designed for lossless, tiered verbatim storage and multi-hop retrieval.

These details have not been verified by PyPI

Project links

Homepage

Project description

EpochDB Logo

EpochDB — Agentic Memory Engine

EpochDB is a high-performance, state-aware memory engine designed for lossless, tiered storage, atomic state management, and multi-hop relational reasoning. It is built specifically for AI agents that require perfect historical recall, long-term state persistence, and deterministic fact corrections.

Why EpochDB?

Flat vector databases retrieve text based on semantic similarity but struggle to resolve conflicting facts (e.g. "where does the user work now?" vs "where did they work last year?"). EpochDB solves this through Atomic State Management:

Topic Lock & Entity Seeding: Ensures retrieval stays within the target topic by seeding candidates directly from the Knowledge Graph.
State-Aware Supersession: Automatically identifies and filters out stale facts when they are updated.
Tiered HNSW Hierarchy: Sub-millisecond recall across working memory (L1 RAM) and historical archives (L2 Disk).
Memory Forking & Lineage: Supports logical branches (db.fork) for multi-agent collaboration and hypothetical reasoning without copying data.
Rich Domain Objects: Returns structured Memory, Entity, and Graph abstractions rather than raw database tuples.

Architecture

EpochDB uses a tiered hierarchy modeled after CPU caches to balance low latency and massive scale:

graph TD
    Agent([Agent / Application]) -->|remember / add_memory| Engine[EpochDB Engine]

    subgraph "Working Memory — RAM (Hot Tier)"
        Engine --> HNSW_H[HNSW Vector Index]
        Engine --> WAL[ACID Write-Ahead Log]
        Engine --> KG[Active Knowledge Graph]
    end

    subgraph "Historical Archive — Disk (Cold Tier)"
        HNSW_H -->|Async Flush| Parquet[(Parquet + F32 + Zstd)]
        Parquet <--> HNSW_C[HNSW Index per Epoch]
        HNSW_C <--> GEI[Global Entity Index]
    end

    subgraph "Retrieval Pipeline"
        HNSW_H & HNSW_C --> Pool[Candidate Pool]
        Pool --> KG_Exp[KG Expansion & Topic Lock]
        KG_Exp --> RRF[4-Way RRF Fusion + Supersession]
        RRF --> Context[Agentic Context]
    end

Performance, Latency & Token Efficiency

1. The 1.000 Sweep Benchmark

EpochDB achieves a perfect 1.000 score across named benchmark suites designed to validate engine logic:

Benchmark	What it tests	Metric	Score
LoCoMo	Multi-hop relational reasoning	Multi-hop recall	1.000
ConvoMem	Fact correction & preference recall	recall@3	1.000
LongMemEval	Longitudinal recall (cross-epoch)	recall@3	1.000
NIAH	Needle in a Haystack (High-noise)	precision@3	1.000

Run the benchmark suite locally:

venv/bin/python -m benchmarks.run_all

2. Operational Latency

Precision metrics across Hot and Cold tiers:

Direct/Multi-Hop Relational Retrieval (Hot Tier): 0.2 ms – 0.4 ms
Historical HNSW Retrieval (Cold Tier): ~4.0 ms (30x speedup from ~125 ms via persistent indexing)
Cold Tier Full Scan (pyarrow.dataset): 45.0 ms (for cross-epoch scalar aggregations)
Scalar Range Query (B-tree): 0.8 ms
Series Interpolation (IntervalTree): 1.2 ms
Constraint Satisfaction (Z3 SAT Solver): 2.5 ms
WAL Crash Recovery Replay: 9.1 ms

3. LangGraph Token Savings

When used as a checkpointer, EpochDB keeps LangGraph states "thin" by storing historical turns as Unified Memory Atoms and querying them selectively. This achieves linear $O(N)$ token scaling (saving 55% to 79% of input tokens compared to standard checkpointers' quadratic $O(N^2)$ accumulation).

Installation

pip install epochdb

Quickstart

1. Synchronous API Facade

from epochdb import EpochDB

# Initialize with auto-embedding
with EpochDB(storage_dir="./memory", embedding_model="all-MiniLM-L6-v2") as db:
    # Store a memory with KG triples
    db.remember("User works at DataFlow.", metadata={"triples": [("user", "works_at", "DataFlow")]})
    
    # Update facts (supersession resolves conflicts)
    db.remember("Actually, user now works at VectorAI.", metadata={"triples": [("user", "works_at", "VectorAI")]})
    
    # Query returns rich Memory objects
    results = db.query("Where does the user work?", k=1)
    print(results[0].text)  # "Actually, user now works at VectorAI."

2. Asynchronous API Facade

import asyncio
from epochdb import AsyncEpochDB

async def main():
    # Async context manager for non-blocking I/O in agent loops
    async with AsyncEpochDB(storage_dir="./memory", embedding_model="all-MiniLM-L6-v2") as db:
        await db.remember("VectorAI develops CRISPR-X platform.", metadata={"triples": [("VectorAI", "develops", "CRISPR-X")]})
        
        results = await db.query("What does VectorAI build?", k=1)
        print(results[0].text)

asyncio.run(main())

3. MongoDB-Style Metadata Filtering

# Filter retrieval using operators like $eq, $ne, $in, $nin, $gt, $gte, $lt, $lte
results = db.query(
    "Query text", 
    k=5, 
    filters={
        "author": "Jeff", 
        "importance": {"$gt": 3},
        "category": {"$in": ["development", "production"]}
    }
)

4. Soft-Delete & Compaction

# Mark memory as deleted (filtered out from queries by default)
db.delete(memory_id, hard=False)

# Reclaim space and deduplicate historical Parquet archives in the Cold Tier
db.compact()

5. Entity & Graph Traversal

# Retrieve entity object
vector_ai = db.get_entity("VectorAI")

# Traverse relations in Global Entity Index
related = vector_ai.related()  # [Entity("user"), Entity("CRISPR-X")]

# Chronological timeline of the entity
timeline = vector_ai.timeline()

# Generate local graph segment
graph = db.entity_graph("VectorAI", depth=2)
print(graph.nodes)  # ['VectorAI', 'user', 'CRISPR-X']
print(graph.edges)  # List of edge dictionaries mapping sources and targets

LangGraph Integration

EpochDB provides native checkpointer support for both synchronous and asynchronous workflows:

from epochdb.checkpointer import EpochDBCheckpointer
from epochdb import EpochDB

# Synchronous compile
with EpochDB(storage_dir="./agent_state") as db:
    checkpointer = EpochDBCheckpointer(db)
    app = workflow.compile(checkpointer=checkpointer)

For async runtimes:

from epochdb import AsyncEpochDB
from epochdb.checkpointer import EpochDBCheckpointer

async def run_agent():
    async with AsyncEpochDB(storage_dir="./agent_state") as db:
        checkpointer = EpochDBCheckpointer(db)
        app = workflow.compile(checkpointer=checkpointer)
        # Uses aput, aget_tuple, and alist internally under the hood

Repository Structure

The codebase is modularized to isolate engine subsystems:

core/: Core transactions, checkpointers, and base units.
storage/: Hot Tier (RAM HNSW) and Cold Tier (Parquet storage).
entities/: Global KG manager, cascade updates, and reflection rules.
retrieval/: Multi-stage retrieval managers, quantitative indexes, and RRF fusion.
api/: Public facade APIs (EpochDB and AsyncEpochDB) and domain objects (Memory, Entity, Graph).

Technical Specifications & Constants

+20.0 Topic Lock Boost: Set mathematically larger than the maximum possible Reciprocal Rank Fusion (RRF) score sum (which caps at $\approx 0.05$ across semantic and recency ranks, using $K=60$). This acts as a "hard lock," ensuring query-intent-matched facts always outrank adjacent semantic noise.
0.0001x Supersession Penalty: Multiplicatively demotes stale facts (e.g. older conflicting values for the same subject-predicate pair) to the bottom of the retrieval pool, resolving contradictions deterministically while preserving database history.
1e-7 Signal-to-Noise Demotion: Once a Topic-Locked fact is identified, all non-locked background noise is demoted by $10^{-7}$ to keep the LLM's context window clean and free from distractors.
Quantitative logic & Triggers: Native support for Scalars, Time-Series, and Constraints. IntervalTree enables precise $O(\log n + k)$ range queries with base-unit normalization via persistent schema_registry.json.
Reactive Cascade Graphs: CascadeManager automatically triggers downstream policy updates, while Coefficient of Variation (CV) reflections auto-generate constraint atoms from observed historical data trends.
Analytical Cold Tier: Leveraging pyarrow.dataset for high-performance cross-epoch scanning and numeric aggregation directly over compressed Parquet archives.
ACID Crash Recovery: Zero data loss for in-flight memories via the synchronous Write-Ahead Log.

License

MIT — see LICENSE.

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

1.0.3

Jun 19, 2026

1.0.2

Jun 19, 2026

1.0.1

Jun 18, 2026

This version

1.0.0

Jun 5, 2026

0.7.2

May 25, 2026

0.7.1

May 25, 2026

0.7.0

May 25, 2026

0.6.1

May 21, 2026

0.6.0

May 16, 2026

0.5.0

May 2, 2026

0.4.6

Apr 22, 2026

0.4.5

Apr 13, 2026

0.4.2

Apr 13, 2026

0.4.1

Apr 13, 2026

0.4.0

Apr 12, 2026

0.3.0

Apr 10, 2026

0.2.2

Apr 10, 2026

0.2.1

Apr 10, 2026

0.2.0

Apr 9, 2026

0.1.3

Apr 8, 2026

0.1.1

Apr 8, 2026

0.1.0

Apr 8, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epochdb-1.0.0.tar.gz (60.4 kB view details)

Uploaded Jun 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

epochdb-1.0.0-py3-none-any.whl (55.8 kB view details)

Uploaded Jun 5, 2026 Python 3

File details

Details for the file epochdb-1.0.0.tar.gz.

File metadata

Download URL: epochdb-1.0.0.tar.gz
Upload date: Jun 5, 2026
Size: 60.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for epochdb-1.0.0.tar.gz
Algorithm	Hash digest
SHA256	`dc807fc1f2c29246c9c02a47b72a5eef1e51b803ec638ac78f765760709d649b`
MD5	`034c97a46fd68984a3abfa78edd1d427`
BLAKE2b-256	`e1832e9310b1e7403da91eee70334359a2115049c8872ede8f991877ab062f81`

See more details on using hashes here.

File details

Details for the file epochdb-1.0.0-py3-none-any.whl.

File metadata

Download URL: epochdb-1.0.0-py3-none-any.whl
Upload date: Jun 5, 2026
Size: 55.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.5

File hashes

Hashes for epochdb-1.0.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f860c1bcb2589a5461be54e3b579407462fca9b5c906ca6691136c55d712d967`
MD5	`5f8a853509bc8e1ea4bcebe46a52f4fe`
BLAKE2b-256	`36ac80c30000878c34627bf1aab3600c0bb35368d26227d003d613ccc0977826`

See more details on using hashes here.

epochdb 1.0.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

EpochDB — Agentic Memory Engine

Why EpochDB?

Architecture

Performance, Latency & Token Efficiency

1. The 1.000 Sweep Benchmark

2. Operational Latency

3. LangGraph Token Savings

Installation

Quickstart

1. Synchronous API Facade

2. Asynchronous API Facade

3. MongoDB-Style Metadata Filtering

4. Soft-Delete & Compaction

5. Entity & Graph Traversal

LangGraph Integration

Repository Structure

Technical Specifications & Constants

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes