A Cognitive Memory Engine for Persistent AI Systems

Owner

Yantrik OS

YantrikDB — A Cognitive Memory Engine for Persistent AI Systems

The memory engine for AI that actually knows you.

The Problem

Current AI systems have no coherent memory architecture. They bolt together generic databases — vector stores, knowledge graphs, key-value caches — none of which were designed for how cognition works. This makes persistent, evolving AI relationships impossible at scale.

Today's AI memory is:

Store everything → Embed → Retrieve top-k → Inject into context → Hope it helps.

That does not scale cognitively.

The Thesis

AI needs a purpose-built memory engine with native support for:

Temporal decay — memories age and fade like human memory
Semantic consolidation — patterns are extracted, redundancy is compressed
Conflict resolution — contradictions are detected and resolved conversationally
Multi-device replication — local-first CRDT-based sync across devices
Proactive cognition — background processing that gives AI genuine reasons to initiate conversation

All in a single embedded engine — no server, no network hops, no stitching together five databases.

Why Not Use Existing Solutions?

Solution	What it does	What it lacks
Vector DBs (Pinecone, Weaviate, Milvus)	High-dimensional nearest-neighbor lookup	No time awareness, no causality, no compression, no self-organization
Knowledge Graphs (Neo4j)	Structured relations, entity linking	Hard to scale dynamically, poor for fuzzy memory, not adaptive
Memory Frameworks (LangChain, LlamaIndex)	Retrieval wrappers, context injection	Not true memory architectures — just middleware

Human memory is hierarchical, compressed, contextual, self-updating, emotionally weighted, time-aware, and predictive. No existing system addresses this holistically.

Architecture

Design Principles

Embedded, not client-server — single file, no server process (like SQLite)
Local-first, sync-native — works offline, syncs when connected
Cognitive operations, not SQL — record(), recall(), relate(), not SELECT
Living system, not passive store — does work between conversations

Unified Index Architecture

Five index types in one engine, sharing the same memory pages, WAL, and query planner:

┌─────────────────────────────────────────────────────┐
│                  YantrikDB Engine                   │
│                                                     │
│  ┌───────────┬───────────┬───────────┬───────────┐ │
│  │  Vector   │  Graph    │ Temporal  │   Decay   │ │
│  │  Index    │  Index    │  Index    │   Heap    │ │
│  │  (HNSW)   │ (Entities)│ (Events)  │(Priority) │ │
│  └───────────┴───────────┴───────────┴───────────┘ │
│  ┌───────────┐                                      │
│  │ Key-Value │                                      │
│  │  Store    │                                      │
│  └───────────┘                                      │
│                                                     │
│  ┌───────────────────────────────────────────────┐  │
│  │         Write-Ahead Log (WAL)                 │  │
│  └───────────────────────────────────────────────┘  │
│  ┌───────────────────────────────────────────────┐  │
│  │      Replication Log (append-only)            │  │
│  │      CRDT-based conflict resolution           │  │
│  └───────────────────────────────────────────────┘  │
└─────────────────────────────────────────────────────┘

Vector Index (HNSW) — semantic similarity search across memories
Graph Index — entity relationships ("Max is user's dog", "user works at Meta")
Temporal Index — time-series style, "what happened around Tuesday"
Decay Heap — priority queue with importance scores that degrade over time
Key-Value Store — fast facts ("user's name is Pranab")
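
To make the Decay Heap idea concrete, here is a minimal Python sketch of a priority queue over time-discounted importance scores. It is an illustration only: the exponential half-life model, the `DecayHeap` class, and every method and parameter name are assumptions, not YantrikDB's actual implementation or API.

```python
import heapq
import math
from dataclasses import dataclass, field


def decayed_importance(importance: float, age_seconds: float, half_life_seconds: float) -> float:
    """Assumed decay model: the score halves every `half_life_seconds`."""
    return importance * math.pow(0.5, age_seconds / half_life_seconds)


@dataclass
class DecayHeap:
    """Hypothetical decay heap; not the engine's real data structure."""
    half_life_seconds: float = 7 * 24 * 3600.0  # hypothetical default: one week
    _items: dict = field(default_factory=dict)  # memory_id -> (importance, created_at)

    def add(self, memory_id: str, importance: float, created_at: float) -> None:
        self._items[memory_id] = (importance, created_at)

    def score(self, memory_id: str, now: float) -> float:
        importance, created_at = self._items[memory_id]
        return decayed_importance(importance, now - created_at, self.half_life_seconds)

    def prune(self, now: float, threshold: float) -> list[str]:
        """Remove and return ids whose decayed score is below `threshold`, weakest first."""
        heap = [(self.score(mid, now), mid) for mid in self._items]
        heapq.heapify(heap)
        pruned = []
        while heap and heap[0][0] < threshold:
            _, mid = heapq.heappop(heap)
            del self._items[mid]
            pruned.append(mid)
        return pruned


heap = DecayHeap(half_life_seconds=100.0)
heap.add("a", importance=0.8, created_at=0.0)
heap.add("b", importance=0.2, created_at=0.0)
# After one half-life "a" scores 0.4 and "b" scores 0.1, so only "b" falls below 0.2.
pruned = heap.prune(now=100.0, threshold=0.2)
```

Rebuilding the heap on every prune keeps the sketch short; a real engine would more likely maintain the ordering incrementally.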

Memory Types

Inspired by cognitive science (Tulving's taxonomy of episodic, semantic, and procedural memory), extended with an emotional weighting:

Type	What it stores	Example
Episodic	Events, experiences with context	"User had a rough day at work on Feb 20"
Semantic	Facts, knowledge, abstractions	"User is a software engineer who likes AI"
Procedural	Strategies, behaviors, what worked	"User prefers concise answers with code examples"
Emotional	Valence weighting on memories	"Dog's death → high emotional weight → never forget"
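
One possible way to model these types in Python is sketched below. The `MemoryType` enum, the `Memory` record, and its field names are hypothetical. Treating the "Emotional" row as a valence attribute on every memory rather than as a separate type is also an assumption, based on the table describing it as a weighting.

```python
from dataclasses import dataclass
from enum import Enum


class MemoryType(Enum):
    EPISODIC = "episodic"      # events and experiences with context
    SEMANTIC = "semantic"      # facts, knowledge, abstractions
    PROCEDURAL = "procedural"  # strategies and behaviors that worked


@dataclass(frozen=True)
class Memory:
    """Hypothetical record shape; field names are illustrative."""
    text: str
    kind: MemoryType
    importance: float = 0.5  # 0.0 (trivial) to 1.0 (critical)
    valence: float = 0.0     # emotional weight: -1.0 (negative) to +1.0 (positive)

    def __post_init__(self) -> None:
        # Reject out-of-range weights early so later scoring can rely on them.
        if not 0.0 <= self.importance <= 1.0:
            raise ValueError("importance must be in [0, 1]")
        if not -1.0 <= self.valence <= 1.0:
            raise ValueError("valence must be in [-1, 1]")


rough_day = Memory(
    "User had a rough day at work on Feb 20",
    MemoryType.EPISODIC,
    importance=0.6,
    valence=-0.7,
)
```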

Core Operations

yantrikdb.record(memory, importance=0.8, emotion="frustrated")
yantrikdb.recall("What does the user feel about their job?")
yantrikdb.relate("user.job", "user.stress", strength=0.7)
yantrikdb.consolidate(topic="user.career", since="30d")
yantrikdb.decay(threshold=0.1)       # prune low-importance memories
yantrikdb.forget(memory_id)          # explicit removal
yantrikdb.conflict(memory_a, memory_b)  # flag contradiction
yantrikdb.resolve(conflict_id, resolution)  # user-driven resolution
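
The calls above show the shape of the API rather than a runnable program. To illustrate the record / recall / forget semantics end to end, the sketch below uses a toy in-memory stand-in. `ToyMemory` is hypothetical and is not the `yantrikdb` package; it ranks by shared words instead of embeddings, purely to stay self-contained.

```python
import itertools
import re


class ToyMemory:
    """Toy stand-in for illustration; not the yantrikdb package."""

    def __init__(self) -> None:
        self._ids = itertools.count(1)
        self._memories: dict[int, tuple[str, float]] = {}

    def record(self, text: str, importance: float = 0.5) -> int:
        memory_id = next(self._ids)
        self._memories[memory_id] = (text, importance)
        return memory_id

    def recall(self, query: str, top_k: int = 3) -> list[str]:
        """Rank by shared-word count times importance (a real engine would use embeddings)."""
        query_words = set(re.findall(r"\w+", query.lower()))
        scored = []
        for text, importance in self._memories.values():
            overlap = len(query_words & set(re.findall(r"\w+", text.lower())))
            if overlap:
                scored.append((overlap * importance, text))
        scored.sort(key=lambda pair: pair[0], reverse=True)
        return [text for _, text in scored[:top_k]]

    def forget(self, memory_id: int) -> None:
        self._memories.pop(memory_id, None)


db = ToyMemory()
job_id = db.record("User feels stressed about their job", importance=0.8)
db.record("User's dog is named Max", importance=0.6)
results = db.recall("What does the user feel about their job?")
```

Here the job memory ranks first because it shares more words with the query and carries a higher importance; after `db.forget(job_id)` it no longer appears in any recall.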

Conflict Resolution — Human-in-the-Loop

When synced devices produce contradictory memories, YantrikDB doesn't guess. It creates a conflict segment — a first-class data structure:

┌──────────────────────────────────────────┐
│            Conflict Segment              │
│                                          │
│  conflict_id:  c_0042                    │
│  type:         identity_fact             │
│  priority:     high                      │
│  memory_a:     "works at Google" (phone) │
│  memory_b:     "works at Meta" (laptop)  │
│  status:       pending_resolution        │
│  strategy:     ask_user                  │
│  resolved_by:  null                      │
│  resolution:   null                      │
└──────────────────────────────────────────┘

Resolution happens conversationally, not programmatically:

"Oh by the way — last month you mentioned something about Meta. Did you end up switching from Google?"

Conflicts are triaged by priority:

Conflict Type	Action
Critical identity facts	Ask immediately
Preferences that changed	Ask naturally in conversation
Minor contradictions	Keep both, resolve lazily
Temporal conflicts	Prefer most recent, flag if uncertain
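
The conflict segment and the triage table can be sketched as a small data structure plus a lookup. The field names follow the segment shown above; the `Action` enum, the triage keys other than `identity_fact`, and the `"resolved"` status value are assumptions made for illustration.

```python
from dataclasses import dataclass
from enum import Enum
from typing import Optional


class Action(Enum):
    ASK_IMMEDIATELY = "ask_immediately"
    ASK_IN_CONVERSATION = "ask_in_conversation"
    KEEP_BOTH = "keep_both"
    PREFER_RECENT = "prefer_recent"


# Mapping taken from the triage table; the type keys are hypothetical labels.
TRIAGE = {
    "identity_fact": Action.ASK_IMMEDIATELY,
    "preference": Action.ASK_IN_CONVERSATION,
    "minor": Action.KEEP_BOTH,
    "temporal": Action.PREFER_RECENT,
}


@dataclass
class ConflictSegment:
    conflict_id: str
    type: str
    memory_a: str
    memory_b: str
    status: str = "pending_resolution"
    resolved_by: Optional[str] = None
    resolution: Optional[str] = None

    def action(self) -> Action:
        """Look up how this conflict type should be handled."""
        return TRIAGE[self.type]

    def resolve(self, resolution: str, resolved_by: str = "user") -> None:
        """Record the outcome; the 'resolved' status name is an assumption."""
        self.resolution = resolution
        self.resolved_by = resolved_by
        self.status = "resolved"


conflict = ConflictSegment("c_0042", "identity_fact", "works at Google", "works at Meta")
first_action = conflict.action()
```

Once the user answers the question, calling `conflict.resolve("works at Meta")` would close the segment.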

Multi-Device Sync Protocol

YantrikDB is local-first with CRDT-based replication:

┌──────────────────────┐       ┌──────────────────────┐
│   Device A (Phone)   │       │  Device B (Laptop)   │
│                      │       │                      │
│  ┌────────────────┐  │ sync  │  ┌────────────────┐  │
│  │YantrikDB Engine│◄─┼───────┼─►│YantrikDB Engine│  │
│  └────────────────┘  │       │  └────────────────┘  │
│  ┌────────────────┐  │       │  ┌────────────────┐  │
│  │ Replication    │  │       │  │ Replication    │  │
│  │ Log            │  │       │  │ Log            │  │
│  └────────────────┘  │       │  └────────────────┘  │
└──────────────────────┘       └──────────────────────┘
         │                              │
         └──────────┬───────────────────┘
                    │
            P2P / Relay / BLE
        (encrypted, zero-knowledge)

Append-only replication log — every write, consolidation, and decay event is logged
CRDT merging — graph edges/nodes and facts merge without conflicts
Vector indexes rebuild locally — raw memories sync, each device rebuilds HNSW
Forget propagation — tombstones ensure forgotten memories stay forgotten
Optional cloud relay — dumb encrypted pipe, not a server. Sees nothing.
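
Forget propagation with tombstones can be illustrated with a two-phase set (2P-Set), one of the simplest state-based CRDTs. This is a sketch of the general technique, not YantrikDB's replication format: the `MemorySet` class is hypothetical, and it assumes memory ids are globally unique and that a memory's text never changes after it is recorded.

```python
from dataclasses import dataclass, field


@dataclass
class MemorySet:
    """Two-phase set (2P-Set): a grow-only map of adds plus a grow-only set of tombstones."""
    added: dict = field(default_factory=dict)     # memory_id -> text
    tombstones: set = field(default_factory=set)  # ids that were forgotten

    def record(self, memory_id: str, text: str) -> None:
        self.added[memory_id] = text

    def forget(self, memory_id: str) -> None:
        self.tombstones.add(memory_id)

    def merge(self, other: "MemorySet") -> "MemorySet":
        """Union both components; the result does not depend on merge order."""
        return MemorySet({**self.added, **other.added}, self.tombstones | other.tombstones)

    def visible(self) -> dict:
        """Memories that were recorded and never forgotten on any replica."""
        return {mid: text for mid, text in self.added.items() if mid not in self.tombstones}


phone = MemorySet()
phone.record("m1", "Max is user's dog")
phone.record("m2", "user works at Meta")
laptop = phone.merge(MemorySet())   # laptop receives the phone's state
laptop.forget("m2")                 # forgotten on the laptop only
phone.record("m3", "user's name is Pranab")
merged = phone.merge(laptop)
```

After the merge, `m2` stays hidden even though the phone never saw the forget, because the tombstone travels with the laptop's state.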

Storage Tiers

Tier	Backing	Use case
Hot	In-memory	Recent/frequent memories, active conversation
Warm	SSD-backed	Medium-term, weeks to months
Cold	Compressed archival	Old memories, on-demand hydration
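
A tier could be chosen from how recently a memory was accessed. The sketch below assumes that policy; the cutoffs (7 days, 180 days) and the `choose_tier` function are hypothetical values picked to match "recent" and "weeks to months", not settings taken from YantrikDB.

```python
from datetime import datetime, timedelta

HOT_WINDOW = timedelta(days=7)     # hypothetical cutoff for "recent"
WARM_WINDOW = timedelta(days=180)  # hypothetical cutoff for "weeks to months"


def choose_tier(last_accessed: datetime, now: datetime) -> str:
    """Return 'hot', 'warm', or 'cold' based on time since last access."""
    age = now - last_accessed
    if age <= HOT_WINDOW:
        return "hot"
    if age <= WARM_WINDOW:
        return "warm"
    return "cold"


now = datetime(2026, 3, 20)
tiers = [choose_tier(now - timedelta(days=d), now) for d in (1, 30, 400)]
```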

Proactive Cognition Loop

YantrikDB runs a background processing loop even between conversations — giving AI genuine reasons to reach out:

┌─────────────────────────────────────────────────┐
│           Proactive Trigger System               │
│                                                  │
│  Memory Conflicts    → "You mentioned two        │
│  (need resolution)     different moving dates"   │
│                                                  │
│  Pattern Detection   → "You seem stressed        │
│  (noticed something)   every Sunday evening"     │
│                                                  │
│  Temporal Triggers   → "Your mom's birthday      │
│  (time-based)          is tomorrow"              │
│                                                  │
│  Decay Warnings      → "I'm fuzzy on your        │
│  (about to forget)     new coworker's name"      │
│                                                  │
│  Goal Tracking       → "How's the marathon       │
│  (user set a goal)     training going?"          │
│                                                  │
│  Consolidation       → "I noticed you always     │
│  Insights              feel better after talking │
│                        to your sister"           │
└─────────────────────────────────────────────────┘

Every proactive message is grounded in real memory data — not engagement farming.

Built-in safety constraints:

Rule	Purpose
Cooldown periods	Enforce a minimum gap between messages (no hourly pings)
Priority threshold	Only reach out when it matters
Time-of-day awareness	Don't message at 3am
User-controlled frequency	"Check in weekly" vs "only urgent"
Groundedness requirement	Every message must trace to real memories
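
The constraints in the table compose naturally into a single gate that every proactive message must pass. The sketch below is one possible shape for it; `OutreachPolicy`, its default values (24-hour cooldown, 0.7 priority floor, quiet hours from 22:00 to 08:00), and the parameter names are all assumptions.

```python
from dataclasses import dataclass
from datetime import datetime, timedelta
from typing import Optional


@dataclass
class OutreachPolicy:
    """Hypothetical gate for proactive messages; defaults are illustrative."""
    cooldown: timedelta = timedelta(hours=24)  # user-controlled frequency
    min_priority: float = 0.7                  # priority threshold
    quiet_start_hour: int = 22                 # no messages from 22:00 ...
    quiet_end_hour: int = 8                    # ... until 08:00

    def allows(
        self,
        now: datetime,
        last_sent: Optional[datetime],
        priority: float,
        source_memory_ids: list[str],
    ) -> bool:
        if not source_memory_ids:  # groundedness: must trace to real memories
            return False
        if priority < self.min_priority:
            return False
        if last_sent is not None and now - last_sent < self.cooldown:
            return False
        in_quiet_hours = now.hour >= self.quiet_start_hour or now.hour < self.quiet_end_hour
        return not in_quiet_hours


policy = OutreachPolicy()
morning = datetime(2026, 3, 20, 10, 0)
night = datetime(2026, 3, 20, 3, 0)
ok_morning = policy.allows(morning, None, 0.9, ["m1"])
ok_night = policy.allows(night, None, 0.9, ["m1"])
```

With these defaults the 10:00 message is allowed and the 03:00 message is blocked, even though both are grounded and high priority.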

Background Processing Cycle

Consolidation pass — compress, summarize, abstract
Conflict detection — find contradictions across synced devices
Pattern mining — "user tends to X when Y"
Trigger evaluation — "is anything worth reaching out about?"
Decay pass — age out low-importance memories

Technical Decisions

Decision	Choice	Rationale
Architecture	Embedded (like SQLite)	No server overhead, sub-ms local reads, single-tenant
Core language	Rust	Memory safety without GC pauses, ideal for embedded engines
Bindings	Python, TypeScript	Agent/AI layer integration
Storage format	Single file per user	Portable, easy to back up, no infrastructure
Sync	CRDTs + append-only log	Conflict-free for most operations, deterministic
Query interface	Cognitive operations API	Not SQL — designed for how agents think

Target Use Cases

AI Companions — persistent, evolving relationships across devices
Autonomous Agents — long-horizon planning with memory consolidation
Multi-Agent Systems — shared memory between cooperating agents
Personal AI Assistants — that actually remember and grow with you

Roadmap

V0 — Single device, embedded engine, core memory model (record, recall, relate, consolidate, decay)
V1 — Replication log, sync between two devices
V2 — Conflict resolution with human-in-the-loop, production-grade sync
V3 — Proactive cognition loop, pattern detection, trigger system
V4 — Multi-agent shared memory, federated learning across users

Research & Publications

U.S. Patent Application 19/573,392 (March 2026): "Cognitive Memory Database System with Relevance-Conditioned Scoring and Autonomous Knowledge Management"
Zenodo: YantrikDB: A Cognitive Memory Engine for Persistent AI Systems
Related work by the author: "Convert Once, Consume Many: SDF for Cacheable, Typed Semantic Extraction from Web Pages" — solving efficient data ingestion for AI agents (the upstream problem to memory)

Author

Pranab Sarkar

Patent

YantrikDB's cognitive memory methods are covered by U.S. Patent Application No. 19/573,392 (filed March 20, 2026), claiming priority to Provisional Application No. 63/991,357 (filed February 26, 2026).

License

This program is free software: you can redistribute it and/or modify it under the terms of the GNU Affero General Public License as published by the Free Software Foundation, version 3.

See LICENSE for the full text.

Release history

0.7.19

May 20, 2026

0.7.18

May 20, 2026

0.7.17

May 18, 2026

0.7.16

May 16, 2026

0.7.15

May 14, 2026

0.7.14

May 14, 2026

0.7.13

May 14, 2026

0.7.12

May 14, 2026

0.7.11

May 13, 2026

0.7.10

May 13, 2026

0.7.9

May 12, 2026

0.7.8

May 9, 2026

0.7.7

May 9, 2026

0.7.6

May 9, 2026

0.7.5

May 9, 2026

0.7.4

May 9, 2026

0.7.3

May 9, 2026

0.7.2

May 8, 2026

0.6.3

Apr 28, 2026

0.4.2

Mar 24, 2026

0.4.1

Mar 24, 2026

0.4.0

Mar 23, 2026

0.3.1

Mar 21, 2026

0.3.0

Mar 21, 2026

0.2.9

Mar 21, 2026

0.2.8

Mar 21, 2026

0.2.7

Mar 21, 2026

0.2.6

Mar 21, 2026

0.2.4

Mar 20, 2026

0.2.3

Mar 20, 2026

0.2.2

Mar 20, 2026

0.2.1

Mar 20, 2026

This version

0.2.0

Mar 20, 2026

0.1.0

Mar 20, 2026

Download files


Source Distribution

yantrikdb-0.2.0.tar.gz (759.0 kB)

Uploaded Mar 20, 2026 Source

Built Distributions


yantrikdb-0.2.0-cp313-cp313-win_amd64.whl (2.2 MB)

Uploaded Mar 20, 2026 CPython 3.13, Windows x86-64

yantrikdb-0.2.0-cp313-cp313-manylinux_2_34_x86_64.whl (2.8 MB)

Uploaded Mar 20, 2026 CPython 3.13, manylinux: glibc 2.34+ x86-64

yantrikdb-0.2.0-cp313-cp313-macosx_11_0_arm64.whl (2.3 MB)

Uploaded Mar 20, 2026 CPython 3.13, macOS 11.0+ ARM64

File details

Details for the file yantrikdb-0.2.0.tar.gz.

File metadata

Download URL: yantrikdb-0.2.0.tar.gz
Upload date: Mar 20, 2026
Size: 759.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: maturin/1.12.6

File hashes

Hashes for yantrikdb-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`6189a76b33d844d6293b8bc5089bf828b1ff56879cb395ea00d2b9257c2dca8a`
MD5	`cac80578eb25af37ce7728d6bc6d0191`
BLAKE2b-256	`40b1bac081c05c25d53ad819bef6053b3762518a14f09417a3e2ea807da9ae96`
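
A downloaded file can be checked against the SHA256 digest in the table using only the Python standard library. The helper names `sha256_of` and `verify` are illustrative, and the sketch assumes the file has already been downloaded locally.

```python
import hashlib
from pathlib import Path


def sha256_of(path: Path, chunk_size: int = 1 << 20) -> str:
    """Stream the file in chunks so large wheels do not need to fit in memory."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path: Path, expected_hex: str) -> bool:
    """Compare the computed digest with the published one, ignoring letter case."""
    return sha256_of(path) == expected_hex.lower()
```

For the source distribution, `verify(Path("yantrikdb-0.2.0.tar.gz"), "6189a76b33d844d6293b8bc5089bf828b1ff56879cb395ea00d2b9257c2dca8a")` should return `True` if the download is intact.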


File details

Details for the file yantrikdb-0.2.0-cp313-cp313-win_amd64.whl.

File metadata

Download URL: yantrikdb-0.2.0-cp313-cp313-win_amd64.whl
Upload date: Mar 20, 2026
Size: 2.2 MB
Tags: CPython 3.13, Windows x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: maturin/1.12.6

File hashes

Hashes for yantrikdb-0.2.0-cp313-cp313-win_amd64.whl
Algorithm	Hash digest
SHA256	`bf8c73274c80e9e95578243e5f584895010b824ec51111a857c968143fa674fc`
MD5	`c8732666c3fc222b7696109a62f3736b`
BLAKE2b-256	`e60fac11877108a7d4b35b216607d374b8658e27a6ddab9ac4a549ab3697cac2`


File details

Details for the file yantrikdb-0.2.0-cp313-cp313-manylinux_2_34_x86_64.whl.

File metadata

Download URL: yantrikdb-0.2.0-cp313-cp313-manylinux_2_34_x86_64.whl
Upload date: Mar 20, 2026
Size: 2.8 MB
Tags: CPython 3.13, manylinux: glibc 2.34+ x86-64
Uploaded using Trusted Publishing? Yes
Uploaded via: maturin/1.12.6

File hashes

Hashes for yantrikdb-0.2.0-cp313-cp313-manylinux_2_34_x86_64.whl
Algorithm	Hash digest
SHA256	`5ce779a6f5cb1d01bc044ba2ef419ffa2d03563283a533812abe50274b56f27a`
MD5	`5597aefc18f57c72a1f7c744668c6b83`
BLAKE2b-256	`c8de5033942a652ad9258a7d0edac77c2a8a52efab2ebcbc452a9a5047369b00`


File details

Details for the file yantrikdb-0.2.0-cp313-cp313-macosx_11_0_arm64.whl.

File metadata

Download URL: yantrikdb-0.2.0-cp313-cp313-macosx_11_0_arm64.whl
Upload date: Mar 20, 2026
Size: 2.3 MB
Tags: CPython 3.13, macOS 11.0+ ARM64
Uploaded using Trusted Publishing? Yes
Uploaded via: maturin/1.12.6

File hashes

Hashes for yantrikdb-0.2.0-cp313-cp313-macosx_11_0_arm64.whl
Algorithm	Hash digest
SHA256	`bbc1c242b22419bb592a1a42b40af06ae1601c29cdb4cd854b3094d27c3f5581`
MD5	`b8c7f7aad2ceb484510ee4fd057dff78`
BLAKE2b-256	`b84a2ac9c53a07f51c1cb2b99b620be1aac8610e24c5f140485d389b74d08c57`


yantrikdb 0.2.0
