Bio-inspired persistent memory for AI agents. 5-layer architecture with FTS5, vector search, knowledge graph, and HybridRAG fusion.

These details have not been verified by PyPI

Project links

Project description

🧠 Rewind Memory

Bio-inspired persistent memory for AI agents.

Rewind gives AI agents structured, persistent memory that works like biological memory — keyword search, knowledge graphs, semantic vectors, score fusion, and intelligent retrieval. Every memory is classified by type (user, feedback, project, reference), weighted by recency, and checked for drift. Runs fully local on SQLite with zero external dependencies.

Architecture

┌─────────────────────────────────────────┐
│            L2 Orchestrator              │
│    (fusion · ranking · deduplication)   │
├─────────┬─────────┬─────────┬──────────┤
│   L0    │   L1    │   L3    │    L4    │
│Sensory  │ System  │  Graph  │  Vector  │
│ Buffer  │  Files  │         │  Search  │
│         │         │         │          │
│ SQLite  │  File   │ SQLite  │sqlite-vec│
│  FTS5   │ system  │  Graph  │ (vec0)   │
└─────────┴─────────┴─────────┴──────────┘

Layers

Layer	Name	Bio Analogy	Backend	Purpose
L0	Sensory Buffer	Sensory cortex	SQLite FTS5	Fast keyword search, BM25 ranking
L1	System Memory	Working memory	File system	Identity, preferences, context — loaded every session
L2	Orchestrator	Prefrontal cortex	In-process	Fuses keyword + graph + vector results
L3	Knowledge Graph	Association cortex	SQLite	Entity relationships, spreading activation, Hebbian learning
L4	Vector Search	Episodic memory	sqlite-vec	Semantic similarity via local embeddings (Ollama)

Quick Start

pip install rewind-memory
rewind doctor                    # Auto-diagnose and fix memory stack
rewind ingest-chats              # Backfill historical OpenClaw conversations
rewind watch                     # Real-time conversation indexing (L0 keyword search)
rewind search "what did we discuss about X"

Full walkthrough: docs/QUICKSTART.md — 5 minutes to first search.

# 1. Install Ollama for vector search (optional but recommended)
curl -fsSL https://ollama.com/install.sh | sh
ollama pull nomic-embed-text  # ~274MB

# 2. Install Rewind
pip install rewind-memory

# 3. Ingest your files
rewind ingest ~/my-project/docs/

# 4. Search
rewind search "what was decided about auth"

Commands

Command	Description
`rewind ingest <path>`	Ingest files into memory
`rewind search <query>`	Search memory
`rewind health`	Health check
`rewind doctor`	Auto-diagnose and fix issues
`rewind watch`	Real-time session watcher (NEW in 0.4.7)
`rewind ingest-chats`	Historical conversation backfill (NEW in 0.4.7)
`rewind watch-sessions`	Real-time conversation capture — indexes sessions as they happen
`rewind serve`	API server + background file watcher
`rewind remember <text>`	Store a manual note in memory
`rewind bench`	Run LoCoMo benchmark
`rewind proxy`	Memory-augmented LLM proxy
`rewind migrate`	Migrate backends (Pro)

Free vs Pro

Feature	Free	Pro
L0 Keyword Search (FTS5)	✅	✅
L3 Knowledge Graph (SQLite)	✅	✅
L4 Vector Search (sqlite-vec)	✅ (Ollama)	✅ (NV-Embed-v2)
`rewind doctor`	✅	✅
`rewind watch`	✅ (L0)	✅ (L0 + L5)
`rewind watch-sessions`	✅ (L0 + L3)	✅ (L0 + L3 + L5)
`rewind ingest-chats`	✅ (L0)	✅ (L0 + L5)
OCPlatform gateway autopatcher	✅	✅
Memory Proxy	✅	✅
L5 Semantic Search (Qdrant)	—	✅
L6 Document Store	—	✅
Multi-channel awareness	—	✅
Cloud GPU embeddings	—	✅
Cross-encoder reranking	—	✅
Memory extraction (post-turn)	—	✅
Neo4j Knowledge Graph	—	✅

Usage

from rewind.client import RewindClient
from rewind.config import default_config

client = RewindClient(default_config("free"))

# Search across all layers
results = client.search("what is Hebbian learning")
for r in results:
    print(f"{r.score:.2f} [{r.layer}] {r.text[:80]}")

# Ingest a document
client.ingest("notes/meeting.md")

# Health check
print(client.health())

CLI

# Search
rewind search "memory consolidation" --limit 10

# Ingest files
rewind ingest ./documents/

# Health check
rewind health

Integrations (Claude Code, Cursor, OpenClaw)

Rewind ships with an MCP server that works with any MCP-compatible tool:

Claude Code — add to ~/.claude/settings.json:

{
  "mcpServers": {
    "rewind": {
      "command": "rewind-mcp"
    }
  }
}

OpenClaw — native hook (recommended) or gateway patch:

# Route memory_search through Rewind
rewind-openclaw setup --url http://localhost:8031

# Native hook (recommended — survives OpenClaw updates)
rewind-openclaw hook
rewind-openclaw hook --verify
rewind-openclaw hook --remove

# Gateway patch (legacy — needs re-apply after updates)
rewind-openclaw patch

Cursor / Windsurf / Cline — add rewind-mcp as an MCP server in settings.

See docs/INTEGRATIONS.md for full setup guides.

Memory Proxy (zero-config, works with any tool)

The proxy auto-injects memory into every LLM call. No MCP needed — just change your API URL:

pip install rewind-memory[proxy]

# Ingest your project first
rewind ingest ./my-project/

# Start the memory proxy
rewind proxy --port 8080

Then point your tool at it:

# Cursor
OPENAI_BASE_URL=http://localhost:8080/v1 cursor .

# Aider
OPENAI_API_BASE=http://localhost:8080/v1 aider

# Any OpenAI-compatible tool
export OPENAI_BASE_URL=http://localhost:8080/v1

The proxy:

Searches memory on every prompt (keyword + graph + vector fusion)
Injects relevant context into the system message
Auto-budgets injection size based on model context limits
Passes through your API key to the upstream provider
Supports streaming responses
Works with OpenAI, Anthropic, NVIDIA, any OpenAI-compatible API

# Point at Anthropic instead of OpenAI
rewind proxy --port 8080 --upstream https://api.anthropic.com/v1

# Point at a local model
rewind proxy --port 8080 --upstream http://localhost:11434/v1

Configuration

# ~/.rewind/config.yaml
tier: free
workspace_path: ~/my-project

embedding:
  provider: ollama
  model: nomic-embed-text
  url: http://localhost:11434

layers:
  l0_sensory: true
  l1_stm: true
  l3_graph: true
  l4_workspace: true

Upgrading

Rewind Core is free and fully functional as a local memory stack. For additional layers (communications, documents, bio lifecycle) and cloud GPU embeddings, see Rewind Pro.

Pro extends the Core package via a plugin — install both and Pro layers activate automatically:

client = RewindClient(default_config("pro"), api_key="rw_live_...")

Feature	Core (Free)	Pro ($9/mo)	MOS (Custom)
Keyword search (FTS5)	✅	✅	✅
System memory	✅	✅	✅
Knowledge graph (SQLite)	✅	✅	✅
Semantic vector search	✅ local Ollama	✅ cloud GPU (A10G)	✅ custom GPU
HybridRAG fusion	✅ 5-layer (L0–L4)	full 7-layer (L0–L6)	full 7-layer
Memory type taxonomy	✅ (user/feedback/project/reference)	✅	✅
Recency weighting	✅ (type-aware decay)	✅	✅
Query-intent matching	✅ (boosts matching types)	✅	✅
Memory drift detection	✅ (flags stale references)	✅	✅
OpenClaw gateway autopatcher	✅ (pre-turn memory injection)	✅	✅
Communications memory (L5)	—	✅	✅
Document store (L6)	—	✅	✅
LLM relevance selection	—	✅ (cheap model side-query)	✅
Cross-encoder reranking	—	✅ (GPU-accelerated)	✅
Memory extraction (post-turn)	—	✅ (auto-extracts durable memories)	✅
Partial compaction	—	✅ (preserves task intent)	✅
Cloud GPU embeddings (NV-Embed-v2)	—	25K/mo included	custom volume rates
Retrieval feedback learning	—	✅	✅
Migration assist (→ Neo4j/Qdrant)	—	✅	✅
Bio lifecycle (decay, pruning)	—	—	✅
Air-gapped deployment	—	—	✅
Dedicated engineer + SLA	—	—	✅

Real-Time Conversation Capture

Capture conversations as they happen — no manual backfill needed:

# Watch all OCPlatform session files, index new turns into L0 + L3 + L5
rewind watch-sessions

# Custom session directory
rewind watch-sessions --session-dir /path/to/sessions

# With specific backends
rewind watch-sessions --qdrant-url http://localhost:6333 --embed-url http://localhost:8041/v1/embeddings

watch-sessions uses watchdog to monitor OpenClaw session JSONL files. When new conversation turns are written, they're immediately indexed into:

L0 (BM25 keyword search)
L3 (Knowledge graph — entity extraction + co-occurrence edges)
L5 (Qdrant semantic vectors — if available)

This complements the gateway pre-turn hook: the hook reads memory before each turn, watch-sessions writes new conversations into memory after each turn. Together they form a closed loop.

Requires: pip install 'watchdog>=3.0'

Development

git clone https://github.com/saraidefence/rewind-memory.git
cd rewind-memory
pip install -e ".[dev]"
pytest tests/

Patent

The 7-layer bio-inspired memory architecture is patent pending.

License

MIT — see LICENSE.

Built by SARAI Defence. Ukrainian-built. Patent pending.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.5.8

Apr 14, 2026

This version

0.5.7

Apr 14, 2026

0.5.5

Apr 10, 2026

0.5.4

Apr 10, 2026

0.5.3

Apr 6, 2026

0.5.2

Apr 4, 2026

0.5.1

Apr 4, 2026

0.5.0

Apr 4, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rewind_memory-0.5.7.tar.gz (73.8 kB view details)

Uploaded Apr 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

rewind_memory-0.5.7-py3-none-any.whl (77.2 kB view details)

Uploaded Apr 14, 2026 Python 3

File details

Details for the file rewind_memory-0.5.7.tar.gz.

File metadata

Download URL: rewind_memory-0.5.7.tar.gz
Upload date: Apr 14, 2026
Size: 73.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for rewind_memory-0.5.7.tar.gz
Algorithm	Hash digest
SHA256	`f1c4ea132f9f0c7e5a2ce8a967676c6a184ba725345df7e65a5a4dd625934d15`
MD5	`1a9b19644a448215cc7bbc068ee13d92`
BLAKE2b-256	`d7063ecb9aab88830290644ddc3632d2d8214eec778bbcc3d3b2b63b8a053dce`

See more details on using hashes here.

File details

Details for the file rewind_memory-0.5.7-py3-none-any.whl.

File metadata

Download URL: rewind_memory-0.5.7-py3-none-any.whl
Upload date: Apr 14, 2026
Size: 77.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for rewind_memory-0.5.7-py3-none-any.whl
Algorithm	Hash digest
SHA256	`73dd83a617dd6b62f6d6e7f880bfa8bca027790a187bdd3e143422b6f32742a7`
MD5	`59ba8540f0391f92c1acb7e9daf835dd`
BLAKE2b-256	`822f1575e993a3abee96a82af613b547b12913c819e6208860e60fff114bea32`

See more details on using hashes here.

rewind-memory 0.5.7

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

🧠 Rewind Memory

Architecture

Layers

Quick Start

Commands

Free vs Pro

Usage

CLI

Integrations (Claude Code, Cursor, OpenClaw)

Memory Proxy (zero-config, works with any tool)

Configuration

Upgrading

Real-Time Conversation Capture

Development

Patent

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes