Persistent AI memory without RAG — four-tier retrieval (SQLite FTS5 → graph → vectors → LLM agent)
Project description
supermem
Persistent AI memory without RAG — four-tier retrieval that uses an LLM agent only as a last resort, backed by SQLite FTS5, an embedded graph database, and your local markdown vault.
An MCP (Model Context Protocol) server that gives AI assistants — Claude Desktop, LM Studio, ChatGPT — persistent, structured memory backed by SQLite + an optional graph database. The LLM agent is tier 4, not the default path — most queries resolve in milliseconds via full-text search.
Quick Start (Personal, No GPU)
pip install supermem
# Point supermem at a directory of markdown files
export SUPERMEM_VAULT_PATH=~/notes
export SUPERMEM_LLM_PROVIDER=openrouter
export OPENROUTER_API_KEY=your_key_here
# Start the MCP server (add to Claude Desktop's mcp.json)
supermem serve
Add to Claude Desktop mcp.json:
{
"mcpServers": {
"supermem": {
"command": "supermem",
"args": ["serve"]
}
}
}
Quick Start (Production with Docker)
# Clone and configure
git clone https://github.com/lamenting-hawthorn/supermem
cp .env.example .env
# Edit .env: set SUPERMEM_VAULT_PATH, SUPERMEM_LLM_PROVIDER, API keys
# MCP server only (stdio, for Claude Desktop)
docker compose up supermem-mcp
# MCP server + HTTP dashboard
docker compose --profile worker up
# Dashboard at http://localhost:37777
Architecture: Four-Tier Retrieval
Every query goes through tiers in order, short-circuiting when enough results are found. Tiers 1–3 never call an LLM.
Query
│
├─ Tier 1: SQLite FTS5 full-text search ~1ms always available
│ porter tokenizer, WAL mode
│
├─ Tier 2: Kuzu embedded graph expansion ~5ms optional (install kuzu)
│ BFS traversal via [[wikilink]] edges
│
├─ Tier 3: ChromaDB vector similarity ~50ms optional (SUPERMEM_VECTOR=true)
│ sentence-transformer embeddings
│
└─ Tier 4: LLM agent fallback ~5-30s always available
navigates vault via Python sandbox
Short-circuit rule: if tier 1 returns ≥ min_results (default 3), tiers 2–4 are skipped entirely. Unavailable tiers are skipped with a WARNING log — no errors raised.
MCP Tool Reference
| Tool | Parameters | Returns | Notes |
|---|---|---|---|
use_memory_agent |
query: str |
Formatted answer | Backward-compatible. Routes through all 4 tiers; falls back to full agent only if tiers 1–3 insufficient |
supermem_hybrid |
query: str, tier_limit: int = 4 |
JSON with obs_ids, source_tier, latency_ms |
Preferred for programmatic use. Token-efficient — returns IDs first |
get_observations |
ids: list[int] |
JSON array of observation dicts | Fetch full content for specific IDs |
get_timeline |
obs_id: int, window: int = 5 |
JSON array of chronological observations | Context around a specific observation |
Progressive Disclosure Pattern
# 1. Search — cheap, returns IDs only
result = await supermem_hybrid("Alice's project status", tier_limit=2)
# {"obs_ids": [42, 17, 88], "source_tier": 1, "latency_ms": 2.1}
# 2. Fetch — only for IDs you actually need
obs = await get_observations([42, 17])
# [{"id": 42, "content": "...", "tier_used": 1}, ...]
# 3. Timeline — context around interesting observations
ctx = await get_timeline(42, window=3)
Environment Variables
| Variable | Default | Description |
|---|---|---|
SUPERMEM_LLM_PROVIDER |
openrouter |
openrouter | ollama | vllm | claude | lmstudio |
SUPERMEM_LLM_MODEL |
provider default | Model string (e.g. openai/gpt-4o-mini, llama3) |
SUPERMEM_DB_PATH |
~/.supermem/supermem.db |
SQLite database path |
SUPERMEM_VAULT_PATH |
.memory_path file |
Markdown vault directory |
SUPERMEM_VECTOR |
false |
Set true to enable ChromaDB tier |
SUPERMEM_API_KEY |
(none) | Bearer token for HTTP API auth (disabled if unset) |
SUPERMEM_RATE_LIMIT |
60 |
Requests/minute limit |
SUPERMEM_WORKER_PORT |
37777 |
HTTP dashboard port |
SUPERMEM_COMPRESS_EVERY |
50 |
Observations written before LLM compression |
OPENROUTER_API_KEY |
(required for openrouter) | OpenRouter API key |
ANTHROPIC_API_KEY |
(required for claude) | Anthropic API key |
OLLAMA_HOST |
http://localhost:11434 |
Ollama server URL |
VLLM_HOST / VLLM_PORT |
localhost / 8000 |
vLLM server address |
LMSTUDIO_HOST |
http://localhost:1234 |
LM Studio server URL |
Connector Guide
Import external data into your vault with one command:
# ChatGPT export (Settings → Data controls → Export data → .zip)
supermem connect chatgpt ~/Downloads/chatgpt_export.zip
# Notion workspace export (.zip)
supermem connect notion ~/Downloads/notion_export.zip
# Nuclino workspace export (.zip)
supermem connect nuclino ~/Downloads/nuclino_export.zip
# GitHub repositories (live via API)
supermem connect github owner/repo1,owner/repo2 --token ghp_xxx
# Google Docs (OAuth, opens browser)
supermem connect google_docs "My Doc Name"
All connectors write markdown to your vault, then automatically index the files into SQLite + graph. Private content wrapped in <private>...</private> tags is stripped before indexing.
CLI Reference
supermem serve # Start MCP server (stdio transport, for Claude Desktop)
supermem serve --worker # Start MCP server + HTTP dashboard on :37777
supermem chat # Interactive terminal REPL (no client required)
supermem backup # Create timestamped .tar.gz (vault + SQLite)
supermem backup --output /path/to/archive.tar.gz
supermem restore <archive.tar.gz>
supermem connect <type> <source> [--token TOKEN] [--max-items N]
HTTP Dashboard (Optional)
Start with supermem serve --worker or docker compose --profile worker up.
| Endpoint | Method | Description |
|---|---|---|
/health |
GET | {"status":"ok","db":true,"graph":false,"vector":false} |
/sessions |
GET | Paginated session list with summaries |
/observations |
GET | Filter by session/date/type |
/search |
POST | {"query": "...", "tier_limit": 4} |
/index/rebuild |
POST | Reindex entire vault |
/backup |
GET | Streams vault + DB as .tar.gz |
/stats |
GET | {obs_count, entity_count, session_count, db_size_mb} |
Auth: Authorization: Bearer <SUPERMEM_API_KEY>. Disabled when env var is unset.
Privacy
Wrap sensitive content in <private>...</private> tags. It is stripped before writing to any storage layer (SQLite, Kuzu, ChromaDB). The content passes through to the agent sandbox only — it never persists.
# Meeting Notes
Alice discussed the roadmap.
<private>Budget: $2.4M approved for Q3</private>
Next steps: ship v2 by June.
Running Tests
uv run pytest tests/ -v # all tests
uv run pytest tests/unit/ -v # unit only (fast, no network)
uv run pytest tests/integration/ -v # integration (real storage)
uv run pytest tests/ --cov=supermem --cov-report=term-missing # with coverage
Coverage gate: 60% (CI enforced). Kuzu and Anthropic tests are auto-skipped if packages are not installed.
License
Apache 2.0 — see LICENSE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file supermem-0.3.0.tar.gz.
File metadata
- Download URL: supermem-0.3.0.tar.gz
- Upload date:
- Size: 498.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c98ed1ce42d9b78ce09fa1885bb7fbb6f2b24f9476b0b7cce6fe1c9f7559b19d
|
|
| MD5 |
152495c580345d4d568101531e1b4b5e
|
|
| BLAKE2b-256 |
b1e084391f287fea4702bdc08a1136050a9b9bde9a315be895bd5374b8705853
|
Provenance
The following attestation bundles were made for supermem-0.3.0.tar.gz:
Publisher:
ci.yml on lamenting-hawthorn/supermem
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
supermem-0.3.0.tar.gz -
Subject digest:
c98ed1ce42d9b78ce09fa1885bb7fbb6f2b24f9476b0b7cce6fe1c9f7559b19d - Sigstore transparency entry: 1779613741
- Sigstore integration time:
-
Permalink:
lamenting-hawthorn/supermem@bc4ec248247a23cc991905e98cfda58b129e9632 -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/lamenting-hawthorn
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@bc4ec248247a23cc991905e98cfda58b129e9632 -
Trigger Event:
push
-
Statement type:
File details
Details for the file supermem-0.3.0-py3-none-any.whl.
File metadata
- Download URL: supermem-0.3.0-py3-none-any.whl
- Upload date:
- Size: 157.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b894bdcabbe0be4ef6495a5433e130e16799364a171809949c9572c54878282a
|
|
| MD5 |
48b6265ab3cb82bfa6736886157b163b
|
|
| BLAKE2b-256 |
834265a5d6441da36bcd5862852ce31565f8c1d6d8f6ab8fb467d67430afa434
|
Provenance
The following attestation bundles were made for supermem-0.3.0-py3-none-any.whl:
Publisher:
ci.yml on lamenting-hawthorn/supermem
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
supermem-0.3.0-py3-none-any.whl -
Subject digest:
b894bdcabbe0be4ef6495a5433e130e16799364a171809949c9572c54878282a - Sigstore transparency entry: 1779613942
- Sigstore integration time:
-
Permalink:
lamenting-hawthorn/supermem@bc4ec248247a23cc991905e98cfda58b129e9632 -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/lamenting-hawthorn
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
ci.yml@bc4ec248247a23cc991905e98cfda58b129e9632 -
Trigger Event:
push
-
Statement type: