The Universal Memory Layer for Any AI Agent — Zero-Dependency, Sub-Millisecond, Fully Private
Project description
Mnemosyne is a universal, Hermes-first memory layer that works with any agent framework (Claude Code, Cursor, Codex, OpenWebUI, OpenClaw, or your own custom agent). One pip install, one SQLite database. No external services required.
Table of Contents
- Works With Everything
- Quick Start
- Benchmark
- CLI Usage
- Python API
- Architecture
- Why Mnemosyne?
- Configuration
- Hermes Plugin (23 tools)
- Contributing
- Support
- License
Works With Everything
| Platform | Method | Setup |
|---|---|---|
| Cursor | MCP | Add to .cursor/mcp.json |
| Claude Code | MCP | Add to claude.json |
| OpenAI Codex CLI | MCP | Add to .codex/mcp.json |
| Windsurf | MCP | Add to .windsurf/mcp_config.json |
| OpenWebUI | Native @tool | Drop bridge file into data/tools/ |
| OpenClaw | Native provider | pip install mnemosyne-memory[openclaw] |
| Hermes Agent | MCP + Plugin | Native — ships enabled |
| Any MCP client | MCP (stdio/SSE) | One config line |
| Any Python agent | Direct SDK | import mnemosyne |
See docs/integrations/ for complete setup guides per platform.
Quick Start
pip install mnemosyne-memory
# With all features (vector search + MCP server)
pip install "mnemosyne-memory[all]"
Add to your agent
MCP-based (Cursor, Claude Code, Codex, Windsurf):
{
"mcpServers": {
"mnemosyne": {
"command": "mnemosyne",
"args": ["mcp"],
"env": {}
}
}
}
Python SDK (any agent):
from mnemosyne import remember, recall
remember("User prefers dark mode interfaces")
results = recall("user preferences")
OpenWebUI: Drop a 1-line bridge file into data/tools/.
OpenClaw: Add provider: mnemosyne.integrations.openclaw:create_provider to config.
Benchmarks
Mnemosyne holds top-tier scores on the two major memory benchmarks, LongMemEval (ICLR 2025) and BEAM (ICLR 2026), both in one SQLite file, zero cloud dependencies.
LongMemEval (retrieval)
| System | Score | Notes |
|---|---|---|
| Mnemosyne (dense) | 98.9% Recall@All@5 | Apr 2026, bge-small-en-v1.5, 100 instances |
| Mempalace | 96.6% Recall@5 | AAAK + Palace architecture |
| Backboard | 93.4% | Independent assessment |
| Hindsight | 91.4% | Vectorize.io |
BEAM (end-to-end QA)
| Scale | Mnemosyne v3 | Honcho | Hindsight | LIGHT | RAG |
|---|---|---|---|---|---|
| 100K | 65.2% | 63.0% | 73.4% | 35.8% | 32.3% |
Per-ability (100K): IE 91.5% · MR 87.5% · TR 75.0% · ABS 100.0% · CR 50.0% · KU 50.0% · EO 25.0% · IF 62.5% · PF 54.5% · SUM 55.6%
BEAM retrieval (pure recall)
| Scale | Recall@10 | Latency | Storage | Messages |
|---|---|---|---|---|
| 100K | 20% | 372ms | 1.8 MB | 200 |
| 500K | 20% | 412ms | 3.2 MB | 1,000 |
| 1M | 20% | 493ms | 4.8 MB | 2,000 |
| 10M | 20% | 35ms | 7.2 MB | 20,000 |
Recall holds flat across all scales. 100% abstention accuracy, never hallucinates on unknowns. Episodic compression delivers 9.4x storage savings.
Full reports: docs/beam-benchmark.md
CLI Usage
# MCP server (works with any MCP client)
mnemosyne mcp # stdio (default)
mnemosyne mcp --transport sse --port 8080 # SSE (web clients)
# Direct memory ops
mnemosyne remember "User likes dark mode"
mnemosyne recall "preferences"
mnemosyne stats
mnemosyne sleep # Run consolidation
# Export / import
mnemosyne export --output backup.json
mnemosyne import --input backup.json
Python API
from mnemosyne import remember, recall
# Store a fact
remember("User prefers dark mode interfaces",
importance=0.9, source="preference")
# Store globally (visible across all sessions)
remember("User email is user@example.com",
importance=0.95, scope="global")
# Store with expiry
remember("Temp token: abc123",
importance=0.8, valid_until="2026-12-31")
# Search
results = recall("interface preferences", top_k=3)
# Temporal recall (recency boost)
results = recall("deployments",
temporal_weight=0.5, temporal_halflife=48.0)
# Entity extraction
remember("Met with Abdias about the v2 release",
extract_entities=True)
# LLM-driven fact extraction
remember("User said they prefer Python for backend work",
extract=True)
# Temporal triples (knowledge graph)
from mnemosyne.core.triples import TripleStore
kg = TripleStore()
kg.add("Maya", "assigned_to", "auth-migration",
valid_from="2026-01-15")
kg.query("Maya", as_of="2026-02-01")
# Memory banks (per-domain isolation)
from mnemosyne.core.banks import BankManager
BankManager().create_bank("work")
work_mem = Mnemosyne(bank="work")
work_mem.remember("Sprint review on Friday")
Advanced: BEAM Direct Access
from mnemosyne.core.beam import BeamMemory
beam = BeamMemory(session_id="my_session")
beam.remember("Important context", importance=0.9)
beam.consolidate_to_episodic(
summary="User likes Neovim",
source_wm_ids=["wm1"]
)
results = beam.recall("editor preferences", top_k=5)
Architecture
┌────────────────────────────────────────────────────────────┐
│ Any AI Agent │
│ (Hermes · Claude Code · Cursor · Codex · OpenWebUI · MCP) │
└────────────────────────┬───────────────────────────────────┘
│ MCP / SDK / Plugin
┌────────────────────────▼───────────────────────────────────┐
│ Mnemosyne BEAM │
│ ┌────────────┐ ┌──────────────┐ ┌────────────────────┐ │
│ │ Working │ │ Episodic │ │ TripleStore │ │
│ │ Memory │──▶│ Memory │ │ (Temporal KG) │ │
│ │ (hot ctx) │ │ (long-term) │ └────────────────────┘ │
│ └────────────┘ └──────┬───────┘ │
│ │ │
│ ┌──────────▼──────────┐ │
│ │ SQLite DB │ │
│ │ (single file) │ │
│ │ sqlite-vec + FTS5 │ │
│ │ MIB binary vectors │ │
│ └─────────────────────┘ │
└─────────────────────────────────────────────────────────────┘
BEAM (Bilevel Episodic-Associative Memory):
- Working memory — Hot context, auto-injected before LLM calls, TTL-based eviction
- Episodic memory — Long-term storage with sqlite-vec + FTS5 hybrid search
- TripleStore — Temporal knowledge graph with version chains
Hybrid scoring: 50% vector similarity + 30% FTS5 rank + 20% importance, all inside SQLite.
Binary vectors: Information-theoretic binarization (MIB) compresses 384-dim float32 embeddings into 48 bytes — 32x reduction. Hamming distance entirely within SQLite. No ANN indices, no external vector DB.
Why Mnemosyne?
| Feature | Mnemosyne | mem0 | Letta | Honcho | SuperMemory | Hindsight | ChromaDB |
|---|---|---|---|---|---|---|---|
| Local-first | ✅ SQLite | ⚠️ Hybrid | ❌ Docker+PG | ⚠️ PG+worker | ❌ SaaS | ✅ SQLite | ✅ Embedded |
| Zero deps | ✅ pip only | ❌ Qdrant/PG | ❌ PG+vector | ❌ PG+3 LLMs | ❌ SaaS infra | ✅ pip only | ✅ pip only |
| MCP server | ✅ Built-in | ❌ | ❌ | ❌ | ✅ | ❌ | ❌ |
| Python SDK | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ | ✅ |
| Multi-platform | ✅ 8+ targets | ⚠️ 3 adapters | ❌ Agent-only | ⚠️ 4 adapters | ✅ MCP | ❌ Agent-only | ❌ Library only |
| Open source | ✅ MIT | ✅ Apache 2.0 | ✅ OSS | ⚠️ AGPL | ❌ Proprietary | ✅ MIT | ✅ Apache 2.0 |
| Benchmark | 65.2% BEAM / 98.9% LongMem | 49% LongMem | 83.2% LoCoMo | 90.4% LongMem | 85.2% MemoryBench | 73.4% BEAM | N/A (vector DB) |
| Self-hosted | ✅ Yes | ✅ Optional | ✅ Optional | ✅ Yes | ❌ Enterprise | ✅ Yes | ✅ Yes |
| Integration template | ✅ Published | ❌ | ❌ | ❌ | ❌ | ❌ | ❌ |
| Memory architecture | BEAM (3-tier) | Session + facts | OS-virtual context | Peer + reasoning | 5-layer stack | Episodic + semantic | Vector store only |
| Purpose | Full memory system | Memory API | Agent runtime | Managed memory | Consumer + agent | Research memory | Vector database |
Configuration
Environment Variables
| Variable | Default | Description |
|---|---|---|
MNEMOSYNE_DATA_DIR |
~/.hermes/mnemosyne/data |
Database directory |
MNEMOSYNE_VEC_TYPE |
int8 |
Vector compression: float32, int8, or bit |
MNEMOSYNE_VEC_WEIGHT |
0.5 |
Vector similarity weight |
MNEMOSYNE_FTS_WEIGHT |
0.3 |
FTS5 keyword weight |
MNEMOSYNE_IMPORTANCE_WEIGHT |
0.2 |
Importance weight |
MNEMOSYNE_WM_MAX_ITEMS |
10000 |
Working memory limit |
MNEMOSYNE_RECENCY_HALFLIFE |
168 |
Decay halflife in hours |
| MNEMOSYNE_EMBEDDING_API_URL | ${OPENROUTER_BASE_URL:-https://openrouter.ai/api/v1} | Preferred name for custom embedding API endpoint (OpenAI-compatible). Falls back to OPENROUTER_BASE_URL. |
| MNEMOSYNE_EMBEDDING_API_KEY | ${OPENROUTER_API_KEY:-${OPENAI_API_KEY:-}} | Preferred name for embedding API key. Falls back to OPENROUTER_API_KEY, then OPENAI_API_KEY. |
| MNEMOSYNE_EMBEDDING_MODEL | BAAI/bge-small-en-v1.5 | Embedding model. Swap for multilingual: BAAI/bge-m3, intfloat/multilingual-e5-base, etc. |
Full reference: docs/configuration.md
Language Support
Default embeddings are English-optimized (bge-small-en-v1.5). For non-English or multilingual recall, swap the model:
# Multilingual (100+ languages)
export MNEMOSYNE_EMBEDDING_MODEL=BAAI/bge-m3
# Or Chinese, German, etc.
export MNEMOSYNE_EMBEDDING_MODEL=BAAI/bge-small-zh-v1.5
See docs/configuration.md#custom-embedding-models for tradeoffs (RAM, speed, dimension changes).
Hermes Plugin (23 tools)
When used with Hermes Agent, Mnemosyne exposes 23 tools for full memory lifecycle management — 3 lifecycle hooks (pre_llm_call, on_session_start, post_tool_call) for automatic context injection, plus MCP support.
Install (Hermes users):
pip install mnemosyne-hermes
hermes config set memory.provider mnemosyne
hermes memory setup
Then disable Hermes' built-in file memory to avoid duplication:
hermes tools disable memory
See docs/hermes-integration.md for the full setup guide.
Tool categories
| Category | Tools |
|---|---|
| Core memory (9) | remember, recall, sleep, stats, get, update, forget, invalidate, validate |
| Knowledge graph (4) | triple_add, triple_query, graph_query, graph_link |
| Multi-agent surface (4) | shared_remember, shared_recall, shared_forget, shared_stats |
| Working notes (3) | scratchpad_write, scratchpad_read, scratchpad_clear |
| Ops (3) | export, import, diagnose |
All 23 tools surface through the mnemosyne-hermes package, which wraps the mnemosyne-memory core library. The plugin manifest at integrations/hermes/ is also discoverable by Hermes' plugin system.
Updating: pip install --upgrade mnemosyne-hermes && hermes gateway restart or git pull && pip install --upgrade integrations/hermes && hermes gateway restart (source).
Contributing
See CONTRIBUTING.md for guidelines.
Full docs: docs/ · Changelog: CHANGELOG.md · Releases: GitHub Releases · Integrations: docs/integrations/
Support
Discord: Join the Mnemosyne community · Issues: GitHub Issues
⭐ Star the repo if you find it useful!
License
MIT License — See LICENSE
Copyright (c) 2026 Abdias J
"The faintest ink is more powerful than the strongest memory." — Hermes Trismegistus
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mnemosyne_memory-3.3.0.tar.gz.
File metadata
- Download URL: mnemosyne_memory-3.3.0.tar.gz
- Upload date:
- Size: 1.1 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7e9140eab3893e7d639aacf482b9c2a1d0c8d10abe993e729101eb6f272483ef
|
|
| MD5 |
6e32978dd8fbd95963b0e0a0f7bce477
|
|
| BLAKE2b-256 |
627082472d7bde6b61d6a75c9fc15d8f3aa7b25d56f5630e3f60c1909846f40c
|
Provenance
The following attestation bundles were made for mnemosyne_memory-3.3.0.tar.gz:
Publisher:
release.yml on AxDSan/mnemosyne
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
mnemosyne_memory-3.3.0.tar.gz -
Subject digest:
7e9140eab3893e7d639aacf482b9c2a1d0c8d10abe993e729101eb6f272483ef - Sigstore transparency entry: 1688485443
- Sigstore integration time:
-
Permalink:
AxDSan/mnemosyne@741516bce206b55c90274705229dc18dc580494c -
Branch / Tag:
refs/tags/v3.3.0 - Owner: https://github.com/AxDSan
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@741516bce206b55c90274705229dc18dc580494c -
Trigger Event:
push
-
Statement type:
File details
Details for the file mnemosyne_memory-3.3.0-py3-none-any.whl.
File metadata
- Download URL: mnemosyne_memory-3.3.0-py3-none-any.whl
- Upload date:
- Size: 449.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
57a3c4c601e1321841541f5128d3ca60f0618629f3d972f3a4121ea40ded4d6f
|
|
| MD5 |
ee477ee0dfcc032597f4ff7a93b944bb
|
|
| BLAKE2b-256 |
e8971295a1e813690e15b0c53e26a109e2fcc21f0415d29a003de0608b82e187
|
Provenance
The following attestation bundles were made for mnemosyne_memory-3.3.0-py3-none-any.whl:
Publisher:
release.yml on AxDSan/mnemosyne
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
mnemosyne_memory-3.3.0-py3-none-any.whl -
Subject digest:
57a3c4c601e1321841541f5128d3ca60f0618629f3d972f3a4121ea40ded4d6f - Sigstore transparency entry: 1688485461
- Sigstore integration time:
-
Permalink:
AxDSan/mnemosyne@741516bce206b55c90274705229dc18dc580494c -
Branch / Tag:
refs/tags/v3.3.0 - Owner: https://github.com/AxDSan
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@741516bce206b55c90274705229dc18dc580494c -
Trigger Event:
push
-
Statement type: