Self-hostable, governed, vendor-neutral memory for AI agents (PostgreSQL + pgvector).
Project description
memnos
Self-hostable, governed, vendor-neutral memory for AI agents — on one PostgreSQL.
memnos gives AI agents long-term memory that persists across sessions, with governance built in (token auth, namespace ACL, audit, an encrypted secret vault) and no vendor lock-in. It runs on a single PostgreSQL + pgvector database — no second vector store, no graph database — and uses no LLM at query time (retrieval is hybrid search + a local cross-encoder reranker). Works with Claude Code, Cursor, Windsurf and any MCP client, plus a REST API and a cross-platform CLI.
Released on PyPI (
uv tool install memnos). Apache-2.0 · self-hostable · single-org · local-first.
Claude Code ─┐
Cursor ├─ MCP (stdio) ─┐
Windsurf ─┘ │
hooks (auto) ───────────────┼─► memnos server ──► PostgreSQL + pgvector (ONE engine)
REST / CLI ─────────────────┘ ├─ hybrid retrieve: pgvector (HNSW) + BM25 (tsvector) → RRF
│ → cross-encoder rerank → quota + timeline + entity arms
│ (NO LLM at query time)
├─ bi-temporal facts + belief-change supersession
├─ governance: token auth · namespace ACL · audit · usage
└─ encrypted secret vault (AES-256-GCM) + ingest redaction
What makes memnos different
- One engine. Everything lives in a single PostgreSQL + pgvector — no second vector store, no graph database to run, scale, secure, or back up.
- Deterministic memory. Conflicting facts are resolved by rule (bi-temporal, single-valued supersession), not by asking an LLM at write time — so writes are predictable and reproducible.
- No LLM at query time. Recall is hybrid search (pgvector HNSW + BM25, RRF) → a local cross-encoder rerank → quota / timeline / entity arms. Fast, cheap, and private.
- Governed by default. Token auth, namespace ACL, audit log, usage/cost ledger, and an encrypted secret vault with ingest redaction — in the open-source build.
- Vendor-neutral, self-hosted. Apache-2.0, your Postgres, your data, your LLM keys (never stored in plaintext).
memnos is a governed memory engine, not an agent runtime. A detailed, version-pinned comparison with other memory systems lives at memnos.net/compare.
Quickstart (local)
Prerequisite: PostgreSQL 13+ with the pgvector ≥ 0.7 extension available. memnos does not
install Postgres — it connects to yours. (For local dev: docker compose -f docker-compose.dev.yml up -d.)
Install the memnos command into its own isolated environment (recommended — uv is
fastest; pipx also works). Don't pip install into your system Python — a polluted or
half-upgraded system interpreter will fail to load native deps like psycopg.
uv tool install memnos # recommended (no uv? `brew install uv` or
# curl -LsSf https://astral.sh/uv/install.sh | sh)
# or: pipx install memnos
# or run ./install.sh (macOS/Linux) / .\install.ps1 (Windows) — picks uv→pipx for you
memnos setup # enter your Postgres connection → creates schema + admin token
memnos serve # start the server → open http://127.0.0.1:8900/admin
Inside your own virtualenv, plain
pip install memnosis fine too —python -m venv .venv && .venv/bin/pip install memnos.
memnos --help covers everything: setup serve token grant principal namespace secret stats health whoami ns remember recall. Config (DSN, vault key, port) lives in
~/.memnos/config.json. Full walkthrough: QUICKSTART.md.
An OpenAI key (in .env or memnos secret set openai → OPENAI_API_KEY=secret://openai)
enables 1536-d embeddings + fact extraction. Without it, memnos runs in free local 384-d
mode (embeddings only, no extraction). memnos never holds your LLM key in plaintext —
it stays in .env or the encrypted vault.
Integrations
One command wires memnos into your agent — no manual config editing:
memnos claude-setup # Claude Code: MCP + hooks (auto recall/save) + /memnos + CLAUDE.md
memnos agent-setup codex # Codex CLI (MCP via ~/.codex/config.toml + AGENTS.md)
memnos agent-setup cursor # Cursor
memnos agent-setup windsurf # Windsurf
memnos agent-setup claude-desktop
Each mints a scoped token, is idempotent, and backs up edited files; memnos setup runs
claude-setup automatically when it detects Claude Code. Claude Code is the only agent
with lifecycle hooks (auto-recall before each prompt, auto-save after); every other agent
gets the memnos MCP tools (recall, recall_wide, remember, reconcile_claim, …).
- REST —
POST /remember,POST /recall(Bearer token, namespace-scoped). - CLI / SDK —
memnos remember/recall, oruv pip install memnos-sdk(LangChain / LangGraph / LlamaIndex adapters). - Full client guides:
docs/guides/clients/.
REST, MCP, hooks and the benchmark all run the same engine (MemnosMemory) — there is
one codebase, not a benchmarked copy and a shipped copy.
Management console + governance
A zero-build web console ships in the open-source build at /admin (create namespaces,
mint/revoke tokens, manage grants, view the dashboard, store secrets). Every call is
token-authenticated, namespace-ACL'd, and audited. (SSO/OIDC, advanced RBAC, multi-tenant
control plane, and the richer enterprise UI are the commercial layer.)
memnos admin # bootstrap an admin token → paste into /admin
Benchmarks — LoCoMo (and how we report it)
57–61% under the gpt-4o judge / 58% under an independent cross-provider judge on the full
LoCoMo benchmark (10 conversations, 1,542 QA). The gpt-4o-judge band is reproduced from
scratch — benchmarks/locomo_eval.py on a fresh clone + DB scored 57%, 58% and 61% across
independent ingests (the spread is non-deterministic extraction, not the engine), with every
prediction published under benchmarks/results/.
We care more about credibility than a big headline:
- Setup: full 10 conversations. Ingest → bi-temporal SPO fact extraction (gpt-4o-mini) + consolidation; retrieve via hybrid (pgvector + BM25, RRF) + cross-encoder rerank + timeline / entity-guarantee arms — no LLM at query time; answer with the calling agent (GPT-5-mini in our run); judge with an LLM.
- Independent judging: most published numbers are self-judged (the same vendor's model grades its own answers). We additionally score under an independent provider's judge (Claude grading GPT answers) to remove self-preference bias.
- Judge transparency: the score is judge-sensitive. On the same answers we measure a strict ~44% / standard 57-61% / lenient 85-88% band — so you can see how much the judge prompt moves any number.
- On comparisons: headlines elsewhere (~66% Mem0, ~73% Mnemory, 90%+ others) are typically self-judged and sometimes on a different benchmark (e.g. DMR, not LoCoMo). We don't claim parity — we publish a reproducible harness.
Reproduce: python benchmarks/locomo_eval.py --sample-ids 0,1,2,3,4,5,6,7,8,9
(see benchmarks/).
We'd rather report a credible 58% under an independent judge than an inflated 85% under a lenient one.
How it works
Write (LLM at ingest only): a message becomes a verbatim raw turn and structured
bi-temporal SPO facts. Single-valued attributes (lives_in, works_at) supersede on
change; multi-valued ones (did, visited) accumulate. Secrets are redacted before
storage. An offline "sleep" pass consolidates facts into entity dossiers (multi-hop
pre-join).
Read (no LLM): the query runs hybrid retrieval (pgvector HNSW + BM25 via tsvector, fused with RRF), a cross-encoder reranks, and quota retrieval guarantees raw-turn + fact coverage. Temporal questions add a guaranteed entity timeline; entity questions add an entity-guarantee arm so list/aggregation answers are complete.
Security & operations
- Auth: opaque bearer tokens (SHA-256 hashed at rest; instantly revocable — not JWTs).
- ACL: every read/write is clamped to the principal's namespace grants.
- Audit + usage ledger: who/what/when + per-op LLM cost.
- Secret vault: AES-256-GCM, value-refs (
secret://name), key rotation. - Redaction: secret-shaped text is stripped from remembered messages before storage.
- Health heuristic:
memnos healthturns metrics into actionable findings.
Local-first: the server binds
127.0.0.1. Put a TLS reverse proxy in front for remote use.
License
Apache-2.0. The open-source build is the engine + single-org self-host + the basic management console. SSO/advanced RBAC, encrypted-vault key management (KMS/HSM, rotation policies), the multi-tenant control plane, the richer enterprise UI, and managed cloud are the commercial layer.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file memnos-0.1.3.tar.gz.
File metadata
- Download URL: memnos-0.1.3.tar.gz
- Upload date:
- Size: 101.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2dd501e4cc37a25711e4c445470deb0cc363c7800caa28ebbe238e4d128b81c3
|
|
| MD5 |
0c82d3c08ed0d4f49e5a23f1117c1c42
|
|
| BLAKE2b-256 |
31220ccc15036640800f4b20cf95e5d73afee4f0ca0ac603fa1490a7bfb8623a
|
Provenance
The following attestation bundles were made for memnos-0.1.3.tar.gz:
Publisher:
release.yml on thameema/memnos
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
memnos-0.1.3.tar.gz -
Subject digest:
2dd501e4cc37a25711e4c445470deb0cc363c7800caa28ebbe238e4d128b81c3 - Sigstore transparency entry: 1774329053
- Sigstore integration time:
-
Permalink:
thameema/memnos@9e81e0403a08bad39ae4ab123cef1b7a688bb3dd -
Branch / Tag:
refs/tags/v0.1.3 - Owner: https://github.com/thameema
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@9e81e0403a08bad39ae4ab123cef1b7a688bb3dd -
Trigger Event:
release
-
Statement type:
File details
Details for the file memnos-0.1.3-py3-none-any.whl.
File metadata
- Download URL: memnos-0.1.3-py3-none-any.whl
- Upload date:
- Size: 90.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fdbd143057f80adabe873acde22721da656af6785a7a87d19252d944d2e8525c
|
|
| MD5 |
453b6d085fe5aa8b3390b9b53b91b2e7
|
|
| BLAKE2b-256 |
a47e92bb7a5fb9394dfd345f5db5f5a4c61af5966dace22558132cf874af5726
|
Provenance
The following attestation bundles were made for memnos-0.1.3-py3-none-any.whl:
Publisher:
release.yml on thameema/memnos
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
memnos-0.1.3-py3-none-any.whl -
Subject digest:
fdbd143057f80adabe873acde22721da656af6785a7a87d19252d944d2e8525c - Sigstore transparency entry: 1774329246
- Sigstore integration time:
-
Permalink:
thameema/memnos@9e81e0403a08bad39ae4ab123cef1b7a688bb3dd -
Branch / Tag:
refs/tags/v0.1.3 - Owner: https://github.com/thameema
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
release.yml@9e81e0403a08bad39ae4ab123cef1b7a688bb3dd -
Trigger Event:
release
-
Statement type: