Self-hostable, governed, vendor-neutral memory for AI agents (PostgreSQL + pgvector).

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

thameema

These details have not been verified by PyPI

Project links

Homepage

Project description

memnos

Self-hostable, governed, vendor-neutral memory for AI agents — on one PostgreSQL.

memnos gives AI agents long-term memory that persists across sessions, with governance built in (token auth, namespace ACL, audit, an encrypted secret vault) and no vendor lock-in. It runs on a single PostgreSQL + pgvector database — no second vector store, no graph database — and uses no LLM at query time (retrieval is hybrid search + a local cross-encoder reranker). Works with Claude Code, Cursor, Windsurf and any MCP client, plus a REST API and a cross-platform CLI.

Released on PyPI (uv tool install memnos). Apache-2.0 · self-hostable · single-org · local-first.

Claude Code ─┐
Cursor       ├─ MCP (stdio) ─┐
Windsurf     ─┘              │
hooks (auto) ───────────────┼─► memnos server ──► PostgreSQL + pgvector  (ONE engine)
REST / CLI ─────────────────┘     ├─ hybrid retrieve: pgvector (HNSW) + BM25 (tsvector) → RRF
                                   │   → cross-encoder rerank → quota + timeline + entity arms
                                   │   (NO LLM at query time)
                                   ├─ bi-temporal facts + belief-change supersession
                                   ├─ governance: token auth · namespace ACL · audit · usage
                                   └─ encrypted secret vault (AES-256-GCM) + ingest redaction

What makes memnos different

One engine. Everything lives in a single PostgreSQL + pgvector — no second vector store, no graph database to run, scale, secure, or back up.
Deterministic memory. Conflicting facts are resolved by rule (bi-temporal, single-valued supersession), not by asking an LLM at write time — so writes are predictable and reproducible.
No LLM at query time. Recall is hybrid search (pgvector HNSW + BM25, RRF) → a local cross-encoder rerank → quota / timeline / entity arms. Fast, cheap, and private.
Governed by default. Token auth, namespace ACL, audit log, usage/cost ledger, and an encrypted secret vault with ingest redaction — in the open-source build.
Vendor-neutral, self-hosted. Apache-2.0, your Postgres, your data, your LLM keys (never stored in plaintext).

memnos is a governed memory engine, not an agent runtime. A detailed, version-pinned comparison with other memory systems lives at memnos.net/compare.

Quickstart (local)

Prerequisite: PostgreSQL 13+ with the pgvector ≥ 0.7 extension available. memnos does not install Postgres — it connects to yours. (For local dev: docker compose -f docker-compose.dev.yml up -d.)

Install the memnos command into its own isolated environment (recommended — uv is fastest; pipx also works). Don't pip install into your system Python — a polluted or half-upgraded system interpreter will fail to load native deps like psycopg.

uv tool install memnos        # recommended  (no uv? `brew install uv`  or
                              #  curl -LsSf https://astral.sh/uv/install.sh | sh)
# or:  pipx install memnos
# or run ./install.sh (macOS/Linux) / .\install.ps1 (Windows) — picks uv→pipx for you

memnos setup                  # enter your Postgres connection → creates schema + admin token
memnos serve                  # start the server → open http://127.0.0.1:8900/admin

Inside your own virtualenv, plain pip install memnos is fine too — python -m venv .venv && .venv/bin/pip install memnos.

memnos --help covers everything: setup serve token grant principal namespace secret stats health whoami ns remember recall. Config (DSN, vault key, port) lives in ~/.memnos/config.json. Full walkthrough: QUICKSTART.md.

An OpenAI key (in .env or memnos secret set openai → OPENAI_API_KEY=secret://openai) enables 1536-d embeddings + fact extraction. Without it, memnos runs in free local 384-d mode (embeddings only, no extraction). memnos never holds your LLM key in plaintext — it stays in .env or the encrypted vault.

Integrations

One command wires memnos into your agent — no manual config editing:

memnos claude-setup            # Claude Code: MCP + hooks (auto recall/save) + /memnos + CLAUDE.md
memnos agent-setup codex       # Codex CLI (MCP via ~/.codex/config.toml + AGENTS.md)
memnos agent-setup cursor      # Cursor
memnos agent-setup windsurf    # Windsurf
memnos agent-setup claude-desktop

Each mints a scoped token, is idempotent, and backs up edited files; memnos setup runs claude-setup automatically when it detects Claude Code. Claude Code is the only agent with lifecycle hooks (auto-recall before each prompt, auto-save after); every other agent gets the memnos MCP tools (recall, recall_wide, remember, reconcile_claim, …).

REST — POST /remember, POST /recall (Bearer token, namespace-scoped).
CLI / SDK — memnos remember/recall, or uv pip install memnos-sdk (LangChain / LangGraph / LlamaIndex adapters).
Full client guides: docs/guides/clients/.

REST, MCP, hooks and the benchmark all run the same engine (MemnosMemory) — there is one codebase, not a benchmarked copy and a shipped copy.

Management console + governance

A zero-build web console ships in the open-source build at /admin (create namespaces, mint/revoke tokens, manage grants, view the dashboard, store secrets). Every call is token-authenticated, namespace-ACL'd, and audited. (SSO/OIDC, advanced RBAC, multi-tenant control plane, and the richer enterprise UI are the commercial layer.)

memnos admin          # bootstrap an admin token → paste into /admin

Benchmarks — LoCoMo (and how we report it)

57–61% under the gpt-4o judge / 58% under an independent cross-provider judge on the full LoCoMo benchmark (10 conversations, 1,542 QA). The gpt-4o-judge band is reproduced from scratch — benchmarks/locomo_eval.py on a fresh clone + DB scored 57%, 58% and 61% across independent ingests (the spread is non-deterministic extraction, not the engine), with every prediction published under benchmarks/results/.

We care more about credibility than a big headline:

Setup: full 10 conversations. Ingest → bi-temporal SPO fact extraction (gpt-4o-mini) + consolidation; retrieve via hybrid (pgvector + BM25, RRF) + cross-encoder rerank + timeline / entity-guarantee arms — no LLM at query time; answer with the calling agent (GPT-5-mini in our run); judge with an LLM.
Independent judging: most published numbers are self-judged (the same vendor's model grades its own answers). We additionally score under an independent provider's judge (Claude grading GPT answers) to remove self-preference bias.
Judge transparency: the score is judge-sensitive. On the same answers we measure a strict ~44% / standard 57-61% / lenient 85-88% band — so you can see how much the judge prompt moves any number.
On comparisons: headlines elsewhere (~66% Mem0, ~73% Mnemory, 90%+ others) are typically self-judged and sometimes on a different benchmark (e.g. DMR, not LoCoMo). We don't claim parity — we publish a reproducible harness.

Reproduce: python benchmarks/locomo_eval.py --sample-ids 0,1,2,3,4,5,6,7,8,9 (see benchmarks/).

We'd rather report a credible 58% under an independent judge than an inflated 85% under a lenient one.

How it works

Write (LLM at ingest only): a message becomes a verbatim raw turn and structured bi-temporal SPO facts. Single-valued attributes (lives_in, works_at) supersede on change; multi-valued ones (did, visited) accumulate. Secrets are redacted before storage. An offline "sleep" pass consolidates facts into entity dossiers (multi-hop pre-join).

Read (no LLM): the query runs hybrid retrieval (pgvector HNSW + BM25 via tsvector, fused with RRF), a cross-encoder reranks, and quota retrieval guarantees raw-turn + fact coverage. Temporal questions add a guaranteed entity timeline; entity questions add an entity-guarantee arm so list/aggregation answers are complete.

Security & operations

Auth: opaque bearer tokens (SHA-256 hashed at rest; instantly revocable — not JWTs).
ACL: every read/write is clamped to the principal's namespace grants.
Audit + usage ledger: who/what/when + per-op LLM cost.
Secret vault: AES-256-GCM, value-refs (secret://name), key rotation.
Redaction: secret-shaped text is stripped from remembered messages before storage.
Health heuristic: memnos health turns metrics into actionable findings.

Local-first: the server binds 127.0.0.1. Put a TLS reverse proxy in front for remote use.

License

Apache-2.0. The open-source build is the engine + single-org self-host + the basic management console. SSO/advanced RBAC, encrypted-vault key management (KMS/HSM, rotation policies), the multi-tenant control plane, the richer enterprise UI, and managed cloud are the commercial layer.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

thameema

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.1.4

Jun 10, 2026

This version

0.1.3

Jun 10, 2026

0.1.2

Jun 9, 2026

0.1.1

Jun 9, 2026

0.1.0

Jun 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

memnos-0.1.3.tar.gz (101.9 kB view details)

Uploaded Jun 10, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

memnos-0.1.3-py3-none-any.whl (90.7 kB view details)

Uploaded Jun 10, 2026 Python 3

File details

Details for the file memnos-0.1.3.tar.gz.

File metadata

Download URL: memnos-0.1.3.tar.gz
Upload date: Jun 10, 2026
Size: 101.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for memnos-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`2dd501e4cc37a25711e4c445470deb0cc363c7800caa28ebbe238e4d128b81c3`
MD5	`0c82d3c08ed0d4f49e5a23f1117c1c42`
BLAKE2b-256	`31220ccc15036640800f4b20cf95e5d73afee4f0ca0ac603fa1490a7bfb8623a`

See more details on using hashes here.

Provenance

The following attestation bundles were made for memnos-0.1.3.tar.gz:

Publisher: release.yml on thameema/memnos

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: memnos-0.1.3.tar.gz
- Subject digest: 2dd501e4cc37a25711e4c445470deb0cc363c7800caa28ebbe238e4d128b81c3
- Sigstore transparency entry: 1774329053
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: thameema/memnos@9e81e0403a08bad39ae4ab123cef1b7a688bb3dd
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/thameema
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@9e81e0403a08bad39ae4ab123cef1b7a688bb3dd
- Trigger Event: release

File details

Details for the file memnos-0.1.3-py3-none-any.whl.

File metadata

Download URL: memnos-0.1.3-py3-none-any.whl
Upload date: Jun 10, 2026
Size: 90.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for memnos-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`fdbd143057f80adabe873acde22721da656af6785a7a87d19252d944d2e8525c`
MD5	`453b6d085fe5aa8b3390b9b53b91b2e7`
BLAKE2b-256	`a47e92bb7a5fb9394dfd345f5db5f5a4c61af5966dace22558132cf874af5726`

See more details on using hashes here.

Provenance

The following attestation bundles were made for memnos-0.1.3-py3-none-any.whl:

Publisher: release.yml on thameema/memnos

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: memnos-0.1.3-py3-none-any.whl
- Subject digest: fdbd143057f80adabe873acde22721da656af6785a7a87d19252d944d2e8525c
- Sigstore transparency entry: 1774329246
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: thameema/memnos@9e81e0403a08bad39ae4ab123cef1b7a688bb3dd
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/thameema
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@9e81e0403a08bad39ae4ab123cef1b7a688bb3dd
- Trigger Event: release

memnos 0.1.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

memnos

What makes memnos different

Quickstart (local)

Integrations

Management console + governance

Benchmarks — LoCoMo (and how we report it)

How it works

Security & operations

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance