Local-first Agentic Memory Layer Framework for MCP Agents • 74 tools • Hybrid search (FTS5 + vector + MMR) • GDPR • FIPS 140-3 ready • 100% local

These details have not been verified by PyPI

Project description

![M3 Memory]

M3 Memory

Local-first Agentic Memory Layer Framework for MCP Agents • 74 tools • Hybrid search (FTS5 + vector + MMR) • GDPR • FIPS 140-3 ready • 100% local

"Wait, you remember that?" — Stop re-explaining your project to your AI. Give it a long-term brain that stays 100% on your machine.

🚀 New to M3? Start here with our 5-minute "Human-First" guide.

macOS Windows Linux

Works with Claude Code, Gemini CLI, Aider, OpenCode, and any MCP-compatible agent. Quick one-line command to have your agent install chat log sub-system which saves verbatim chat log info, before compaction, with zero lag/latency and 100% retrieval recall. Just tell your AI agent "install m3-memory chat log sub-system" and your agent will automatically install it with all the proper hooks with some minimal customization questions from you (you can accept the default answers).

📦 Install

curl -fsSL https://raw.githubusercontent.com/skynetcmd/m3-memory/main/install.sh | bash

Installs on macOS or Linux with the single command above. Use this to install on Windows. Use this link to install manually and this to examine the script and what it does.

Claude Code users can also install as a plugin instead — gets you 15 /m3:* slash commands, a memory-curator subagent, and auto-wired hooks:

/plugin marketplace add skynetcmd/m3-memory
/plugin install m3@skynetcmd

Plugin reference · Claude.ai (web/desktop) connector

Add to your MCP config:

{
  "mcpServers": {
    "memory": { "command": "mcp-memory" }
  }
}

An embedder is optional but highly recommended. M3 functions as a pure keyword-search (FTS5/BM25) memory without one, but adding an embedder enables semantic retrieval and high-performance hybrid search.

🚀 Recommended: Integrated Sovereign Setup

For the best experience (Windows, Linux, or Apple Silicon), use our integrated, self-contained installer. It sets up a private instance of LM Studio and our preferred model, BGE-M3, directly in your project folder.

mcp-memory install-embedder

🍎 Older Intel Macs

If you are on an older Intel-based Mac, LM Studio is not supported. We recommend using Ollama instead:

ollama pull qwen3-embedding:0.6b && ollama serve

Other Options

You can also use a standalone Ollama or LM Studio instance. Qwen3-Embedding-0.6B (1024-dim) and BGE-M3 are the models M3 Memory is tuned for. If you use a different model, set EMBED_MODEL in your environment. If no embedder is detected at startup, M3 will automatically fall back to keyword-only mode.

Want auto-classification, summarization, and consolidation? Load a small chat model alongside the embedder (e.g. qwen2.5:0.5b via Ollama, or any 0.5–1B instruct GGUF in LM Studio / llama.cpp). M3 auto-selects it; embedding-only features work without it. See docs/QUICKSTART.md → Optional: load a small chat model.

Restart your agent. Done!

🛡️ Sovereign Embedder (Air-gapped / Offline)

M3-Memory can be installed as a completely self-contained "memory appliance" for secure or air-gapped environments. This mode includes the embedding engine (LM Studio) and the BGE-M3 model directly in the project folder—no internet connection required after the initial clone.

See the 🛡️ Sovereign & Air-Gapped Deployment Guide for full instructions.

1. Unified Setup

If you are installing from a USB drive or in an offline room, run:

mcp-memory install-embedder

Need a clean slate? You can wipe and reinstall the system payload at any time with:

mcp-memory reinstall

2. Configuration

By default, M3-Memory stores its configuration, repository payload, and backups in ~/.m3-memory. You can override this by setting the M3_MEMORY_ROOT environment variable.

3. What it does:

Zero-Dependency: Operates entirely via file-system migration. No curl, pip, or external calls.
Hardware Optimized: Automatically detects your OS and architecture (Apple Silicon, Windows x64, Linux x64, or Linux ARM64) and moves the matching binaries.
Surgical Purge: Once you choose your mode (CPU vs. GPU), it permanently deletes all unused OS binaries and model variants. It will report exactly how many MB of unneeded setup files were deleted.
Stealth Portability: Installs into a hidden .m3-lmstudio directory. If you move the project folder, M3 self-heals its absolute paths in Windows Startup, macOS LaunchAgents, or Linux Systemd units.
Clean Integration: Locks the local server to 127.0.0.1:8081 and auto-wires your .env file.

3. Existing LM Studio instances

If a local instance of LM Studio is already detected, the installer will:

Offer to link to your existing server instead of installing a separate one.
Warn if a different embedder is loaded (e.g., nomic-embed-text) and explain that re-embedding (mcp-memory re-embed) only applies to M3-owned data.
Instruct you on how to manually load bge-m3 for optimal retrieval.

🔮 What happens next (benefits of use)

You're at a coffee shop on your MacBook, asking Claude to debug a deployment issue. It remembers the architecture decisions you made last week, the server configs you stored yesterday, and the troubleshooting steps that worked last time — all from local SQLite, no internet required.

Later, you're at your Windows desktop at home with Gemini CLI, and it picks up exactly where you left off. Same memories, same context, same knowledge graph. You didn't copy files, didn't export anything, didn't push to someone else's cloud. Your PostgreSQL sync handled everything in the background the moment your laptop hit the local network.

💡 Why this exists

Most AI agents don't persist state between sessions. You re-paste context, re-explain architecture, re-correct mistakes. When facts change, the agent has no mechanism to update what it "knows."

M3 Memory gives agents a structured, persistent memory layer that handles this.

⚡ What it does

Autonomous cognitive loop — optional background worker (m3_cognitive_loop.py) that extracts facts, resolves contradictions, and links entities while you sleep. Turns raw chat logs into a refined knowledge graph without human intervention.

Persistent memory — facts, decisions, preferences survive across sessions. Stored in local SQLite.

Hybrid retrieval — FTS5 keyword matching + semantic vector similarity + MMR diversity re-ranking. Automatic, no tuning required.

Contradiction handling — conflicting facts are automatically superseded. Bitemporal versioning preserves the full history.

Knowledge graph — related memories linked automatically on write. Nine relationship types, 3-hop traversal. Entity extraction (entity_search, entity_get) supplements the graph with first-class people / places / things resolution.

Zero-config local install — pip install m3-memory plus one line in your MCP config, or mcp-memory install-m3 for a one-command setup that wires settings.json, hooks, and the chatlog subsystem in one shot. SQLite stores everything locally — no external databases, no cloud calls, no API costs. Works offline.

Cross-device sync — optional, easy-to-add bi-directional delta sync via PostgreSQL or ChromaDB, with manifest-driven multi-DB support for fleet deployments. Set one environment variable and your memories follow you across machines.

📚 Learn more


🚀 Getting started	👥 Multi-agent orchestration
✨ Core features	🧩 Multi-agent example
🏗️ System design	⚖️ Compare M3 to alternatives (sovereign substrates table)
🔧 Implementation details	⚙️ Configuration
🤖 Agent rules + all 74 tools	🛡️ Compliance & assurance (FISMA, CMMC, GDPR)
🏠 Homelab patterns	🔍 Myths & facts (verify claims about M3)
🗺️ Roadmap

🎯 Who this is for

M3 is a good fit if…


🤖 You use coding agents	Claude Code, Gemini CLI, Aider, OpenCode, or any MCP-compatible agent. Non-MCP clients work too via the built-in HTTP proxy.
👥 You run multiple agents	Coordinating Claude + Gemini + a background worker on a shared local store, with handoffs and per-agent scoping.
🛡️ You need compliance primitives	`gdpr_forget` / `gdpr_export` as MCP tools, bitemporal valid-time / transaction-time, audit trail, no telemetry.
💾 You want pure local-first	Single-file SQLite. Works offline. No external database, no cloud calls, no API costs by default.
🌐 You want memory across devices	Optional bi-directional delta sync via PostgreSQL or ChromaDB — your data, your hardware.

M3 is not the right tool if…

	Try instead
You're building LangChain / LangGraph / CrewAI pipelines and want framework-native memory	Mem0, LangChain Memory / LangMem
You want a hosted agent runtime with managed scaling, dashboards, and SLAs	Letta, Mem0 Pro
Pure retrieval-accuracy is your only criterion (M3 is mid-pack at 89.0% LME-S)	agentmemory (96.2%), Hindsight
You only need in-session chat context that's discarded after the conversation	Your agent's built-in conversation buffer; M3 is overkill

🛡️ Why trust this


74 MCP tools	Memory, search, GDPR, refresh lifecycle — plus agent registry, handoffs, notifications, tasks, entity graph, fact enrichment, and chat-log capture for multi-agent orchestration
193 end-to-end tests	Covering write, search, contradiction, sync, GDPR, maintenance, and orchestration paths
Explainable retrieval	`memory_suggest` returns vector, BM25, and MMR scores per result
SQLite core	No external database required. Single-file, portable, inspectable
GDPR compliance	`gdpr_forget` (Article 17) and `gdpr_export` (Article 20) as built-in tools — see compliance & assurance for FISMA / CMMC alignment too
Self-maintaining	Automatic decay, dedup, orphan pruning, retention enforcement
Audited security posture	Periodic Bandit + pip-audit + secrets-scan reports published under `docs/audits/`; CI gates on core-dep CVEs
Apache 2.0 licensed	Free. No SaaS tier, no usage limits, no lock-in

🧭 Maturity, honestly. The core (storage, retrieval, GDPR, MCP tools, sync) is stable and covered by the test suite. The newer enrichment + reflector pipeline matured rapidly through 2026-Q2 and has live-fire experience behind it but is still iterating. Production-ready for personal, homelab, and multi-agent developer workflows today. For regulated workloads, do your own evaluation against your specific use case — and we recommend that against any memory tool, not just M3. See docs/MYTHS_AND_FACTS.md for what we don't claim.

📊 Benchmarks

89.0% on LongMemEval-S (445/500 correct) — a 500-question evaluation of long-horizon conversational memory. Without oracle metadata: 74.8% (smart retrieval) to 68.0% (fixed-k baseline).

Question type	n	Accuracy
single-session-user	70	91.4%
single-session-assistant	56	94.6%
single-session-preference	30	93.3%
multi-session	133	85.0%
temporal-reasoning	133	86.5%
knowledge-update	78	92.3%
Overall	500	89.0%

Full methodology, ablations, and honest caveats: benchmarks/longmemeval/LME-S_Benchmarking_Report.md. LoCoMo audit pending — see benchmarks/locomo/README.md.

🔍 Verifying claims about M3. If a third-party AI assistant has described M3 with features or scores that don't match what's documented here, it's almost certainly hallucinating. See docs/MYTHS_AND_FACTS.md for the source-of-truth list of what M3 actually implements (and what it doesn't).

🧰 Core tools

Most sessions use three tools. The rest is there when you need it.

Tool	Purpose
`memory_write`	Store a fact, decision, preference, config, or observation
`memory_search`	Retrieve relevant memories (hybrid search)
`memory_update`	Refine existing knowledge
`memory_suggest`	Search with full score breakdown
`memory_get`	Fetch a specific memory by ID

All 74 tools are documented in docs/AGENT_INSTRUCTIONS.md and the full inventory lives in docs/MCP_TOOLS.md.

🤖 For AI agents

M3 Memory exposes 74 MCP tools for storing, searching, updating, and linking knowledge — including conversation grouping, a refresh lifecycle for aging memories, agent registry, handoffs, notifications, tasks, entity-graph extraction, fact enrichment, and chat-log capture for multi-agent orchestration. Any MCP-compatible agent can use them automatically.

To teach your agent best practices (search before answering, write aggressively, update instead of duplicating), drop the compact rules file into your project:

examples/AGENT_RULES.md

Full tool reference with all parameters and behaviors: docs/AGENT_INSTRUCTIONS.md

🪄 Let your agent install it

Already inside Claude Code or Gemini CLI? Paste one of these prompts:

Claude Code:

Install m3-memory for persistent memory. Run: pip install m3-memory
Then add {"mcpServers":{"memory":{"command":"mcp-memory"}}} to my
~/.claude/settings.json under "mcpServers". For best retrieval, ensure 
Ollama is running with qwen3-embedding:0.6b (optional, falls back 
to keyword search without it). Then use /mcp to verify the memory server loaded.

Gemini CLI:

Install m3-memory for persistent memory. Run: pip install m3-memory
Then add {"mcpServers":{"memory":{"command":"mcp-memory"}}} to my
~/.gemini/settings.json under "mcpServers". For best retrieval, ensure 
Ollama is running with qwen3-embedding:0.6b (optional, falls back 
to keyword search without it).

After install, test it:

Write a memory: "M3 Memory installed successfully on [today's date]"
Then search for: "M3 install"

Add the chat log subsystem

Want auto-capture of every Claude Code / Gemini CLI / OpenCode / Aider conversation into a searchable, promotable chat log store? Once m3-memory is wired up, just say:

Install the m3-memory chat log subsystem.

The agent runs bin/chatlog_init.py, wires the host-agent hook, and installs the embed sweeper schedule. See docs/CHATLOG.md for the architecture and ops guide.

🎬 See it in action

Contradiction detection

Demo: contradiction detection and automatic resolution

Hybrid search with scores

Demo: hybrid search with score breakdown

Cross-device, cross-platform sync

Demo: cross-device, cross-platform memory sync

💬 Community

Contributing · Good first issues

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

2026.5.21.0

May 21, 2026

2026.5.18.1

May 18, 2026

2026.5.18.0

May 18, 2026

This version

2026.5.6.3

May 7, 2026

2026.5.6.2

May 7, 2026

2026.5.6.1

May 6, 2026

2026.5.4.6

May 5, 2026

2026.5.4.5

May 5, 2026

2026.5.4.1

May 4, 2026

2026.5.3.3

May 4, 2026

2026.5.3.2

May 3, 2026

2026.5.3.1

May 3, 2026

2026.5.1.1

May 1, 2026

2026.4.24.12

Apr 25, 2026

2026.4.24.11

Apr 25, 2026

2026.4.24.10

Apr 25, 2026

2026.4.24.9

Apr 25, 2026

2026.4.24.8

Apr 25, 2026

2026.4.24.7

Apr 25, 2026

2026.4.24.6

Apr 25, 2026

2026.4.24.5

Apr 24, 2026

2026.4.24.3

Apr 24, 2026

2026.4.24.1

Apr 24, 2026

2026.4.22.1

Apr 20, 2026

2026.4.20

Apr 17, 2026

2026.4.19

Apr 16, 2026

2026.4.18

Apr 16, 2026

2026.4.17

Apr 16, 2026

2026.4.16

Apr 16, 2026

2026.4.8

Apr 11, 2026

2026.4.7

Apr 10, 2026

2026.4.6

Apr 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

m3_memory-2026.5.6.3.tar.gz (136.8 kB view details)

Uploaded May 7, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

m3_memory-2026.5.6.3-py3-none-any.whl (35.8 kB view details)

Uploaded May 7, 2026 Python 3

File details

Details for the file m3_memory-2026.5.6.3.tar.gz.

File metadata

Download URL: m3_memory-2026.5.6.3.tar.gz
Upload date: May 7, 2026
Size: 136.8 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for m3_memory-2026.5.6.3.tar.gz
Algorithm	Hash digest
SHA256	`14f197fb7625d988ea15ee2ce62120293bdb4ee99d624115b173e05e3944fcb9`
MD5	`58b41b4efb3c45e377c95b7101c23f1d`
BLAKE2b-256	`f9db704acb27331f004107ddec8ec778f21fc689b936603151d0d9c3b33bdfc4`

See more details on using hashes here.

Provenance

The following attestation bundles were made for m3_memory-2026.5.6.3.tar.gz:

Publisher: publish.yml on skynetcmd/m3-memory

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: m3_memory-2026.5.6.3.tar.gz
- Subject digest: 14f197fb7625d988ea15ee2ce62120293bdb4ee99d624115b173e05e3944fcb9
- Sigstore transparency entry: 1457251848
- Sigstore integration time: May 7, 2026
Source repository:
- Permalink: skynetcmd/m3-memory@f209b4ce396d0c1f9438118044c0d7073f95555c
- Branch / Tag: refs/tags/v2026.5.6.3
- Owner: https://github.com/skynetcmd
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f209b4ce396d0c1f9438118044c0d7073f95555c
- Trigger Event: release

File details

Details for the file m3_memory-2026.5.6.3-py3-none-any.whl.

File metadata

Download URL: m3_memory-2026.5.6.3-py3-none-any.whl
Upload date: May 7, 2026
Size: 35.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for m3_memory-2026.5.6.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`32d353b91100cd317736efa3f733dc30cddf5eb0f2046c9d7826e5e282560edc`
MD5	`af828a05d6db885f97e818b68cdf4f89`
BLAKE2b-256	`b737320dc38450d35f38d9059377031ac05a7e95bd97f00e15cab1549cc7d157`

See more details on using hashes here.

Provenance

The following attestation bundles were made for m3_memory-2026.5.6.3-py3-none-any.whl:

Publisher: publish.yml on skynetcmd/m3-memory

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: m3_memory-2026.5.6.3-py3-none-any.whl
- Subject digest: 32d353b91100cd317736efa3f733dc30cddf5eb0f2046c9d7826e5e282560edc
- Sigstore transparency entry: 1457251965
- Sigstore integration time: May 7, 2026
Source repository:
- Permalink: skynetcmd/m3-memory@f209b4ce396d0c1f9438118044c0d7073f95555c
- Branch / Tag: refs/tags/v2026.5.6.3
- Owner: https://github.com/skynetcmd
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@f209b4ce396d0c1f9438118044c0d7073f95555c
- Trigger Event: release

m3-memory 2026.5.6.3

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

M3 Memory

📦 Install

🚀 Recommended: Integrated Sovereign Setup

🍎 Older Intel Macs

Other Options

🛡️ Sovereign Embedder (Air-gapped / Offline)

1. Unified Setup

2. Configuration

3. What it does:

3. Existing LM Studio instances

🔮 What happens next (benefits of use)

💡 Why this exists

⚡ What it does

📚 Learn more

🎯 Who this is for

M3 is a good fit if…

M3 is not the right tool if…

🛡️ Why trust this

📊 Benchmarks

🧰 Core tools

🤖 For AI agents

🪄 Let your agent install it

Add the chat log subsystem

🎬 See it in action

Contradiction detection

Hybrid search with scores

Cross-device, cross-platform sync

💬 Community

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance