Local-first persistent memory for AI agents — store, recall, and consolidate knowledge across sessions using FAISS, SQLite, and any LLM

These details have not been verified by PyPI

Project links

Project description

consolidation-memory

Local-first persistent memory for AI agents. Store facts, solutions, and preferences as episodes — then a local LLM automatically clusters and distills them into structured knowledge documents. Your AI remembers what it learned, not just what you said.

Works with any LLM (LM Studio, Ollama, OpenAI) and any interface — MCP (Claude Desktop/Code/Cursor), Python API, REST API, or OpenAI function calling.

Quick Start

pip install consolidation-memory[fastembed]
consolidation-memory init

MCP (Claude Desktop / Claude Code / Cursor)

Add to your Claude Desktop config (claude_desktop_config.json):

{
  "mcpServers": {
    "consolidation_memory": {
      "command": "consolidation-memory"
    }
  }
}

Python API

from consolidation_memory import MemoryClient

with MemoryClient() as mem:
    # Store
    result = mem.store("User prefers dark mode", content_type="preference", tags=["ui"])
    print(result.id)  # UUID of stored episode

    # Recall
    result = mem.recall("user interface preferences")
    for ep in result.episodes:
        print(ep["content"], ep["similarity"])
    for doc in result.knowledge:
        print(doc["title"])

    # Status
    stats = mem.status()
    print(stats.episodic_buffer["total"])
    print(stats.knowledge_base["total_topics"])

    # Forget
    mem.forget(episode_id="some-uuid")

    # Export
    export = mem.export()
    print(export.path)  # backup JSON file

    # Correct knowledge
    mem.correct("vr_setup.md", "SteamVR version is now 2.7, not 2.5")

    # Manual consolidation
    mem.consolidate()

OpenAI Function Calling (any OpenAI-compatible LLM)

import json
from openai import OpenAI
from consolidation_memory import MemoryClient
from consolidation_memory.schemas import openai_tools, dispatch_tool_call

client = OpenAI(base_url="http://localhost:1234/v1", api_key="lm-studio")
mem = MemoryClient()

messages = [{"role": "user", "content": "What do you remember about my VR setup?"}]

response = client.chat.completions.create(
    model="qwen2.5-7b-instruct",
    messages=messages,
    tools=openai_tools,
)

for call in response.choices[0].message.tool_calls or []:
    result = dispatch_tool_call(mem, call.function.name, json.loads(call.function.arguments))
    messages.append({"role": "tool", "tool_call_id": call.id, "content": json.dumps(result)})

mem.close()

Works with LM Studio, Ollama, OpenAI, Azure, any OpenAI-compatible API.

REST API

pip install consolidation-memory[rest]
consolidation-memory serve --rest --port 8080

# Store
curl -X POST http://localhost:8080/memory/store \
  -H "Content-Type: application/json" \
  -d '{"content": "User runs SteamVR on Windows 11", "content_type": "fact"}'

# Recall
curl -X POST http://localhost:8080/memory/recall \
  -H "Content-Type: application/json" \
  -d '{"query": "VR setup"}'

# Status
curl http://localhost:8080/memory/status

# Health
curl http://localhost:8080/health

All endpoints:

Method	Path	Description
GET	`/health`	Version + status
POST	`/memory/store`	Store episode
POST	`/memory/recall`	Semantic search
GET	`/memory/status`	System statistics
DELETE	`/memory/episodes/{id}`	Forget episode
POST	`/memory/consolidate`	Run consolidation
POST	`/memory/correct`	Correct knowledge doc
POST	`/memory/export`	Export to JSON

How It Works

Store → Embed → FAISS Index
                    ↓
            Recall (semantic search + priority scoring)
                    ↓
        Consolidation (cluster → LLM synthesis → knowledge docs)

Store: Episodes (facts, solutions, preferences) are embedded and stored in SQLite + FAISS
Recall: Queries are embedded and matched against episodes using cosine similarity, weighted by surprise score, recency, and access frequency
Consolidate: Background thread clusters related episodes via agglomerative clustering, then synthesizes them into structured markdown knowledge documents using a local LLM

MCP Tools

Tool	Description
`memory_store`	Store a memory episode
`memory_recall`	Semantic search over episodes + knowledge
`memory_status`	System statistics
`memory_forget`	Remove an episode
`memory_export`	Export to JSON snapshot
`memory_correct`	Correct a knowledge document

CLI Commands

consolidation-memory serve              # Start MCP server (default)
consolidation-memory serve --rest       # Start REST API on 127.0.0.1:8080
consolidation-memory serve --rest --port 9000 --host 0.0.0.0
consolidation-memory init               # Interactive setup
consolidation-memory status             # Show statistics
consolidation-memory consolidate        # Run consolidation manually
consolidation-memory export             # Export to JSON
consolidation-memory import PATH        # Import from JSON export
consolidation-memory reindex            # Re-embed with current backend

Embedding Backends

Backend	Config Value	Model	Dimensions	Requirements
FastEmbed (default)	`fastembed`	bge-small-en-v1.5	384	`pip install consolidation-memory[fastembed]`
LM Studio	`lmstudio`	nomic-embed-text-v1.5	768	LM Studio running
OpenAI	`openai`	text-embedding-3-small	1536	API key
Ollama	`ollama`	nomic-embed-text	768	Ollama running

LLM Backends (Consolidation)

Consolidation requires an LLM to synthesize episode clusters into knowledge documents. Set backend = "disabled" under [llm] to use store/recall without consolidation.

Backend	Config Value	Requirements
LM Studio (default)	`lmstudio`	LM Studio running with chat model
OpenAI	`openai`	API key
Ollama	`ollama`	Ollama running with chat model
Disabled	`disabled`	None (no consolidation)

Configuration

Config file location:

Linux/macOS: ~/.config/consolidation_memory/config.toml
Windows: %APPDATA%\consolidation_memory\config.toml
Override: CONSOLIDATION_MEMORY_CONFIG env var

[embedding]
backend = "fastembed"

[llm]
backend = "lmstudio"
api_base = "http://localhost:1234/v1"
model = "qwen2.5-7b-instruct"

[consolidation]
auto_run = true
interval_hours = 6
cluster_threshold = 0.72

[dedup]
enabled = true
similarity_threshold = 0.95

Run consolidation-memory init to generate a config interactively.

Data Directory

Linux: ~/.local/share/consolidation_memory/
macOS: ~/Library/Application Support/consolidation_memory/
Windows: %LOCALAPPDATA%\consolidation_memory\

Override with data_dir under [paths] in config.

Migrating from Existing Installation

If you have an existing data directory, point your config at it:

[paths]
data_dir = "C:\\Users\\you\\Documents\\consolidation_memory\\data"

If switching embedding backends (different dimensions), run:

consolidation-memory reindex

Installation Extras

pip install consolidation-memory                     # Core (MCP + Python API)
pip install consolidation-memory[fastembed]           # + FastEmbed (recommended)
pip install consolidation-memory[openai]              # + OpenAI SDK
pip install consolidation-memory[rest]                # + REST API (FastAPI + Uvicorn)
pip install consolidation-memory[all]                 # Everything

Development

git clone https://github.com/charliee1w/consolidation-memory
cd consolidation-memory
pip install -e ".[fastembed,dev]"
python -m pytest tests/ -v      # 88 tests

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.15.0

Mar 28, 2026

0.14.2

Mar 19, 2026

0.14.1

Mar 19, 2026

0.14.0

Mar 19, 2026

0.13.7

Mar 13, 2026

0.13.6

Mar 10, 2026

0.13.5

Mar 8, 2026

0.13.1

Mar 8, 2026

0.13.0

Mar 7, 2026

0.12.4

Mar 6, 2026

0.12.3

Mar 3, 2026

0.12.2

Mar 3, 2026

0.12.1

Mar 2, 2026

0.12.0

Mar 2, 2026

0.11.0

Mar 2, 2026

0.10.0

Mar 1, 2026

0.9.0

Mar 1, 2026

0.8.3

Mar 1, 2026

0.8.2

Mar 1, 2026

0.8.1

Mar 1, 2026

0.8.0

Mar 1, 2026

0.7.0

Feb 28, 2026

0.6.0

Feb 28, 2026

0.5.0

Feb 28, 2026

0.4.0

Feb 28, 2026

0.3.0

Feb 28, 2026

0.2.0

Feb 25, 2026

This version

0.1.0

Feb 24, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

consolidation_memory-0.1.0.tar.gz (52.4 kB view details)

Uploaded Feb 24, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

consolidation_memory-0.1.0-py3-none-any.whl (50.9 kB view details)

Uploaded Feb 24, 2026 Python 3

File details

Details for the file consolidation_memory-0.1.0.tar.gz.

File metadata

Download URL: consolidation_memory-0.1.0.tar.gz
Upload date: Feb 24, 2026
Size: 52.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for consolidation_memory-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`c09f587bad3b78cc56539a66fdf1bd0f5198a40aa4ffb71566e7f905f6cb01da`
MD5	`3ac4e2d6c1f5c8866801f1cc3c9bd00b`
BLAKE2b-256	`8331672b617ee925312eb24b611ae7f58988a7365f7226f9ad7e360b25a87485`

See more details on using hashes here.

File details

Details for the file consolidation_memory-0.1.0-py3-none-any.whl.

File metadata

Download URL: consolidation_memory-0.1.0-py3-none-any.whl
Upload date: Feb 24, 2026
Size: 50.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.2

File hashes

Hashes for consolidation_memory-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b7cb2e0e8ad36ed4d3114788d1343b17cfff32821e539c3e3cd0bcabae70a51e`
MD5	`729ddc66d3e4bd6cff4e60fbf9fe5540`
BLAKE2b-256	`bf32049dd538c25ee210bf226a23455d53af2d9bda7c50685b026808b8559f4b`

See more details on using hashes here.

consolidation-memory 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

consolidation-memory

Quick Start

MCP (Claude Desktop / Claude Code / Cursor)

Python API

OpenAI Function Calling (any OpenAI-compatible LLM)

REST API

How It Works

MCP Tools

CLI Commands

Embedding Backends

LLM Backends (Consolidation)

Configuration

Data Directory

Migrating from Existing Installation

Installation Extras

Development

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes