Mem0 Agent Memory
MCP server for Mem0 agent memory management with multi-backend support.
A Model Context Protocol (MCP) server that provides persistent memory capabilities for AI agents using Mem0. Store, search, and manage contextual information across conversations with support for multiple backends and LLM providers.
Features
Core Capabilities:
- 🧠 Persistent Memory: Store and retrieve contextual information across sessions
- 🔍 Semantic Search: Find relevant memories using natural language queries
- 📚 Document Ingestion: Import PDFs, DOCX, Markdown, and text files as knowledge base
- 🏷️ Metadata Filtering: Organize and filter memories by type, priority, status, and custom fields
- 📊 Memory Management: Full CRUD operations with history tracking and bulk operations
Backend Support:
- Vector Stores: FAISS (local), Qdrant (embedded/server), OpenSearch (AWS), Mem0 Platform (cloud)
- LLM Providers: AWS Bedrock (Claude, Titan), Ollama (local), LM Studio (local)
- Graph Store: KuzuDB integration for relationship tracking (experimental)
Developer Experience:
- ⚡ Auto-configuration: Automatic user/agent detection from system context
- 🎯 Memory-First Workflows: One-command setup for Kiro IDE integration
- 🔧 Performance Tuning: Configurable inference, relevance filtering, and connection pooling
- 📦 Session Partitioning: Isolate memories by run_id for multi-session management
Quick Start
Installation
pip install mem0-agent-memory
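Alternatively, run it without a separate install step via uvx (this is what the MCP configuration below uses):
uvx mem0-agent-memory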
Basic Setup
Add to your MCP client configuration:
- Kiro: .kiro/settings/mcp.json
- Amazon Q CLI: ~/.aws/amazonq/mcp.json or .amazonq/mcp.json
{
  "mcpServers": {
    "mem0-agent-memory": {
      "command": "uvx",
      "args": ["mem0-agent-memory"],
      "env": {
        "AWS_ACCESS_KEY_ID": "your-key",
        "AWS_SECRET_ACCESS_KEY": "your-secret",
        "AWS_REGION": "us-west-2"
      }
    }
  }
}
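For a fully local setup, the same configuration can point at Ollama instead of Bedrock. This is a sketch using the OLLAMA_* variables documented under Configuration; the model names are examples, and the server selects the provider from whichever variables are set:
{
  "mcpServers": {
    "mem0-agent-memory": {
      "command": "uvx",
      "args": ["mem0-agent-memory"],
      "env": {
        "OLLAMA_HOST": "http://localhost:11434",
        "OLLAMA_LLM_MODEL": "llama3.2",
        "OLLAMA_EMBED_MODEL": "nomic-embed-text"
      }
    }
  }
}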
Enable Memory-First Workflows (Kiro Only)
In your first chat session:
setup steering for memory
This configures the AI to automatically check memory before tasks and store important outcomes.
Configuration
LLM Providers
Choose your LLM backend by setting the appropriate environment variables:
AWS Bedrock (Recommended)
export AWS_ACCESS_KEY_ID="your-access-key"
export AWS_SECRET_ACCESS_KEY="your-secret-key"
export AWS_REGION="us-west-2"
# Optional: Customize models
export BEDROCK_LLM_MODEL="us.anthropic.claude-3-5-haiku-20241022-v1:0"
export BEDROCK_EMBED_MODEL="amazon.titan-embed-text-v2:0"
export BEDROCK_MAX_TOKENS="1500" # Default: 1500
Performance: Built-in optimizations include connection pooling (50 connections), adaptive retries, and reduced latency settings.
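For reference, these optimizations roughly correspond to botocore client settings like the following. This is an illustrative sketch of equivalent boto3 configuration, not the server's actual code:
import boto3
from botocore.config import Config
# Illustrative: connection pooling and adaptive retries similar to the
# server's built-in Bedrock optimizations (not the actual implementation).
config = Config(
    max_pool_connections=50,                          # connection pool size
    retries={"mode": "adaptive", "max_attempts": 3},  # adaptive retries
    connect_timeout=5,
    read_timeout=30,
)
client = boto3.client("bedrock-runtime", region_name="us-west-2", config=config)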
Ollama (Local)
export OLLAMA_HOST="http://localhost:11434"
export OLLAMA_LLM_MODEL="llama3.2" # Default
export OLLAMA_EMBED_MODEL="nomic-embed-text" # Default
# Pull models first:
ollama pull llama3.2
ollama pull nomic-embed-text
Note: For Nomic embeddings, set NOMIC_USE_PREFIXES=true for better search accuracy.
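Nomic embedding models are trained with task-specific prefixes, so queries and documents should be embedded differently for retrieval to match well. With the flag enabled, inputs are prefixed roughly like this (illustrative; the exact handling is internal to the server):
"search_document: User prefers React over Vue"                 # stored memories
"search_query: Which frontend framework does the user like?"   # search queries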
LM Studio (Local)
export LMSTUDIO_HOST="http://localhost:1234"
export LMSTUDIO_LLM_MODEL="llama-3.2-3b-instruct"
export LMSTUDIO_EMBED_MODEL="text-embedding-nomic-embed-text-v1.5"
Vector Store Backends
FAISS (Default - Local)
export FAISS_PATH="/path/to/.mem0/memory" # Optional, defaults to .mem0/memory
Best for: Development, small-medium datasets (<100k memories), single-user scenarios.
Qdrant (Embedded or Server)
Embedded Mode (No Docker):
export QDRANT_PATH=".mem0/qdrant"
Server Mode (Production):
export QDRANT_HOST="localhost"
export QDRANT_PORT="6333" # Optional
Benefits: Native metadata filtering, better performance for complex queries, production-ready clustering. See Qdrant Setup Guide for details.
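For server mode, a local Qdrant instance can be started with the standard Docker image (the volume path here is just an example; adjust ports and storage for production):
docker run -p 6333:6333 -v "$(pwd)/qdrant_storage:/qdrant/storage" qdrant/qdrant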
OpenSearch (AWS)
export OPENSEARCH_HOST="your-opensearch-endpoint"
export AWS_REGION="us-west-2"
Best for: Large datasets (>100k memories), complex filtering, enterprise deployments.
Mem0 Platform (Cloud)
export MEM0_API_KEY="your-api-key"
Best for: Managed service, no infrastructure management, built-in features.
Additional Settings
# User/Agent ID (optional - auto-detected if not set)
export MEM0_USER_ID="custom-user-id" # Defaults to system username
export MEM0_AGENT_ID="custom-agent-id" # Defaults to workspace name
export MEM0_RUN_ID="session-123" # Optional: session partitioning
# Performance
export MEM0_INFER_DEFAULT="true" # LLM inference for fact extraction
export MEM0_MIN_RELEVANCE_SCORE="0.7" # Search result threshold (0.0-1.0)
# Response Optimization (v1.3.0+)
export MEM0_VERBOSE="false" # Compact responses (default) vs verbose
export MEM0_MAX_RELATIONS="20" # Max graph relations in compact mode
# Nomic embeddings (Ollama only)
export NOMIC_USE_PREFIXES="true" # Improves search accuracy
Response Modes (v1.3.0+):
- Compact (default): Returns only essential fields (id, memory, metadata, score). Reduces token usage by ~55% per memory.
- Verbose: Returns all fields including hash, timestamps, user_id, agent_id, etc.
- Graph relations: Preserved in both modes, truncated to 20 in compact mode to prevent token bloat
- Control per-call with the `verbose` parameter or globally with the `MEM0_VERBOSE` environment variable
Available Tools
Core Operations
| Tool | Description |
|---|---|
| `store_memory` | Store memory with optional metadata and inference control |
| `search_memories` | Semantic search with relevance filtering |
| `list_memories` | List all memories with pagination |
| `get_memory` | Retrieve a specific memory by ID |
| `get_recent_memories` | Get recently added/updated memories |
Advanced Operations
| Tool | Description |
|---|---|
| `update_memory` | Update an existing memory directly (no LLM processing) |
| `search_by_metadata` | Filter memories by metadata fields (type, priority, status) |
| `get_memory_history` | View the change history for a memory |
| `get_memory_stats` | Get memory usage statistics |
Bulk Operations
| Tool | Description |
|---|---|
| `delete_memory` | Delete a single memory (permanent) |
| `delete_all_memories` | Delete all memories for a scope (permanent) |
| `bulk_delete_memories` | Delete multiple memories by filter (dry-run supported) |
Import/Export
| Tool | Description |
|---|---|
| `export_memories` | Export to JSON or Markdown format |
| `import_memories` | Import memories from a JSON export |
| `ingest_documents` | Ingest PDF, DOCX, MD, and TXT files as a knowledge base |
Utilities
| Tool | Description |
|---|---|
| `health_check` | Verify backend connectivity |
| `reset_memory` | Reset the entire memory store (destructive) |
| `setup_steering` | Create the Kiro memory-first steering file |
For detailed parameter documentation, see the tool descriptions in your MCP client.
Usage Examples
Basic Memory Operations
# Store a memory
store_memory(
    content="User prefers React over Vue for frontend development",
    metadata={"type": "preference", "priority": "high"}
)
# Search memories
search_memories(query="React preferences", limit=5)
# Get recent memories
get_recent_memories(days=7, limit=10)
# Filter by metadata
search_by_metadata(type="preference", priority="high")
Document Ingestion
# Ingest single file
ingest_documents(path="/path/to/manual.pdf")
# Ingest directory recursively
ingest_documents(
    path="/path/to/docs",
    recursive=True,
    chunk_size=2048,
    chunk_overlap=400,
    file_metadata={"type": "documentation", "version": "2.0"}
)
# Search ingested documents
search_memories(query="how to configure authentication")
Session Partitioning
# Store memory for specific session
store_memory(
    content="Current task: Implementing user authentication",
    run_id="session-123",
    metadata={"type": "task", "status": "in_progress"}
)
# Search within session
search_memories(query="authentication", run_id="session-123")
# Clean up session
delete_all_memories(run_id="session-123")
Performance Optimization
# Fast storage (no LLM inference)
store_memory(
    content="Completed: API refactoring - 30% faster response times",
    metadata={"type": "task_completion"},
    infer=False  # 5-10x faster
)
# Smart storage (with deduplication)
store_memory(
    content="User mentioned they prefer TypeScript for type safety",
    metadata={"type": "preference"},
    infer=True  # Extracts facts, prevents duplicates
)
# Compact response (default - saves tokens)
search_memories(query="React preferences", limit=5)
# Returns: {"memories": [{"id": "...", "memory": "...", "metadata": {...}}], "count": 5}
# Verbose response (full details)
search_memories(query="React preferences", limit=5, verbose=True)
# Returns: Full details including hash, timestamps, user_id, agent_id, etc.
Performance Tips
Storage Speed:
- Use `infer=false` for 5-10x faster writes when you don't need deduplication
- Use `infer=true` (default) for important information to prevent duplicates
Response Optimization:
- Use `verbose=false` (default) for ~55% token reduction per memory
- Compact mode returns only: id, memory, metadata, score
- Graph relations are preserved and truncated to 20 in compact mode
- Verbose mode returns all fields including timestamps, hashes, etc.
- Set `MEM0_VERBOSE=true` globally or use the `verbose` parameter per-call
- Configure the relations limit with `MEM0_MAX_RELATIONS` (default: 20)
Search Optimization:
- FAISS uses L2 distance: lower scores mean higher similarity (see the sketch after this list)
- Adjust `MEM0_MIN_RELEVANCE_SCORE` to filter out less relevant results
- Use metadata filtering for precise queries
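Because FAISS reports raw L2 distances, a relevance threshold conceptually discards results whose distance is too large. A minimal sketch of the idea (not the server's implementation; score semantics vary by backend, and the mapping from MEM0_MIN_RELEVANCE_SCORE to a distance cutoff is an assumption here):
# Conceptual filter over L2 distances: lower score = more similar,
# so we keep only results at or below a maximum distance.
def filter_by_distance(results, max_distance=0.7):
    return [r for r in results if r["score"] <= max_distance]
hits = filter_by_distance([{"id": "m1", "score": 0.42}, {"id": "m2", "score": 1.3}])
# -> keeps only m1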
Backend Selection:
- FAISS: Best for <100k memories, single-user, development
- Qdrant: Best for metadata-heavy queries, production deployments
- OpenSearch: Best for >100k memories, enterprise scale
LLM Provider:
- Bedrock: Fastest with built-in optimizations, production-ready
- Ollama: Good for local development, privacy-focused
- LM Studio: Alternative local option with UI
Advanced Features
KuzuDB Graph Store (Experimental)
Track relationships between entities (people, companies, technologies) alongside vector embeddings. See docs/KUZU_GRAPH_STORE.md for details.
Known Limitations
- Amazon Nova Models: now fully compatible via bedrock_patch.py
- Metadata Filtering: mem0 v1.0.x supports only implicit AND over a flat dictionary of fields (see the sketch after this list)
- Platform Features: Some features (custom_categories, expiration_date) are only available on the Mem0 Platform
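In practice, a flat filter dictionary matches only memories that satisfy every field; OR semantics require separate calls merged client-side. Illustrative calls using the search_by_metadata tool described above:
# Implicit AND: type == "preference" AND priority == "high"
search_by_metadata(type="preference", priority="high")
# No OR in a flat dictionary: run separate queries and merge the results yourself
search_by_metadata(type="preference")
search_by_metadata(type="decision")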
For troubleshooting, see TROUBLESHOOTING.md.
Contributing
Contributions are welcome! Please:
- Fork the repository
- Create a feature branch
- Submit a pull request
See CONTRIBUTING.md for guidelines.
Documentation
- Qdrant Setup Guide - Detailed Qdrant configuration
- KuzuDB Graph Store - Relationship tracking (experimental)
- Nomic Embeddings - Nomic model configuration
- Troubleshooting - Common issues and solutions
Citation
If you use this project in your research or work, please cite:
@software{selvam_mem0_agent_memory_2025,
  author = {Selvam, Arunkumar},
  title = {Mem0 Agent Memory - MCP Server},
  url = {https://github.com/arunkumars-mf/mem0-agent-memory},
  version = {1.3.0},
  year = {2025}
}
See CITATION.cff for more formats.
License
MIT License - see LICENSE for details.