A Memory Server for LLM Agents and Applications

Redis Agent Memory Server

A memory layer for AI agents.

Documentation · GitHub · Docker

Features

  • Dual Interface: REST API and Model Context Protocol (MCP) server
  • Two-Tier Memory: Working memory (session-scoped) and long-term memory (persistent)
  • Configurable Memory Strategies: Customize how memories are extracted (discrete, summary, preferences, custom)
  • Semantic Search: Vector-based similarity search with metadata filtering
  • Flexible Backends: Pluggable vector store factory system
  • Multi-Provider LLM Support: OpenAI, Anthropic, AWS Bedrock, Ollama, Azure, Gemini via LiteLLM
  • AI Integration: Automatic topic extraction, entity recognition, and conversation summarization
  • Python SDK: Easy integration with AI applications

Quick Start

1. Installation

Using Docker

Pre-built Docker images are published as redislabs/agent-memory-server.

Quick Start (Development Mode):

# Start with docker-compose
# Note: Both 'api' and 'api-for-task-worker' services use port 8000
# Choose one depending on your needs:

# Option 1: Development mode (no worker, immediate task execution)
docker compose up api redis

# Option 2: Production-like mode (with background worker)
docker compose up api-for-task-worker task-worker redis mcp

# Or run just the API server (requires separate Redis)
docker run -p 8000:8000 \
  -e REDIS_URL=redis://your-redis:6379 \
  -e OPENAI_API_KEY=your-key \
  redislabs/agent-memory-server:latest \
  agent-memory api --host 0.0.0.0 --port 8000 --task-backend=asyncio

By default, the image runs the API with the Docket task backend, which expects a separate agent-memory task-worker process for non-blocking background tasks. The example above shows how to override this to use the asyncio backend for a single-container development setup.

Production Deployment:

For production, run separate containers for the API and background workers:

# API Server (without background worker)
docker run -p 8000:8000 \
  -e REDIS_URL=redis://your-redis:6379 \
  -e OPENAI_API_KEY=your-key \
  -e DISABLE_AUTH=false \
  redislabs/agent-memory-server:latest \
  agent-memory api --host 0.0.0.0 --port 8000

# Background Worker (separate container)
docker run \
  -e REDIS_URL=redis://your-redis:6379 \
  -e OPENAI_API_KEY=your-key \
  redislabs/agent-memory-server:latest \
  agent-memory task-worker --concurrency 10

# MCP Server (if needed)
docker run -p 9000:9000 \
  -e REDIS_URL=redis://your-redis:6379 \
  -e OPENAI_API_KEY=your-key \
  redislabs/agent-memory-server:latest \
  agent-memory mcp --mode sse --port 9000

From Source

# Install dependencies
pip install uv
uv sync --all-extras

# Start Redis
docker compose up redis

# Start the server (development mode, asyncio task backend)
uv run agent-memory api --task-backend=asyncio

2. Python SDK

The easiest approach is to let the server extract memories from working memory automatically, but you can also create memories manually:

# Install the client
pip install agent-memory-client

# For LangChain integration
pip install agent-memory-client langchain-core

from agent_memory_client import MemoryAPIClient

# Connect to server
client = MemoryAPIClient(base_url="http://localhost:8000")

# Store memories
await client.create_long_term_memories([
    {
        "text": "User prefers morning meetings",
        "user_id": "user123",
        "memory_type": "preference"
    }
])

# Search memories
results = await client.search_long_term_memory(
    text="What time does the user like meetings?",
    user_id="user123"
)

Note: While you can call client functions directly as shown above, MCP or SDK-provided tool calls are recommended for AI agents: they provide better integration and automatic context management, and they follow AI-native patterns. For the best performance, add messages to working memory and let the server extract memories in the background. See Memory Integration Patterns for guidance on when to use each approach.
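
As a rough sketch of that working-memory pattern (assuming the client exposes a method along the lines of append_messages_to_working_memory; check the SDK reference for the exact method name and message shape):

from agent_memory_client import create_memory_client

async def record_turn(user_text: str, assistant_text: str) -> None:
    client = await create_memory_client("http://localhost:8000")

    # Add the latest conversation turn to session-scoped working memory;
    # the server extracts long-term memories from it in the background.
    # NOTE: the method name and message shape are assumptions for illustration.
    await client.append_messages_to_working_memory(
        session_id="my_session",
        messages=[
            {"role": "user", "content": user_text},
            {"role": "assistant", "content": assistant_text},
        ],
    )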

LangChain Integration

For LangChain users, the SDK provides automatic conversion of memory client tools to LangChain-compatible tools, eliminating the need for manual wrapping with @tool decorators.

from agent_memory_client import create_memory_client
from agent_memory_client.integrations.langchain import get_memory_tools
from langchain.agents import create_tool_calling_agent, AgentExecutor
from langchain_core.prompts import ChatPromptTemplate, MessagesPlaceholder
from langchain_openai import ChatOpenAI

# Get LangChain-compatible tools automatically
memory_client = await create_memory_client("http://localhost:8000")
tools = get_memory_tools(
    memory_client=memory_client,
    session_id="my_session",
    user_id="alice"
)

# Create prompt and agent
prompt = ChatPromptTemplate.from_messages([
    ("system", "You are a helpful assistant with memory."),
    ("human", "{input}"),
    MessagesPlaceholder("agent_scratchpad"),
])

llm = ChatOpenAI(model="gpt-4o")
agent = create_tool_calling_agent(llm, tools, prompt)
executor = AgentExecutor(agent=agent, tools=tools)

# Use the agent
result = await executor.ainvoke({"input": "Remember that I love pizza"})

3. MCP Integration

# Start MCP server (stdio mode - recommended for Claude Desktop)
uv run agent-memory mcp

# Or with SSE mode (development mode, default asyncio backend)
uv run agent-memory mcp --mode sse --port 9000

MCP config via uvx (recommended)

Use this in your MCP tool configuration (e.g., Claude Desktop mcp.json):

{
  "mcpServers": {
    "memory": {
      "command": "uvx",
      "args": ["--from", "agent-memory-server", "agent-memory", "mcp"],
      "env": {
        "DISABLE_AUTH": "true",
        "REDIS_URL": "redis://localhost:6379",
        "OPENAI_API_KEY": "<your-openai-key>"
      }
    }
  }
}

Notes:

  • API keys: Set either OPENAI_API_KEY (default models use OpenAI) or switch to Anthropic by setting ANTHROPIC_API_KEY and GENERATION_MODEL to an Anthropic model (e.g., claude-3-5-haiku-20241022).

  • Make sure your MCP host can find uvx (on its PATH or by using an absolute command path).

    • macOS: brew install uv
    • If not on PATH, set "command" to the absolute path (e.g., /opt/homebrew/bin/uvx on Apple Silicon, /usr/local/bin/uvx on Intel macOS). On Linux, ~/.local/bin/uvx is common. See https://docs.astral.sh/uv/getting-started/
  • For production, remove DISABLE_AUTH and configure proper authentication.

LLM Provider Configuration

The server uses LiteLLM to support 100+ LLM providers. Configure via environment variables:

# OpenAI (default)
export OPENAI_API_KEY=sk-...
export GENERATION_MODEL=gpt-4o
export EMBEDDING_MODEL=text-embedding-3-small

# Anthropic
export ANTHROPIC_API_KEY=sk-ant-...
export GENERATION_MODEL=claude-3-5-sonnet-20241022
export EMBEDDING_MODEL=text-embedding-3-small  # Use OpenAI for embeddings

# AWS Bedrock
export AWS_ACCESS_KEY_ID=...
export AWS_SECRET_ACCESS_KEY=...
export AWS_REGION_NAME=us-east-1
export GENERATION_MODEL=anthropic.claude-sonnet-4-5-20250929-v1:0
export EMBEDDING_MODEL=bedrock/amazon.titan-embed-text-v2:0  # Note: bedrock/ prefix required

# Ollama (local)
export OLLAMA_API_BASE=http://localhost:11434
export GENERATION_MODEL=ollama/llama2
export EMBEDDING_MODEL=ollama/nomic-embed-text
export REDISVL_VECTOR_DIMENSIONS=768  # Required for Ollama

See LLM Providers for complete configuration options.

Documentation

📚 Full Documentation - Complete guides, API reference, and examples

Architecture

Working Memory (Session-scoped)  →  Long-term Memory (Persistent)
    ↓                                      ↓
- Messages                         - Semantic search
- Structured memories              - Topic modeling
- Summary of past messages         - Entity recognition
- Metadata                         - Deduplication

Use Cases

  • AI Assistants: Persistent memory across conversations
  • Customer Support: Context from previous interactions
  • Personal AI: Learning user preferences and history
  • Research Assistants: Accumulating knowledge over time
  • Chatbots: Maintaining context and personalization

Development

# Install dependencies
uv sync --all-extras

# Run tests
uv run pytest

# Format and lint code
uv run ruff format
uv run ruff check

# Start development stack (choose one based on your needs)
docker compose up api redis                               # Development mode
docker compose up api-for-task-worker task-worker redis   # Production-like mode

License

Apache License 2.0 - see LICENSE file for details.

Contributing

We welcome contributions! Please see the development documentation for guidelines.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agent_memory_server-0.13.1.tar.gz (122.7 kB)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

agent_memory_server-0.13.1-py3-none-any.whl (139.0 kB)

Uploaded Python 3

File details

Details for the file agent_memory_server-0.13.1.tar.gz.

File metadata

  • Download URL: agent_memory_server-0.13.1.tar.gz
  • Upload date:
  • Size: 122.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for agent_memory_server-0.13.1.tar.gz:

  • SHA256: 04297c5cb771566c296dee455319dc880f94f470a669e3d4f84aeeba415da331
  • MD5: e69aa28a1903e8248572b9359802d40d
  • BLAKE2b-256: dd5b68be5d323edd01322a669a6abe9888b4d2403091d90d5224dbf26c80274f

See more details on using hashes here.

Provenance

The following attestation bundles were made for agent_memory_server-0.13.1.tar.gz:

Publisher: agent-memory-server.yml on redis/agent-memory-server

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file agent_memory_server-0.13.1-py3-none-any.whl.

File metadata

File hashes

Hashes for agent_memory_server-0.13.1-py3-none-any.whl:

  • SHA256: b274be5afe057c29e82b31e9428509da5e73b41c371a5cbe0b10194a988bf46d
  • MD5: a11779b5b1bc865f370c79031acc5d41
  • BLAKE2b-256: dc62aa3a86dadce00f9dbc97b7aaf0c31ef2e18431e4993cb9cb6ae7823c4ba7

See more details on using hashes here.

Provenance

The following attestation bundles were made for agent_memory_server-0.13.1-py3-none-any.whl:

Publisher: agent-memory-server.yml on redis/agent-memory-server

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.
