Skip to main content

AI Memory and Conversation Management Framework - Simple as mem0, Powerful as MemU

Project description

MemU Banner

MemU

A Future-Oriented Agentic Memory System

PyPI version License: Apache 2.0 Python 3.13+ Discord Twitter


MemU is an agentic memory framework for LLM and AI agent backends. It receives multimodal inputs (conversations, documents, images), extracts them into structured memory, and organizes them into a hierarchical file system that supports both embedding-based (RAG) and non-embedding (LLM) retrieval.


MemU is collaborating with four open-source projects to launch the 2026 New Year Challenge. 🎉Between January 8–18, contributors can submit PRs to memU and earn cash rewards, community recognition, and platform credits. 🎁Learn more & get involved

✨ Core Features

Feature Description
🗂️ Hierarchical File System Three-layer architecture: Resource → Item → Category with full traceability
🔍 Dual Retrieval Methods RAG (embedding-based) for speed, LLM (non-embedding) for deep semantic understanding
🎨 Multimodal Support Process conversations, documents, images, audio, and video
🔄 Self-Evolving Memory Memory structure adapts and improves based on usage patterns

🗂️ Hierarchical File System

MemU organizes memory using a three-layer architecture inspired by hierarchical storage systems:

structure
Layer Description Examples
Resource Raw multimodal data warehouse JSON conversations, text documents, images, videos
Item Discrete extracted memory units Individual preferences, skills, opinions, habits
Category Aggregated textual memory with summaries preferences.md, work_life.md, relationships.md

Key Benefits:

  • Full Traceability: Track from raw data → items → categories and back
  • Progressive Summarization: Each layer provides increasingly abstracted views
  • Flexible Organization: Categories evolve based on content patterns

🎨 Multimodal Support

MemU processes diverse content types into unified memory:

Modality Input Processing
conversation JSON chat logs Extract preferences, opinions, habits, relationships
document Text files (.txt, .md) Extract knowledge, skills, facts
image PNG, JPG, etc. Vision model extracts visual concepts and descriptions
video Video files Frame extraction + vision analysis
audio Audio files Transcription + text processing

All modalities are unified into the same three-layer hierarchy, enabling cross-modal retrieval.


🚀 Quick Start

Option 1: Cloud Version

Try MemU instantly without any setup:

👉 memu.so - Hosted cloud service with full API access

For enterprise deployment and custom solutions, contact info@nevamind.ai

Cloud API (v3)

Base URL https://api.memu.so
Auth Authorization: Bearer YOUR_API_KEY
Method Endpoint Description
POST /api/v3/memory/memorize Register a memorization task
GET /api/v3/memory/memorize/status/{task_id} Get task status
POST /api/v3/memory/categories List memory categories
POST /api/v3/memory/retrieve Retrieve memories (semantic search)

📚 Full API Documentation


Option 2: Self-Hosted

Installation

pip install -e .

Basic Example

Requirements: Python 3.13+ and an OpenAI API key

Test with In-Memory Storage (no database required):

export OPENAI_API_KEY=your_api_key
cd tests
python test_inmemory.py

Test with PostgreSQL Storage (requires pgvector):

# Start PostgreSQL with pgvector
docker run -d \
  --name memu-postgres \
  -e POSTGRES_USER=postgres \
  -e POSTGRES_PASSWORD=postgres \
  -e POSTGRES_DB=memu \
  -p 5432:5432 \
  pgvector/pgvector:pg16

# Run the test
export OPENAI_API_KEY=your_api_key
cd tests
python test_postgres.py

Both examples demonstrate the complete workflow:

  1. Memorize: Process a conversation file and extract structured memory
  2. Retrieve (RAG): Fast embedding-based search
  3. Retrieve (LLM): Deep semantic understanding search

See tests/test_inmemory.py and tests/test_postgres.py for the full source code.


Custom LLM and Embedding Providers

MemU supports custom LLM and embedding providers beyond OpenAI. Configure them via llm_profiles:

from memu import MemUService

service = MemUService(
    llm_profiles={
        # Default profile for LLM operations
        "default": {
            "base_url": "https://dashscope.aliyuncs.com/compatible-mode/v1",
            "api_key": "your_api_key",
            "chat_model": "qwen3-max",
            "client_backend": "sdk"  # "sdk" or "http"
        },
        # Separate profile for embeddings
        "embedding": {
            "base_url": "https://api.voyageai.com/v1",
            "api_key": "your_voyage_api_key",
            "embed_model": "voyage-3.5-lite"
        }
    },
    # ... other configuration
)

📖 Core APIs

memorize() - Extract and Store Memory

Processes input resources and extracts structured memory:

memorize
result = await service.memorize(
    resource_url="path/to/file.json",  # File path or URL
    modality="conversation",            # conversation | document | image | video | audio
    user={"user_id": "123"}             # Optional: scope to a user
)

# Returns:
{
    "resource": {...},      # Stored resource metadata
    "items": [...],         # Extracted memory items
    "categories": [...]     # Updated category summaries
}

retrieve() - Query Memory

Retrieves relevant memory based on queries. MemU supports two retrieval strategies:

retrieve

RAG-based Retrieval (method="rag")

Fast embedding vector search using cosine similarity:

  • Fast: Pure vector computation
  • Scalable: Efficient for large memory stores
  • Returns scores: Each result includes similarity score

LLM-based Retrieval (method="llm")

Deep semantic understanding through direct LLM reasoning:

  • Deep understanding: LLM comprehends context and nuance
  • Query rewriting: Automatically refines query at each tier
  • Adaptive: Stops early when sufficient information is found

Comparison

Aspect RAG LLM
Speed ⚡ Fast 🐢 Slower
Cost 💰 Low 💰💰 Higher
Semantic depth Medium Deep
Tier 2 scope All items Only items in relevant categories
Output With similarity scores Ranked by LLM reasoning

Both methods support:

  • Context-aware rewriting: Resolves pronouns using conversation history
  • Progressive search: Categories → Items → Resources
  • Sufficiency checking: Stops when enough information is retrieved

Usage

result = await service.retrieve(
    queries=[
        {"role": "user", "content": {"text": "What are their preferences?"}},
        {"role": "user", "content": {"text": "Tell me about work habits"}}
    ],
    where={"user_id": "123"}  # Optional: scope filter
)

# Returns:
{
    "categories": [...],     # Relevant categories (with scores for RAG)
    "items": [...],          # Relevant memory items
    "resources": [...],      # Related raw resources
    "next_step_query": "..." # Rewritten query for follow-up (if applicable)
}

Scope Filtering: Use where to filter by user model fields:

  • where={"user_id": "123"} - exact match
  • where={"agent_id__in": ["1", "2"]} - match any in list
  • Omit where to retrieve across all scopes

📚 For complete API documentation, see SERVICE_API.md - includes all methods, CRUD operations, pipeline configuration, and configuration types.


💡 Use Cases

Example 1: Conversation Memory

Extract and organize memory from multi-turn conversations:

export OPENAI_API_KEY=your_api_key
python examples/example_1_conversation_memory.py

What it does:

  • Processes multiple conversation JSON files
  • Extracts memory items (preferences, habits, opinions, relationships)
  • Generates category markdown files (preferences.md, work_life.md, etc.)

Best for: Personal AI assistants, customer support bots, social chatbots


Example 2: Skill Extraction from Logs

Extract skills and lessons learned from agent execution logs:

export OPENAI_API_KEY=your_api_key
python examples/example_2_skill_extraction.py

What it does:

  • Processes agent logs sequentially
  • Extracts actions, outcomes, and lessons learned
  • Demonstrates incremental learning - memory evolves with each file
  • Generates evolving skill guides (log_1.mdlog_2.mdskill.md)

Best for: DevOps teams, agent self-improvement, knowledge management


Example 3: Multimodal Memory

Process diverse content types into unified memory:

export OPENAI_API_KEY=your_api_key
python examples/example_3_multimodal_memory.py

What it does:

  • Processes documents and images together
  • Extracts memory from different content types
  • Unifies into cross-modal categories (technical_documentation, visual_diagrams, etc.)

Best for: Documentation systems, learning platforms, research tools


📊 Performance

MemU achieves 92.09% average accuracy on the Locomo benchmark across all reasoning tasks.

benchmark

View detailed experimental data: memU-experiment


🧩 Ecosystem

Repository Description Use Case
memU Core algorithm engine Embed AI memory into your product
memU-server Backend service with CRUD, user system, RBAC Self-host a memory backend
memU-ui Visual dashboard Ready-to-use memory console

Quick Links:


If you find memU useful or interesting, a GitHub Star ⭐️ would be greatly appreciated.

🤝 Partners

Ten OpenAgents Milvus xRoute Jazz Buddie Bytebase LazyLLM


📄 License

Apache License 2.0


🌍 Community


Star us on GitHub to get notified about new releases!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

memu_py-1.1.2.tar.gz (11.6 MB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

memu_py-1.1.2-cp313-abi3-win_amd64.whl (221.4 kB view details)

Uploaded CPython 3.13+Windows x86-64

memu_py-1.1.2-cp313-abi3-manylinux_2_39_x86_64.whl (358.2 kB view details)

Uploaded CPython 3.13+manylinux: glibc 2.39+ x86-64

memu_py-1.1.2-cp313-abi3-manylinux_2_39_aarch64.whl (349.7 kB view details)

Uploaded CPython 3.13+manylinux: glibc 2.39+ ARM64

memu_py-1.1.2-cp313-abi3-macosx_11_0_arm64.whl (327.0 kB view details)

Uploaded CPython 3.13+macOS 11.0+ ARM64

memu_py-1.1.2-cp313-abi3-macosx_10_12_x86_64.whl (329.5 kB view details)

Uploaded CPython 3.13+macOS 10.12+ x86-64

File details

Details for the file memu_py-1.1.2.tar.gz.

File metadata

  • Download URL: memu_py-1.1.2.tar.gz
  • Upload date:
  • Size: 11.6 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for memu_py-1.1.2.tar.gz
Algorithm Hash digest
SHA256 13cd3fe1b435a3deea4b59f1f14ee7d5d13bbd665ab2ae02e2fd351b45d5d00f
MD5 1056e1236d9ef1ff6f444d272a54565f
BLAKE2b-256 c0c51738901315c243c7a48a86d58102478c4ac8a7edf84a92e9ca7b9c25c9ca

See more details on using hashes here.

Provenance

The following attestation bundles were made for memu_py-1.1.2.tar.gz:

Publisher: release-please.yml on NevaMind-AI/memU

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file memu_py-1.1.2-cp313-abi3-win_amd64.whl.

File metadata

  • Download URL: memu_py-1.1.2-cp313-abi3-win_amd64.whl
  • Upload date:
  • Size: 221.4 kB
  • Tags: CPython 3.13+, Windows x86-64
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for memu_py-1.1.2-cp313-abi3-win_amd64.whl
Algorithm Hash digest
SHA256 6eb1a9c8650935ab98e0af02ecda3593d829f671ef23b428ded60f26f9ee8695
MD5 b0a0a5f57d66c49e62c2e18c9d9ab179
BLAKE2b-256 7411ae7f05a5923035d35c9e5d746f07ceae9f90f8d0c153af4bb4bffa3af666

See more details on using hashes here.

Provenance

The following attestation bundles were made for memu_py-1.1.2-cp313-abi3-win_amd64.whl:

Publisher: release-please.yml on NevaMind-AI/memU

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file memu_py-1.1.2-cp313-abi3-manylinux_2_39_x86_64.whl.

File metadata

File hashes

Hashes for memu_py-1.1.2-cp313-abi3-manylinux_2_39_x86_64.whl
Algorithm Hash digest
SHA256 22340d0b69fec89997c6a232b47bfe67b5822c615577b8c01a915f1579f6e2ce
MD5 5198b5830438650dbe61d281a81e4054
BLAKE2b-256 f55636f890907c124a2c13decf3d12afa6a8a43edee0ecfb10249451a3b3ccdc

See more details on using hashes here.

Provenance

The following attestation bundles were made for memu_py-1.1.2-cp313-abi3-manylinux_2_39_x86_64.whl:

Publisher: release-please.yml on NevaMind-AI/memU

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file memu_py-1.1.2-cp313-abi3-manylinux_2_39_aarch64.whl.

File metadata

File hashes

Hashes for memu_py-1.1.2-cp313-abi3-manylinux_2_39_aarch64.whl
Algorithm Hash digest
SHA256 d732c5b402d8562146f94406ff70609900419a2a13288eed3cd632e47e0191fb
MD5 34221baee455e72c62548badbc7ec365
BLAKE2b-256 49b16eb53f137b78388eb97f7a0a1951f0719cc921c11a818687271659b13b4e

See more details on using hashes here.

Provenance

The following attestation bundles were made for memu_py-1.1.2-cp313-abi3-manylinux_2_39_aarch64.whl:

Publisher: release-please.yml on NevaMind-AI/memU

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file memu_py-1.1.2-cp313-abi3-macosx_11_0_arm64.whl.

File metadata

File hashes

Hashes for memu_py-1.1.2-cp313-abi3-macosx_11_0_arm64.whl
Algorithm Hash digest
SHA256 1652457d93f5201e09f1eb894c1558743371a1ea1950157fad5acc71d2223efc
MD5 23360b4bf84db0fde2fdd431699b9477
BLAKE2b-256 10a8110e517e32fbc6f1d39963ae05d8a8ceb613045f0eaca20430dad179ba1d

See more details on using hashes here.

Provenance

The following attestation bundles were made for memu_py-1.1.2-cp313-abi3-macosx_11_0_arm64.whl:

Publisher: release-please.yml on NevaMind-AI/memU

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file memu_py-1.1.2-cp313-abi3-macosx_10_12_x86_64.whl.

File metadata

File hashes

Hashes for memu_py-1.1.2-cp313-abi3-macosx_10_12_x86_64.whl
Algorithm Hash digest
SHA256 ed9cfd6e7a8ced7a80a6d496b1b79d4de73afa7754fa04f2df95ed7cc0c6ee73
MD5 9f77f6398af81afeb351b8300be54f00
BLAKE2b-256 7e440e576382105bb207d6be6454d7abaf5f8af6bcb00ad3e3b7c0dc113e6ef1

See more details on using hashes here.

Provenance

The following attestation bundles were made for memu_py-1.1.2-cp313-abi3-macosx_10_12_x86_64.whl:

Publisher: release-please.yml on NevaMind-AI/memU

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page