Skip to main content

Memory-enabled AI assistant with local LLM support

Project description

๐Ÿง  Mem-LLM

PyPI version Python 3.8+ License: MIT

Memory-enabled AI assistant with local LLM support

Mem-LLM is a powerful Python library that brings persistent memory capabilities to local Large Language Models. Build AI assistants that remember user interactions, manage knowledge bases, and work completely offline with Ollama.

โœจ Key Features

  • ๐Ÿง  Persistent Memory - Remembers conversations across sessions
  • ๐Ÿค– Universal Ollama Support - Works with ALL Ollama models (Qwen3, DeepSeek, Llama3, Granite, etc.)
  • ๐Ÿ’พ Dual Storage Modes - JSON (simple) or SQLite (advanced) memory backends
  • ๐Ÿ“š Knowledge Base - Built-in FAQ/support system with categorized entries
  • ๐ŸŽฏ Dynamic Prompts - Context-aware system prompts that adapt to active features
  • ๐Ÿ‘ฅ Multi-User Support - Separate memory spaces for different users
  • ๐Ÿ”ง Memory Tools - Search, export, and manage stored memories
  • ๐ŸŽจ Flexible Configuration - Personal or business usage modes
  • ๐Ÿ“Š Production Ready - Comprehensive test suite with 34+ automated tests
  • ๐Ÿ”’ 100% Local & Private - No cloud dependencies, your data stays yours

๐Ÿš€ Quick Start

Installation

pip install mem-llm

Prerequisites

Install and start Ollama:

# Install Ollama (visit https://ollama.ai)
# Then pull a model
ollama pull granite4:tiny-h

# Start Ollama service
ollama serve

Basic Usage

from mem_llm import MemAgent

# Create an agent
agent = MemAgent(model="granite4:tiny-h")

# Set user and chat
agent.set_user("alice")
response = agent.chat("My name is Alice and I love Python!")
print(response)

# Memory persists across sessions
response = agent.chat("What's my name and what do I love?")
print(response)  # Agent remembers: "Your name is Alice and you love Python!"

That's it! Just 5 lines of code to get started.

๐Ÿ“– Usage Examples

Multi-User Conversations

from mem_llm import MemAgent

agent = MemAgent()

# User 1
agent.set_user("alice")
agent.chat("I'm a Python developer")

# User 2
agent.set_user("bob")
agent.chat("I'm a JavaScript developer")

# Each user has separate memory
agent.set_user("alice")
response = agent.chat("What do I do?")  # "You're a Python developer"

Advanced Configuration

from mem_llm import MemAgent

# Use SQL database with knowledge base
agent = MemAgent(
    model="qwen3:8b",
    use_sql=True,
    load_knowledge_base=True,
    config_file="config.yaml"
)

# Add knowledge base entry
agent.add_kb_entry(
    category="FAQ",
    question="What are your hours?",
    answer="We're open 9 AM - 5 PM EST, Monday-Friday"
)

# Agent will use KB to answer
response = agent.chat("When are you open?")

Memory Tools

from mem_llm import MemAgent

agent = MemAgent(use_sql=True)
agent.set_user("alice")

# Chat with memory
agent.chat("I live in New York")
agent.chat("I work as a data scientist")

# Search memories
results = agent.search_memories("location")
print(results)  # Finds "New York" memory

# Export all data
data = agent.export_user_data()
print(f"Total memories: {len(data['memories'])}")

# Get statistics
stats = agent.get_memory_stats()
print(f"Users: {stats['total_users']}, Memories: {stats['total_memories']}")

CLI Interface

# Interactive chat
mem-llm chat

# With specific model
mem-llm chat --model llama3:8b

# Customer service mode
mem-llm customer-service

# Knowledge base management
mem-llm kb add --category "FAQ" --question "How to install?" --answer "Run: pip install mem-llm"
mem-llm kb list
mem-llm kb search "install"

๐ŸŽฏ Usage Modes

Personal Mode (Default)

  • Single user with JSON storage
  • Simple and lightweight
  • Perfect for personal projects
  • No configuration needed
agent = MemAgent()  # Automatically uses personal mode

Business Mode

  • Multi-user with SQL database
  • Knowledge base support
  • Advanced memory tools
  • Requires configuration file
agent = MemAgent(
    config_file="config.yaml",
    use_sql=True,
    load_knowledge_base=True
)

๐Ÿ”ง Configuration

Create a config.yaml file for advanced features:

# Usage mode: 'personal' or 'business'
usage_mode: business

# LLM settings
llm:
  model: granite4:tiny-h
  base_url: http://localhost:11434
  temperature: 0.7
  max_tokens: 2000

# Memory settings
memory:
  type: sql  # or 'json'
  db_path: ./data/memory.db
  
# Knowledge base
knowledge_base:
  enabled: true
  kb_path: ./data/knowledge_base.db

# Logging
logging:
  level: INFO
  file: logs/mem_llm.log

๐Ÿงช Supported Models

Mem-LLM works with ALL Ollama models, including:

  • โœ… Thinking Models: Qwen3, DeepSeek, QwQ
  • โœ… Standard Models: Llama3, Granite, Phi, Mistral
  • โœ… Specialized Models: CodeLlama, Vicuna, Neural-Chat
  • โœ… Any Custom Model in your Ollama library

Model Compatibility Features

  • ๐Ÿ”„ Automatic thinking mode detection
  • ๐ŸŽฏ Dynamic prompt adaptation
  • โšก Token limit optimization (2000 tokens)
  • ๐Ÿ”ง Automatic retry on empty responses

๐Ÿ“š Architecture

mem-llm/
โ”œโ”€โ”€ mem_llm/
โ”‚   โ”œโ”€โ”€ mem_agent.py           # Main agent class
โ”‚   โ”œโ”€โ”€ memory_manager.py      # JSON memory backend
โ”‚   โ”œโ”€โ”€ memory_db.py           # SQL memory backend
โ”‚   โ”œโ”€โ”€ llm_client.py          # Ollama API client
โ”‚   โ”œโ”€โ”€ knowledge_loader.py    # Knowledge base system
โ”‚   โ”œโ”€โ”€ dynamic_prompt.py      # Context-aware prompts
โ”‚   โ”œโ”€โ”€ memory_tools.py        # Memory management tools
โ”‚   โ”œโ”€โ”€ config_manager.py      # Configuration handler
โ”‚   โ””โ”€โ”€ cli.py                 # Command-line interface
โ””โ”€โ”€ examples/                  # Usage examples

๐Ÿ”ฅ Advanced Features

Dynamic Prompt System

Prevents hallucinations by only including instructions for enabled features:

agent = MemAgent(use_sql=True, load_knowledge_base=True)
# Agent automatically knows:
# โœ… Knowledge Base is available
# โœ… Memory tools are available
# โœ… SQL storage is active

Knowledge Base Categories

Organize knowledge by category:

agent.add_kb_entry(category="FAQ", question="...", answer="...")
agent.add_kb_entry(category="Technical", question="...", answer="...")
agent.add_kb_entry(category="Billing", question="...", answer="...")

Memory Search & Export

Powerful memory management:

# Search across all memories
results = agent.search_memories("python", limit=5)

# Export everything
data = agent.export_user_data()

# Get insights
stats = agent.get_memory_stats()

๐Ÿ“ฆ Project Structure

Core Components

  • MemAgent: Main interface for building AI assistants
  • MemoryManager: JSON-based memory storage (simple)
  • SQLMemoryManager: SQLite-based storage (advanced)
  • OllamaClient: LLM communication handler
  • KnowledgeLoader: Knowledge base management

Optional Features

  • MemoryTools: Search, export, statistics
  • ConfigManager: YAML configuration
  • CLI: Command-line interface

๐Ÿงช Testing

Run the comprehensive test suite:

# Install dev dependencies
pip install -r requirements-dev.txt

# Run all tests (34+ automated tests)
cd tests
python run_all_tests.py

# Run specific test
python -m pytest test_mem_agent.py -v

Test Coverage

  • โœ… Core imports and dependencies
  • โœ… CLI functionality
  • โœ… Ollama connection and models
  • โœ… JSON memory operations
  • โœ… SQL memory operations
  • โœ… MemAgent features
  • โœ… Configuration management
  • โœ… Multi-user scenarios
  • โœ… Hallucination detection

๐Ÿ“ Examples

The examples/ directory contains ready-to-run demonstrations:

  1. 01_hello_world.py - Simplest possible example (5 lines)
  2. 02_basic_memory.py - Memory persistence basics
  3. 03_multi_user.py - Multiple users with separate memories
  4. 04_customer_service.py - Real-world customer service scenario
  5. 05_knowledge_base.py - FAQ/support system
  6. 06_cli_demo.py - Command-line interface examples
  7. 07_document_config.py - Configuration from documents

๐Ÿ› ๏ธ Development

Setup Development Environment

git clone https://github.com/emredeveloper/Mem-LLM.git
cd Mem-LLM
pip install -e .
pip install -r requirements-dev.txt

Running Tests

pytest tests/ -v --cov=mem_llm

Building Package

python -m build
twine upload dist/*

๐Ÿ“‹ Requirements

Core Dependencies

  • Python 3.8+
  • requests>=2.31.0
  • pyyaml>=6.0.1
  • click>=8.1.0

Optional Dependencies

  • pytest>=7.4.0 (for testing)
  • flask>=3.0.0 (for web interface)
  • fastapi>=0.104.0 (for API server)

๐Ÿค Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/AmazingFeature)
  3. Commit your changes (git commit -m 'Add some AmazingFeature')
  4. Push to the branch (git push origin feature/AmazingFeature)
  5. Open a Pull Request

๐Ÿ“„ License

This project is licensed under the MIT License - see the LICENSE file for details.

๐Ÿ‘ค Author

C. Emre KarataลŸ

๐Ÿ™ Acknowledgments

  • Built with Ollama for local LLM support
  • Inspired by the need for privacy-focused AI assistants
  • Thanks to all contributors and users

๐Ÿ“Š Project Status

  • Version: 1.0.10
  • Status: Beta (Production Ready)
  • Last Updated: October 20, 2025

๐Ÿ”— Links

๐Ÿ“ˆ Roadmap

  • Web UI dashboard
  • REST API server
  • Vector database integration
  • Multi-language support
  • Cloud backup options
  • Advanced analytics

โญ If you find this project useful, please give it a star on GitHub!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mem_llm-1.0.11.tar.gz (42.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mem_llm-1.0.11-py3-none-any.whl (35.4 kB view details)

Uploaded Python 3

File details

Details for the file mem_llm-1.0.11.tar.gz.

File metadata

  • Download URL: mem_llm-1.0.11.tar.gz
  • Upload date:
  • Size: 42.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for mem_llm-1.0.11.tar.gz
Algorithm Hash digest
SHA256 f56b7802ae7568ca5504983c15022ed0567e0bb882636916fa4049da4f7b7a66
MD5 cf295ba6d1611533062aeab2faf8b90f
BLAKE2b-256 514bbfb7708d41a7dc1cb7267ca68c81ddb8af15753b2c484c99a61a986099e1

See more details on using hashes here.

File details

Details for the file mem_llm-1.0.11-py3-none-any.whl.

File metadata

  • Download URL: mem_llm-1.0.11-py3-none-any.whl
  • Upload date:
  • Size: 35.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for mem_llm-1.0.11-py3-none-any.whl
Algorithm Hash digest
SHA256 c23d5243e3be0cf12b95e379b075aaab6ab76b75994d339d80ff83d8baad286f
MD5 c37132419aa08f38ed1ecff21c79dbf6
BLAKE2b-256 5cb29e6d72b9d1ff13a1bb26501e82e35ea3aa8330928fb779f69fa83f131a17

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page