Personal memory MCP server — search your WhatsApp, Discord, and notes with hybrid AI
Project description
Memoreei
One stop memory shop for all your AIs
Memoreei is a local-first MCP server that gives Claude (and any MCP-compatible AI) persistent, searchable memory across your conversations, chats, and notes.
"What did I say about the API redesign last month?"
"Find everything from that Telegram thread about the deployment issue."
"What were my notes on the React migration?"
Claude can answer these. Without Memoreei, it can't.
Key Features
- Local-first — all data stays in a single SQLite file on your machine
- Multi-source — WhatsApp, Discord, Telegram, Slack, Matrix, iMessage, Signal, Gmail, and more
- MCP-native — 21 tools exposed via the Model Context Protocol, usable by any MCP client
- Hybrid search — BM25 keyword search + vector semantic search, fused with Reciprocal Rank Fusion
- No mandatory cloud — default embedding model runs fully offline via ONNX
Supported Sources
| Source | Type | Status | Tests |
|---|---|---|---|
WhatsApp (.txt export) |
File import | ✅ Stable | tested |
| Discord (bot API) | Live sync | ✅ Stable | tested |
| Discord Data Package (GDPR export) | File import | ✅ Stable | tested |
| Telegram (bot API) | Live sync | ✅ Stable | tested |
| Slack (Web API) | Live sync | ✅ Stable | needs testing |
| Matrix (Client-Server API) | Live sync | ✅ Stable | needs testing |
| Mastodon (REST API) | Live sync | ✅ Stable | needs testing |
| Gmail (IMAP) | Live sync | ✅ Stable | needs testing |
| Instagram DMs (GDPR export) | File import | ✅ Stable | tested |
| Facebook Messenger (GDPR export) | File import | ✅ Stable | tested |
| SMS Backup & Restore XML | File import | ✅ Stable | tested |
| Generic JSON / JSON-lines | File import | ✅ Stable | tested |
| Generic CSV / TSV | File import | ✅ Stable | tested |
| iMessage (macOS) | Live sync | 🧪 Beta | tested |
| Signal Desktop | Live sync | 🧪 Beta | tested |
Manual notes (add_memory) |
MCP tool | ✅ Stable | tested |
File import — one-time or repeated ingest from an exported file. Live sync — incremental sync via API, checkpoint-based (only fetches new messages).
Quick Start
git clone https://github.com/your-org/memoreei.git
cd memoreei
python -m venv .venv && source .venv/bin/activate
pip install -e .
memoreei setup # interactive — pick connectors, enter credentials
memoreei sync # pull messages from configured sources
memoreei serve # start the MCP server
Connect to Claude Code
Add to .mcp.json in your project (or ~/.claude/settings.json for global):
{
"mcpServers": {
"memoreei": {
"command": "/path/to/memoreei/.venv/bin/python",
"args": ["-m", "memoreei.server"],
"cwd": "/path/to/memoreei"
}
}
}
Connect to Claude Desktop
Add to ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or %APPDATA%\Claude\claude_desktop_config.json (Windows):
{
"mcpServers": {
"memoreei": {
"command": "/path/to/memoreei/.venv/bin/python",
"args": ["-m", "memoreei.server"]
}
}
}
MCP Tools
All 21 tools are available to any connected MCP client.
Search & Retrieval
search_memory
Hybrid keyword + semantic search across all ingested memories.
| Parameter | Type | Default | Description |
|---|---|---|---|
query |
string | required | Natural language search query |
limit |
int | 10 | Max results to return |
source |
string | — | Filter by source, e.g. whatsapp:friends, discord:1234567890 |
participant |
string | — | Filter by sender name (case-insensitive) |
after |
string | — | ISO date lower bound, e.g. 2026-01-01 |
before |
string | — | ISO date upper bound |
get_context
Fetch surrounding messages for a specific memory — essential for understanding the conversation around a result.
| Parameter | Type | Default | Description |
|---|---|---|---|
memory_id |
string | required | Memory ID (ULID) from search results |
before |
int | 5 | Messages to include before the target |
after |
int | 5 | Messages to include after the target |
add_memory
Manually store a note, fact, or anything worth remembering. Auto-embeds content immediately.
| Parameter | Type | Default | Description |
|---|---|---|---|
content |
string | required | Text to remember |
source |
string | "manual" |
Source label |
metadata |
dict | — | Optional key-value pairs |
list_sources
Inventory all ingested sources with message counts.
{
"whatsapp:friends": 1842,
"discord:1234567890": 391,
"telegram:-100987654321": 227,
"manual": 12
}
File Import Tools
ingest_whatsapp
Import a WhatsApp chat export .txt file. Handles multi-line messages, media placeholders, and deduplication on re-import.
| Parameter | Type | Description |
|---|---|---|
file_path |
string | Path to the WhatsApp .txt export file |
import_discord_package
Import a Discord GDPR data export (all channels and DMs). Accepts a ZIP file or extracted folder.
Request your data at: Discord Settings → Privacy & Safety → Request All of My Data
| Parameter | Type | Description |
|---|---|---|
package_path |
string | Path to extracted folder or ZIP file |
import_messenger
Import Facebook Messenger messages from a GDPR data download (JSON format).
Download at: Facebook Settings → Your Information → Download Your Information
| Parameter | Type | Description |
|---|---|---|
data_path |
string | Path to the extracted folder containing messages/inbox/ |
import_instagram
Import Instagram DMs from a GDPR data download (JSON format).
Download at: Instagram Settings → Accounts Center → Your Information → Download Your Information
| Parameter | Type | Description |
|---|---|---|
data_path |
string | Path to the extracted folder containing your_instagram_activity/ |
import_sms_backup
Import SMS/MMS messages from an Android SMS Backup & Restore XML file.
| Parameter | Type | Description |
|---|---|---|
file_path |
string | Path to the XML backup file |
import_json_file
Import messages from any JSON file. Supports JSON arrays, JSON-lines, and wrapped objects. Covers Google Chat takeout, Google Hangouts, LinkedIn data, and custom formats.
| Parameter | Type | Default | Description |
|---|---|---|---|
file_path |
string | required | Path to the JSON or JSON-lines file |
content_field |
string | required | Field name containing the message text |
sender_field |
string | — | Field name for sender name |
timestamp_field |
string | — | Field name for timestamp (auto-detects format) |
source_label |
string | "json-import" |
Tag for imported messages |
import_csv_file
Import messages from any CSV or TSV file. Auto-detects delimiter (comma, tab, semicolon). Covers LinkedIn exports and any custom spreadsheet format.
| Parameter | Type | Default | Description |
|---|---|---|---|
file_path |
string | required | Path to the CSV/TSV file |
content_column |
string | required | Column name for message text |
sender_column |
string | — | Column name for sender |
timestamp_column |
string | — | Column name for timestamp |
source_label |
string | "csv-import" |
Tag for imported messages |
Live Sync Tools
All sync tools use checkpoint-based incremental sync — only new messages are fetched on subsequent runs.
sync_discord
Sync messages from a Discord channel via the bot API.
| Parameter | Type | Default | Description |
|---|---|---|---|
channel_id |
string | DISCORD_CHANNEL_ID env var |
Discord channel ID |
sync_telegram
Sync messages received by a Telegram bot via getUpdates. Bot must be a member of the target group or have received DMs.
| Parameter | Type | Default | Description |
|---|---|---|---|
chat_id |
string | TELEGRAM_CHAT_ID env var |
Chat ID (positive = DM, negative = group). Syncs all if omitted. |
sync_matrix
Sync messages from a Matrix room using the Client-Server API.
| Parameter | Type | Default | Description |
|---|---|---|---|
room_id |
string | MATRIX_ROOM_ID env var |
Matrix room ID, e.g. !abc123:matrix.org |
sync_slack
Sync messages from a Slack channel via the Web API (conversations.history). Requires bot token with channels:history and users:read scopes.
| Parameter | Type | Default | Description |
|---|---|---|---|
channel_id |
string | SLACK_CHANNEL_ID env var |
Slack channel ID, e.g. C1234567890 |
sync_email
Sync Gmail messages via IMAP. Uses per-folder UID checkpointing.
| Parameter | Type | Default | Description |
|---|---|---|---|
folder |
string | "INBOX" |
IMAP folder, e.g. [Gmail]/Sent Mail |
max_emails |
int | 200 | Maximum emails per sync |
sync_mastodon
Sync Mastodon posts. Public and hashtag timelines require no authentication.
| Parameter | Type | Default | Description |
|---|---|---|---|
instance |
string | MASTODON_INSTANCE env var |
Instance URL, e.g. https://fosstodon.org |
hashtag |
string | MASTODON_HASHTAG env var |
Hashtag without #, or omit for public timeline |
access_token |
string | MASTODON_ACCESS_TOKEN env var |
OAuth token (optional, for home timeline) |
sync_imessage
🧪 Beta — macOS only. Requires Full Disk Access for Terminal in System Settings → Privacy & Security.
Sync iMessage/SMS conversations from ~/Library/Messages/chat.db (read-only).
| Parameter | Type | Description |
|---|---|---|
chat_name |
string | Optional — filter by contact name or identifier (e.g. +1234567890) |
sync_signal
🧪 Beta — requires
pysqlcipher3. Signal Desktop must be installed.
Sync Signal Desktop messages from the local encrypted SQLCipher database.
Default paths: ~/.config/Signal/sql/db.sqlite (Linux), ~/Library/Application Support/Signal/sql/db.sqlite (macOS).
| Parameter | Type | Description |
|---|---|---|
conversation_id |
string | Optional — filter by conversation ID, name, or phone number |
Utility Tools
refresh_memory
Trigger an immediate sync of all configured sources. Returns count of new messages.
sync_all
Sync every configured connector and return counts per source.
CLI Reference
# Interactive setup — configure connectors, writes to .env
memoreei setup # pick from a list (spacebar to select, enter to confirm)
memoreei setup gmail # configure a specific connector directly
# Start the MCP server (stdio transport, default)
memoreei serve
# Start with SSE transport (for HTTP clients)
memoreei serve --sse --port 8080
# Show DB stats: message counts, sources, last sync times
memoreei status
# Sync all configured sources
memoreei sync
# Sync a specific source
memoreei sync discord
memoreei sync telegram
memoreei sync matrix
memoreei sync slack
memoreei sync email
memoreei sync mastodon
# Search from the terminal
memoreei search "API redesign notes"
memoreei search "printer issue" --limit 5 --source whatsapp:friends
# Import files
memoreei import-whatsapp /path/to/WhatsApp\ Chat.txt
memoreei import-sms /path/to/sms-backup.xml
memoreei import-discord-package /path/to/discord-package.zip
# Show current configuration (tokens masked)
memoreei config
Architecture
┌──────────────────────────────────────────────────────────────────────┐
│ Your Data Sources │
│ │
│ File Imports Live Sync (API) │
│ ───────────────────────────── ────────────────────────── │
│ WhatsApp .txt Instagram JSON Discord Telegram │
│ Messenger JSON SMS Backup XML Slack Matrix │
│ Discord ZIP Generic JSON/CSV Gmail Mastodon │
│ iMessage Signal │
└──────────────┬───────────────────────────────┬─────────────────────┘
│ │
▼ ▼
┌──────────────────────────────────────────────────────────────────────┐
│ Memoreei MCP Server │
│ │
│ ┌────────────────────┐ ┌──────────────────┐ ┌─────────────────┐ │
│ │ Connectors │ │ Hybrid Search │ │ MCP Tools │ │
│ │ │ │ │ │ │ │
│ │ whatsapp.py │ │ FTS5 (BM25) │ │ search_memory │ │
│ │ discord_*.py │ │ + vector cosine │ │ get_context │ │
│ │ telegram_*.py │ │ + RRF fusion │ │ add_memory │ │
│ │ slack_*.py │ │ │ │ list_sources │ │
│ │ matrix_*.py │ └────────┬─────────┘ │ ingest_* │ │
│ │ email_*.py │ │ │ import_* │ │
│ │ mastodon_*.py │ │ │ sync_* │ │
│ │ imessage_*.py │ ┌────────▼──────────┐ │ refresh_memory │ │
│ │ signal_*.py │ │ SQLite Database │ │ sync_all │ │
│ │ generic_*.py │ │ memories + FTS5 │ └────────┬────────┘ │
│ └────────────────────┘ │ embeddings BLOB │ │ │
│ │ sync checkpoints │ │ │
│ └───────────────────┘ │ │
└────────────────────────────────────────────────────────┼────────────┘
│ stdio / JSON-RPC
▼
┌─────────────────────┐
│ MCP Clients │
│ │
│ Claude Code │
│ Claude Desktop │
│ Any MCP client │
└─────────────────────┘
Configuration
Copy .env.example to .env and fill in the credentials for the sources you want to use. Unused connectors can be left blank.
Core
| Variable | Default | Description |
|---|---|---|
EMBEDDING_PROVIDER |
fastembed |
fastembed (local ONNX, no API key) or openai |
OPENAI_API_KEY |
— | Required only if EMBEDDING_PROVIDER=openai |
MEMOREEI_DB_PATH |
./memoreei.db |
SQLite database path |
AUTO_SYNC |
false |
Enable background sync loop on server start |
AUTO_SYNC_INTERVAL |
3600 |
Background sync interval in seconds |
Discord
| Variable | Description |
|---|---|
DISCORD_BOT_TOKEN |
Bot token from Discord Developer Portal |
DISCORD_CHANNEL_ID |
Default channel ID for sync_discord |
Telegram
| Variable | Description |
|---|---|
TELEGRAM_BOT_TOKEN |
Bot token from @BotFather |
TELEGRAM_CHAT_ID |
Default chat ID (positive = DM, negative = group) |
Matrix
| Variable | Description |
|---|---|
MATRIX_HOMESERVER |
Homeserver URL, e.g. https://matrix.org |
MATRIX_ACCESS_TOKEN |
User access token |
MATRIX_ROOM_ID |
Default room ID, e.g. !abc123:matrix.org |
Slack
| Variable | Description |
|---|---|
SLACK_BOT_TOKEN |
Bot token (xoxb-...), requires channels:history + users:read |
SLACK_CHANNEL_ID |
Default channel ID, e.g. C1234567890 |
Gmail
| Variable | Description |
|---|---|
GMAIL_EMAIL |
Gmail address |
GMAIL_APP_PASSWORD |
App Password (required if 2FA enabled) |
Mastodon
| Variable | Default | Description |
|---|---|---|
MASTODON_INSTANCE |
https://mastodon.social |
Instance URL |
MASTODON_HASHTAG |
— | Default hashtag (without #) |
MASTODON_ACCESS_TOKEN |
— | OAuth token (optional, for home timeline) |
iMessage (macOS only)
| Variable | Default | Description |
|---|---|---|
IMESSAGE_DB_PATH |
~/Library/Messages/chat.db |
Override path to chat.db |
Signal Desktop
| Variable | Description |
|---|---|
SIGNAL_DB_PATH |
Override path to Signal's db.sqlite |
SIGNAL_CONFIG_PATH |
Override path to Signal's config.json |
How Hybrid Search Works
Memoreei runs two searches in parallel and fuses the results:
Query: "that weird API rate limit issue"
│
├──▶ FTS5 BM25 keyword search
│ Matches "API", "rate", "limit" — fast, exact
│ Returns ranked list of IDs
│
└──▶ Vector search (cosine similarity)
Matches "throttling", "429 errors", "backoff"
Returns ranked list of IDs
│
▼
Reciprocal Rank Fusion (RRF)
─────────────────────────────
score(item) = Σ 1 / (60 + rank_i)
i ∈ {keyword_rank, vector_rank}
Items in BOTH result sets are boosted.
Items in only one set still contribute.
Top N returned, then filtered by source/participant/date.
Why RRF? Rank-based fusion requires no score normalization across different scales. The constant k=60 is the standard default from the original paper and empirically outperforms weighted linear combinations.
Default embedding model: BAAI/bge-small-en-v1.5 via FastEmbed — 384-dimensional vectors, ~23 MB ONNX model, runs fully offline.
Privacy
Local-first by design.
- All data stored in a single SQLite file on your machine
- Default embedding model (FastEmbed) runs entirely offline via ONNX — zero network calls
- OpenAI embeddings are strictly opt-in (
EMBEDDING_PROVIDER=openai) - No telemetry, no analytics, no cloud sync
- Your messages never leave your machine in the default configuration
What requires network access:
- Live sync connectors (Discord, Telegram, Slack, Matrix, Gmail, Mastodon) make outbound API calls to those services
EMBEDDING_PROVIDER=openaisends message text to OpenAI's API for embedding
The .env file and memoreei.db are in .gitignore.
Project Structure
memoreei/
├── src/memoreei/
│ ├── server.py # MCP server, all 21 tool definitions
│ ├── cli.py # Typer CLI (memoreei command)
│ ├── config.py # Config loader (.env → dataclass)
│ ├── sync_manager.py # Background sync loop + per-source dispatch
│ ├── storage/
│ │ ├── database.py # SQLite + FTS5 virtual table + vector storage
│ │ └── models.py # MemoryItem dataclass
│ ├── search/
│ │ ├── embeddings.py # FastEmbed (local ONNX) + OpenAI providers
│ │ └── hybrid.py # RRF fusion of BM25 + vector results
│ ├── connectors/
│ │ ├── whatsapp.py # WhatsApp .txt export parser
│ │ ├── discord_connector.py # Discord REST API + checkpoint sync
│ │ ├── discord_package_connector.py # Discord GDPR export importer
│ │ ├── telegram_connector.py # Telegram Bot API + checkpoint sync
│ │ ├── slack_connector.py # Slack Web API + checkpoint sync
│ │ ├── matrix_connector.py # Matrix Client-Server API
│ │ ├── email_connector.py # Gmail IMAP + UID checkpoint
│ │ ├── mastodon_connector.py # Mastodon REST API
│ │ ├── imessage_connector.py # macOS Messages chat.db (read-only)
│ │ ├── signal_connector.py # Signal Desktop SQLCipher DB
│ │ ├── messenger_connector.py # Facebook Messenger GDPR export
│ │ ├── instagram_connector.py # Instagram DMs GDPR export
│ │ ├── sms_connector.py # SMS Backup & Restore XML
│ │ └── generic_connector.py # Generic JSON/CSV importer
│ └── tools/
│ └── memory_tools.py # MCP tool implementations
├── tests/ # Unit + integration tests
├── .env.example # Config template
└── pyproject.toml # Package metadata
Example Apps
These standalone apps are built on Memoreei and show what you can build with it:
- Movie Ring — Discover what movies your friends are talking about
- Contact Dossier — CRM for your real relationships
For simpler code examples (direct search, Discord bot), see the examples/ directory.
Contributing
See CONTRIBUTING.md for guidelines on adding new connectors, running tests, and submitting pull requests.
License
MIT
Copyright (c) 2026 Memoreei Contributors
Permission is hereby granted, free of charge, to any person obtaining a copy
of this software and associated documentation files (the "Software"), to deal
in the Software without restriction, including without limitation the rights
to use, copy, modify, merge, publish, distribute, sublicense, and/or sell
copies of the Software, and to permit persons to whom the Software is
furnished to do so, subject to the following conditions:
The above copyright notice and this permission notice shall be included in all
copies or substantial portions of the Software.
THE SOFTWARE IS PROVIDED "AS IS", WITHOUT WARRANTY OF ANY KIND, EXPRESS OR
IMPLIED, INCLUDING BUT NOT LIMITED TO THE WARRANTIES OF MERCHANTABILITY,
FITNESS FOR A PARTICULAR PURPOSE AND NONINFRINGEMENT. IN NO EVENT SHALL THE
AUTHORS OR COPYRIGHT HOLDERS BE LIABLE FOR ANY CLAIM, DAMAGES OR OTHER
LIABILITY, WHETHER IN AN ACTION OF CONTRACT, TORT OR OTHERWISE, ARISING FROM,
OUT OF OR IN CONNECTION WITH THE SOFTWARE OR THE USE OR OTHER DEALINGS IN THE
SOFTWARE.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file memoreei-0.2.0.tar.gz.
File metadata
- Download URL: memoreei-0.2.0.tar.gz
- Upload date:
- Size: 102.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4958330abc07efe719ab7eac6c0943cbaf99b0176965e7d290bdfedd53fc0a50
|
|
| MD5 |
21c6a40f4c333ef54504de4642435ae9
|
|
| BLAKE2b-256 |
3f8cf04f99f265cf426214a3970b006d2abcaf29e59383c7a464ddfc276516f7
|
File details
Details for the file memoreei-0.2.0-py3-none-any.whl.
File metadata
- Download URL: memoreei-0.2.0-py3-none-any.whl
- Upload date:
- Size: 68.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c9060b204e3e717a5d1f05d637ff04b5e33edae8ddf73705a2e1ad9fed3a4d79
|
|
| MD5 |
79f40d6e915130532a178c52badb547a
|
|
| BLAKE2b-256 |
6806766340e51b496d04cbd19b7ca0adab24e9e2e7b2f23893aef068c30c5070
|