MCP Server that brings long-term memory to AI coding tools via EverMemOS
Project description
evermemos-mcp
Long-term memory for AI coding assistants. Remember once, recall forever.
You spent thirty minutes explaining your architecture, naming conventions, and why you dropped MongoDB. Next session — gone. You explain it all over again.
evermemos-mcp fixes this. One remember call stores it. One briefing call brings it back — across any session, any client.
Benchmark: 60/60 recall vs 0/60 baseline. Zero attribution errors. P95 < 2s. (evidence)
Quick Start
Get your API key from EverMemOS Cloud, then add to your MCP client config:
{
"mcpServers": {
"evermemos-mcp": {
"type": "stdio",
"command": "uvx",
"args": ["evermemos-mcp@latest"],
"env": {
"EVERMEMOS_API_KEY": "your-key-here"
}
}
}
}
Or run directly:
uvx evermemos-mcp@latest
Works with Claude Code, Cursor, Cline, Cherry Studio, and any MCP-compatible client. See docs/05-client-integrations.md for client-specific setup.
Install from source
git clone https://github.com/tt-a1i/evermemos-mcp.git
cd evermemos-mcp
cp .env.example .env # set EVERMEMOS_API_KEY
uv run evermemos-mcp
MCP client config for source installs:
{
"mcpServers": {
"evermemos-mcp": {
"type": "stdio",
"command": "uv",
"args": ["run", "--directory", "/path/to/evermemos-mcp", "evermemos-mcp"],
"env": { "EVERMEMOS_API_KEY": "your-key-here" }
}
}
}
What You Get
7 Tools
| Tool | What it does |
|---|---|
list_spaces |
Discover available memory spaces |
remember |
Store context into long-term memory. Auto-detects sensitive content (API keys, passwords) and checks for conflicting memories |
request_status |
Check if a queued write has been extracted |
recall |
Search memories with 6 retrieval strategies (keyword / hybrid / vector / RRF / agentic / auto) |
briefing |
One-call session-start context restore: profile + episodes + facts + foresights |
forget |
Targeted deletion with verification workflow |
fetch_history |
Paginate through memory timeline by type |
Key Capabilities
- Space isolation —
coding:my-app,chat:preferences,study:ml-notes— memories never bleed across projects - Multi-space search — Query up to 10 spaces in one
recallcall with automatic source attribution - Sensitive content guard — Blocks API keys, passwords, tokens, private keys before storing. Asks user to confirm
- Memory conflict detection — Auto-checks for similar memories in
chat:*spaces. Surfaces conflicts so the agent can decide - Lifecycle tracking — Every result labeled
queued,provisional,fallback, orsearchableacross all tools - Traceable citations —
memory_type,snippet,timestamp,score,source_message_idon every result - Git auto-detection — Omit
space_idand it inferscoding:<repo-name>from git remote - Robust error handling — Retry with backoff (429/5xx), GET body fallback for proxy/WAF, structured error codes
Use Cases
Persistent architecture context:
You: remember we chose PostgreSQL because our data is highly relational
[space_id: coding:my-saas]
-- next day, new session --
You: what database did we choose and why?
→ "Chose PostgreSQL — highly relational data model"
Personal preferences that stick:
You: remember I prefer dark mode, vim keybindings, and concise responses
[space_id: chat:preferences]
-- any future session --
You: recall my UI preferences
→ "dark mode, vim keybindings, concise responses"
Cross-session learning notes:
You: remember bias-variance tradeoff — high bias = underfitting, high variance = overfitting
[space_id: study:ml-notes]
-- later --
You: briefing for study:ml-notes
→ profile + recent episodes + key facts + foresights
Why evermemos-mcp
There are other memory MCP servers. Here's what makes this one different:
| evermemos-mcp | Mem0 MCP | Letta/MemGPT | Official MCP memory | |
|---|---|---|---|---|
| Space isolation | domain:slug per project/topic |
No | No | No |
| Lifecycle tracking | queued → provisional → fallback → searchable | No | No | No |
| Sensitive content guard | API keys, passwords, tokens blocked | No | No | No |
| Conflict detection | Auto for chat spaces | No | No | No |
| Multi-space search | Up to 10 spaces in one call | No | No | No |
| Retrieval strategies | 6 methods + auto merge | Semantic only | Semantic only | None |
| Benchmark verified | 60/60 recall, 0 errors | — | — | — |
| Setup | uvx evermemos-mcp |
Cloud or self-host | Self-host required | npx |
Benchmark
Tested on a fixed 60-query set across coding, chat, and study spaces.
| Metric | With memory | Without memory |
|---|---|---|
| Hit rate | 60/60 (100%) | 0/60 (0%) |
| Attribution errors | 0 | — |
| P95 latency | 1958 ms | — |
Evidence:
How It Works
MCP Client (Claude Code / Cursor / Cline / Cherry Studio)
│
│ MCP stdio
▼
┌─────────────────────────────┐
│ evermemos-mcp server │
│ ┌───────────────────────┐ │
│ │ 7 Tool Handlers │ │
│ └──────────┬────────────┘ │
│ ┌──────────▼────────────┐ │
│ │ Memory Service │ │ Content guard → Conflict check → Cloud write → Lifecycle tracking
│ └──────────┬────────────┘ │
│ ┌──────────▼────────────┐ │
│ │ Space Catalog Service │ │ Space registry, metadata sync, cross-session recovery
│ └──────────┬────────────┘ │
│ ┌──────────▼────────────┐ │
│ │ EverMemOS HTTP Client│ │ Auth, retries, rate-limit backoff, error normalization
│ └──────────┬────────────┘ │
└─────────────┼───────────────┘
│ HTTPS
▼
EverMemOS Cloud API
- Cloud-first — All memories live in EverMemOS Cloud. No local state to lose.
- Async extraction —
rememberqueues content for AI extraction. Userequest_statusto track progress. - Not a thin wrapper — 2500+ lines of orchestration: fallback hierarchies, multi-method search merging, identity mirroring, partial failure recovery.
Space Templates
| Template | Use it for |
|---|---|
chat:preferences |
Durable personal preferences, names, tone, UI likes |
chat:daily |
Ongoing chat context that shouldn't leak into projects |
coding:<repo> |
Architecture decisions, conventions, bugs, project context |
study:<topic> |
Learning notes, topic progress, revision context |
Which Tool When
| Goal | Tool | Why |
|---|---|---|
| Start a new session | briefing |
Fastest way to restore context in one call |
| Find a specific fact | recall |
Relevance-ranked search across spaces |
| Review what happened | fetch_history |
Chronological timeline > ranked search for audits |
| Verify before/after delete | fetch_history |
Stable timeline for pre/post-delete checks |
Configuration
| Variable | Default | Description |
|---|---|---|
EVERMEMOS_API_KEY |
(required) | EverMemOS Cloud API key |
EVERMEMOS_USER_ID |
mcp-user |
Default user identity |
EVERMEMOS_DEFAULT_SPACE |
(auto) | Default space. Auto-detected from git remote as coding:<repo> |
EVERMEMOS_BASE_URL |
https://api.evermind.ai |
API endpoint |
EVERMEMOS_DEFAULT_TIMEZONE |
UTC |
Timezone for metadata |
EVERMEMOS_ENABLE_CONVERSATION_META |
true |
Sync conversation metadata |
Advanced configuration
| Variable | Default | Description |
|---|---|---|
EVERMEMOS_API_VERSION |
v0 |
API version |
EVERMEMOS_LLM_CUSTOM_SETTING_JSON |
— | Custom LLM extraction settings |
EVERMEMOS_USER_DETAILS_JSON |
— | User profile details for conversations |
flush Rules
| Scenario | flush |
|---|---|
| Mid-conversation, more messages coming | false |
| End of session / topic switch / summary | true |
| Uncertain | true (safer) |
Advanced: Memory Lifecycle States
| State | Meaning |
|---|---|
queued |
Write accepted, extraction not yet confirmed |
provisional |
Answer from pending_messages while extraction is in progress |
fallback |
Answer from mirrored conversation-meta, not formal extracted memory |
searchable |
Answer from formal extracted memories |
All 7 tools expose compatible lifecycle blocks so agents always know memory maturity.
Advanced: Forget Safety
Cloud deletion is async and best-effort. evermemos-mcp provides a verification-first workflow:
- Confirm target
memory_idviafetch_historyorrecall - Call
forget(memory_ids=[...], space_id=...) - Verify with
fetch_history - If target persists, the lifecycle model surfaces this transparently
This is deliberate: expose real state to the agent rather than pretend deletion is instant.
Development
uv sync --group dev # Install dev dependencies
uv run ruff check # Lint
uv run pytest # Tests (285 pass)
Documentation
| Document | Description |
|---|---|
docs/02-architecture.md |
Technical architecture |
docs/05-client-integrations.md |
Client setup guides |
docs/auto-memory-prompt.md |
Auto-memory prompt templates |
docs/06-benchmark.md |
Benchmark protocol |
CHANGELOG.md |
Version history |
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file evermemos_mcp-0.5.6.tar.gz.
File metadata
- Download URL: evermemos_mcp-0.5.6.tar.gz
- Upload date:
- Size: 212.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b48847d16c10976ba4b4324c14f71f00f1ba78cce687e11e360f3cb233e1d532
|
|
| MD5 |
07865cf0bcda6da376453f3d5053ba25
|
|
| BLAKE2b-256 |
1510e4fbb393050ca1fd3870f2504558ba86b0cef45d44fed86e0c98d9240e38
|
Provenance
The following attestation bundles were made for evermemos_mcp-0.5.6.tar.gz:
Publisher:
publish.yml on tt-a1i/evermemos-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
evermemos_mcp-0.5.6.tar.gz -
Subject digest:
b48847d16c10976ba4b4324c14f71f00f1ba78cce687e11e360f3cb233e1d532 - Sigstore transparency entry: 1099923501
- Sigstore integration time:
-
Permalink:
tt-a1i/evermemos-mcp@92cdd9403a0a190b477d5eda3a3fc53599da006c -
Branch / Tag:
refs/tags/v0.5.6 - Owner: https://github.com/tt-a1i
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@92cdd9403a0a190b477d5eda3a3fc53599da006c -
Trigger Event:
release
-
Statement type:
File details
Details for the file evermemos_mcp-0.5.6-py3-none-any.whl.
File metadata
- Download URL: evermemos_mcp-0.5.6-py3-none-any.whl
- Upload date:
- Size: 49.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
6654b5c4cb662be0375fa6c70970f0078ec649701b19b06370ee5c877238e9df
|
|
| MD5 |
ccbe9f24fabb3f40d11ec73993c3f61c
|
|
| BLAKE2b-256 |
772e5ae6eb24d9699445d4370a9b561229e129943e90742f2949819298b84746
|
Provenance
The following attestation bundles were made for evermemos_mcp-0.5.6-py3-none-any.whl:
Publisher:
publish.yml on tt-a1i/evermemos-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
evermemos_mcp-0.5.6-py3-none-any.whl -
Subject digest:
6654b5c4cb662be0375fa6c70970f0078ec649701b19b06370ee5c877238e9df - Sigstore transparency entry: 1099923558
- Sigstore integration time:
-
Permalink:
tt-a1i/evermemos-mcp@92cdd9403a0a190b477d5eda3a3fc53599da006c -
Branch / Tag:
refs/tags/v0.5.6 - Owner: https://github.com/tt-a1i
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@92cdd9403a0a190b477d5eda3a3fc53599da006c -
Trigger Event:
release
-
Statement type: