MCP Server that brings long-term memory to AI coding tools via EverMemOS

evermemos-mcp

Universal long-term memory layer for AI coding assistants, powered by EverMemOS.

Built for the Memory Genesis Competition 2026 — Track 2: Platform Plugins

evermemos-mcp is an MCP (Model Context Protocol) server that gives any compatible AI client — Claude Code, Cursor, Cline, Cherry Studio, and more — persistent, cross-session memory. It bridges the gap between stateless AI conversations and the contextual awareness that real-world workflows demand.

Why This Exists

AI coding assistants forget everything between sessions. You explain your architecture, your preferences, your project context — and next session, it's all gone. evermemos-mcp solves this by providing a Memory → Reasoning → Action loop:

  1. Remember — Store decisions, preferences, and context as you work
  2. Recall — Retrieve relevant memories using hybrid search (keyword + vector + semantic)
  3. Brief — Get a full context restoration at the start of any new session

All memories are organized into isolated spaces (e.g. coding:my-app, study:ml-notes, chat:daily), so different projects and workflows never bleed into each other.
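The `<domain>:<slug>` convention above can be sketched with a small parser. This is a hypothetical helper for illustration; the server's actual validation rules may differ.

```python
import re

# Illustrative pattern for the <domain>:<slug> space_id convention.
SPACE_ID_PATTERN = re.compile(
    r"^(?P<domain>[a-z][a-z0-9-]*):(?P<slug>[a-z0-9][a-z0-9._-]*)$"
)

def parse_space_id(space_id: str):
    """Split a space_id like 'coding:my-app' into (domain, slug)."""
    match = SPACE_ID_PATTERN.match(space_id)
    if match is None:
        raise ValueError(f"invalid space_id: {space_id!r}")
    return match.group("domain"), match.group("slug")

print(parse_space_id("coding:my-app"))   # ('coding', 'my-app')
print(parse_space_id("study:ml-notes"))  # ('study', 'ml-notes')
```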

Demo

Final demo video will be added after the last recording pass.

Features

| Tool | Description |
| --- | --- |
| `list_spaces` | Discover available memory spaces |
| `remember` | Store information into long-term memory (async extraction) |
| `request_status` | Check whether a queued write is still pending or already processed |
| `recall` | Search memories with 6 retrieval strategies (keyword, hybrid, vector, rrf, agentic, auto) |
| `briefing` | Get a structured context briefing: profile + episodes + facts + foresights |
| `forget` | Attempt targeted memory deletion by ID (Cloud behavior may vary) |
| `fetch_history` | Paginate through the memory timeline by type |
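As a sketch of how a client might fill in `recall` arguments, here is a hypothetical payload; the field names follow the table above, but the exact schema is defined by the server.

```python
# Hypothetical recall arguments for illustration only.
VALID_STRATEGIES = {"keyword", "hybrid", "vector", "rrf", "agentic", "auto"}

recall_args = {
    "space_ids": ["coding:my-saas"],          # up to 10 spaces per call
    "query": "what database did we choose and why?",
    "strategy": "auto",                        # one of VALID_STRATEGIES
    "limit": 5,
}

assert recall_args["strategy"] in VALID_STRATEGIES
assert len(recall_args["space_ids"]) <= 10
```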

Key Capabilities

  • Space isolation — `space_id` (`<domain>:<slug>`) keeps memories separated by project or topic
  • Multi-space search — Query up to 10 spaces in a single recall call with automatic source attribution
  • Traceable citations — Every result includes memory_type, snippet, timestamp, score, and optional source_message_id
  • Multi-user support — Optional user_id filtering for shared spaces
  • Conversation metadata sync — Automatic conversation-meta integration with EverMemOS Cloud
  • Async-friendly identity fallback — chat identity and preferences are mirrored (best effort) into conversation-meta, and can be surfaced as explicit fallback results when extracted search results are not yet available
  • Robust error handling — Retry with backoff (429 / 5xx), GET body fallback for proxy/WAF compatibility, and structured error codes
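The retry-with-backoff behavior described above can be sketched as follows. This is an illustrative outline, not the server's actual client code; `send` is a stand-in for any callable that performs one HTTP request.

```python
import random
import time

RETRYABLE = {429, 500, 502, 503, 504}

def request_with_backoff(send, max_attempts=4, base_delay=0.5):
    """Retry a request on 429/5xx with exponential backoff and jitter.

    `send` is any callable returning an object with a `status` attribute.
    """
    for attempt in range(max_attempts):
        response = send()
        if response.status not in RETRYABLE:
            return response
        if attempt < max_attempts - 1:
            # Double the delay each attempt; add jitter to avoid thundering herds.
            delay = base_delay * (2 ** attempt) + random.uniform(0, 0.1)
            time.sleep(delay)
    return response  # last response, still an error after all attempts
```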

Quick Start

Get your API key from EverMemOS Cloud.

Option A: Install from PyPI (recommended)

No clone needed — just add to your MCP client config:

{
  "mcpServers": {
    "evermemos-mcp": {
      "type": "stdio",
      "command": "uvx",
      "args": ["evermemos-mcp@latest"],
      "env": {
        "EVERMEMOS_API_KEY": "your-key-here",
        "EVERMEMOS_USER_ID": "mcp-user"
      }
    }
  }
}

Or run directly from the command line:

uvx evermemos-mcp@latest

If your client still launches an older cached build after a release, refresh it with:

uv cache clean evermemos-mcp

Option B: Install from source

git clone https://github.com/tt-a1i/evermemos-mcp.git
cd evermemos-mcp
cp .env.example .env
# Edit .env and set your EVERMEMOS_API_KEY
uv run evermemos-mcp

MCP client config for source installs:

{
  "mcpServers": {
    "evermemos-mcp": {
      "type": "stdio",
      "command": "uv",
      "args": [
        "run",
        "--directory",
        "/absolute/path/to/evermemos-mcp",
        "evermemos-mcp"
      ],
      "env": {
        "EVERMEMOS_API_KEY": "your-key-here",
        "EVERMEMOS_USER_ID": "mcp-user"
      }
    }
  }
}

Client-specific setup guides (Claude Code, Cursor, Cline, Cherry Studio) are in docs/05-client-integrations.md.

Architecture

MCP Client (Claude Code / Cursor / Cline / Cherry Studio)
        │
        │  MCP stdio
        ▼
┌─────────────────────────────┐
│     evermemos-mcp server    │
│  ┌───────────────────────┐  │
│  │   7 Tool Handlers     │  │
│  └──────────┬────────────┘  │
│  ┌──────────▼────────────┐  │
│  │   Memory Service      │  │  remember / request_status / recall / briefing / forget / fetch_history
│  └──────────┬────────────┘  │
│  ┌──────────▼────────────┐  │
│  │ Space Catalog Service │  │  space registry, metadata sync, cross-session recovery
│  └──────────┬────────────┘  │
│  ┌──────────▼────────────┐  │
│  │  EverMemOS HTTP Client│  │  auth, retries, rate-limit backoff, error normalization
│  └──────────┬────────────┘  │
└─────────────┼───────────────┘
              │  HTTPS
              ▼
       EverMemOS Cloud API
  • Cloud-first — All memories live in EverMemOS Cloud. No local persistence, no state to lose.
  • Process-local cache — Space catalog is cached in-memory for fast lookups, recovered from Cloud on startup.
  • Async extraction — `remember` queues content for AI-powered extraction; memories become searchable after processing.
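Because extraction is asynchronous, a client that needs to confirm a write can poll `request_status` until the queued item is processed. A minimal polling sketch, assuming a `check_status` callable that returns a simple status string (the real tool's response shape may differ):

```python
import time

def wait_until_processed(check_status, request_id, timeout=30.0, interval=1.0):
    """Poll a request_status-style callable until a queued write is processed.

    Returns True once processed, False if the timeout expires first.
    """
    deadline = time.monotonic() + timeout
    while time.monotonic() < deadline:
        if check_status(request_id) == "processed":
            return True
        time.sleep(interval)
    return False
```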

Use Cases

Coding: Persistent Architecture Context

You: remember that we chose PostgreSQL over MongoDB because our data is highly relational
     [space_id: coding:my-saas]

-- next day, new session --

You: what database did we choose and why?
     → recall finds: "Chose PostgreSQL over MongoDB — highly relational data model"

Study: Cross-Session Learning Notes

You: remember: bias-variance tradeoff — high bias = underfitting, high variance = overfitting
     [space_id: study:ml-notes]

-- later --

You: briefing for study:ml-notes
     → Returns: profile (technical skills), recent episodes, key facts, foresights

Chat: Personal Preferences

You: remember I prefer dark mode, vim keybindings, and concise responses
     [space_id: chat:preferences]

-- any future session --

You: recall my UI preferences
     → "Prefers dark mode, vim keybindings, concise responses"

Configuration

| Variable | Default | Description |
| --- | --- | --- |
| `EVERMEMOS_API_KEY` | (required) | EverMemOS Cloud API key |
| `EVERMEMOS_USER_ID` | `mcp-user` | Default user identity |
| `EVERMEMOS_BASE_URL` | `https://api.evermind.ai` | API endpoint |
| `EVERMEMOS_API_VERSION` | `v0` | API version |
| `EVERMEMOS_ENABLE_CONVERSATION_META` | `true` | Sync conversation metadata |
| `EVERMEMOS_DEFAULT_TIMEZONE` | `UTC` | Timezone for metadata |
| `EVERMEMOS_DEFAULT_SPACE` | (auto) | Default `space_id`. If unset, auto-detected from the git remote as `coding:<repo-name>` |
| `EVERMEMOS_LLM_CUSTOM_SETTING_JSON` | (unset) | Custom LLM extraction settings |
| `EVERMEMOS_USER_DETAILS_JSON` | (unset) | User profile details for conversations |

Space Auto-Detection

When space_id is omitted from remember or recall, the server automatically infers a default from:

  1. EVERMEMOS_DEFAULT_SPACE environment variable (if set)
  2. Git remote origin URL → coding:<repo-name> (e.g. coding:my-saas)

This means inside a git project, you can simply call remember without specifying a space — memories are automatically routed to the right place.
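The remote-URL-to-space mapping can be sketched like this. It is illustrative only; the server's actual detection logic may normalize names differently.

```python
def space_from_git_remote(remote_url: str) -> str:
    """Derive a default space_id (coding:<repo-name>) from a git remote URL.

    Handles both HTTPS (https://host/org/repo.git) and SSH
    (git@host:org/repo.git) remote forms.
    """
    tail = remote_url.rstrip("/").rsplit("/", 1)[-1]
    tail = tail.rsplit(":", 1)[-1]  # SSH remotes without a path slash
    repo = tail.removesuffix(".git")
    return f"coding:{repo}"

print(space_from_git_remote("https://github.com/tt-a1i/evermemos-mcp.git"))
# coding:evermemos-mcp
```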

flush Boundary Rules

flush controls when EverMemOS triggers memory extraction:

| Scenario | `flush` |
| --- | --- |
| Mid-conversation, more messages coming | `false` |
| End of session / topic switch / summary | `true` |
| Uncertain | `true` (safer default) |
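The boundary rules above reduce to a one-line decision. A hypothetical helper (not server code), where `None` models the "uncertain" case:

```python
def choose_flush(end_of_session):
    """Pick a flush value per the boundary rules: True at session end,
    False mid-conversation, and True (safer default) when uncertain (None)."""
    if end_of_session is None:
        return True
    return bool(end_of_session)
```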

Development

uv sync --group dev       # Install dev dependencies
uv run ruff check         # Lint
uv run pytest             # Unit tests
EVERMEMOS_RUN_INTEGRATION_TESTS=true uv run pytest -m integration  # Integration tests

CI runs on every push and PR via .github/workflows/ci.yml.

Documentation

| Document | Description |
| --- | --- |
| docs/01-requirements.md | Product requirements |
| docs/02-architecture.md | Technical architecture |
| docs/03-demo-playbook.md | Demo walkthrough |
| docs/04-submission.md | Submission checklist |
| docs/05-client-integrations.md | Client setup guides |
| docs/06-benchmark.md | Benchmark protocol and acceptance gates |
| docs/07-release-checklist.md | Release readiness checklist |
| docs/competition/benchmark_deep_dive.md | Primary evidence deep dive |
| docs/auto-memory-prompt.md | Auto-memory prompt templates for CLAUDE.md / Cursor / Cline |
| CHANGELOG.md | Version history |

License

See LICENSE.
