Persistent memory MCP server for AI development workflows — hybrid search, knowledge graph, session lifecycle.
Project description
RAWThink
Your AI assistant forgets everything between sessions. RAWThink fixes that.
Persistent memory for Claude Code — hybrid search, knowledge graph, session lifecycle. Think of it as a second brain that grows with every conversation.
The Problem
Claude Code is stateless. Every session starts from zero. You explain the same context, re-establish the same decisions, re-discover the same insights. Yesterday's breakthrough is today's cold start.
RAWThink gives your AI a memory that persists, decays naturally, and revises itself when you change your mind. It's a 15-tool MCP server that turns Claude Code into a long-term thinking partner.
Quick Start
Prerequisites
- Python 3.10+
- Docker (for Qdrant vector database)
- Ollama (for local embeddings)
- Node.js (for the SessionStart hook)
Install & set up
pip install rawthink-mcp
rawthink-install # creates vault, configures MCP, installs hook
docker run -d --name rawthink-qdrant -p 6333:6333 -v rawthink_qdrant:/qdrant/storage qdrant/qdrant
ollama pull bge-m3 # download embedding model
If you cloned the repo, you can use
docker compose up -dinstead.
Restart Claude Code. Done.
rawthink-install handles everything: vault directory, ~/.claude.json MCP config, SessionStart hook, and starter files (CLAUDE.md, THINKING_DIRECTIVES.md). Run rawthink-install --vault ~/my-vault to customize the vault location.
Manual setup (without rawthink-install)
Add to your ~/.claude.json:
{
"mcpServers": {
"rawthink": {
"command": "rawthink-mcp",
"env": {
"RAWTHINK_VAULT": "/path/to/your/vault"
}
}
}
}
Create vault structure:
mkdir -p vault/{sessions,qnotes,archive/raw,outputs}
Install from source instead of PyPI:
git clone https://github.com/ygtalp/rawthink-mcp.git
cd rawthink-mcp
pip install -e .
Your First Session
Your vault starts empty — that's the point. Open Claude Code and start talking:
You: "We're building a REST API. Let's use PostgreSQL with connection pooling."
Claude: [responds with technical discussion]
You: "/qnote we decided on PostgreSQL + pgBouncer for the API layer"
→ saved to your vault as a searchable quick note
You: "/rtclose"
→ Session exported, entities extracted, handoff written
Next session, Claude loads the handoff automatically and picks up where you left off. After a few sessions, your vault looks like this:
> search_thoughts("how did we handle the database?")
1. [2025-03-15_002] API Architecture Decisions score: 0.74
Decided on PostgreSQL + pgBouncer for connection pooling.
Key insight: pool_mode=transaction for serverless...
2. [2025-03-22_001] Performance Review score: 0.61
Revisited DB strategy — added read replicas for
reporting queries, write primary stays single-node...
> read_graph(summary=True)
Knowledge Graph: 42 entities, 38 relations
decision (5)
- postgres-connection-pooling (3 obs) act=0.97
- api-versioning-strategy (4 obs) act=0.85
- auth-jwt-vs-session (2 obs) act=0.72
concept (8)
- cqrs-pattern (3 obs) act=0.91
- event-sourcing (5 obs) act=0.88
...
What It Does
Three-layer memory
- Hot cache (MEMORY.md) — loaded every session, essential context
- Knowledge graph (JSONL) — entities, relations, temporal observations with belief revision
- Semantic search (Qdrant) — hybrid dense + BM25 across all sessions and qnotes
Session lifecycle
Session start (handoff loads) → Think together → /rtclose →
JSONL export + entity extraction + handoff for next session
Key capabilities
- Hybrid search: BGE-M3 dense embeddings + BM25 sparse vectors, fused with RRF
- Belief revision: observations can be invalidated, superseded, and tracked over time
- Activation decay: unused knowledge fades (~23-day half-life), accessed knowledge stays hot
- Epistemic typing: mark entities as
assertion,hypothesis, orspeculation - Multi-terminal safe: parallel sessions don't overwrite each other's handoffs
- Thinking directives: structured discipline for human-AI thinking partnerships
- Ollama fallback: embedding cache for when Ollama is temporarily unavailable
Architecture
┌─────────────────────────────────────────────────┐
│ Claude Code │
│ (MCP Client) │
└──────────────────┬──────────────────────────────┘
│ MCP (stdio)
┌──────────────────▼──────────────────────────────┐
│ rawthink │
│ (FastMCP Server) │
│ │
│ ┌─────────────┐ ┌──────────┐ ┌────────────┐ │
│ │ Indexer │ │ Graph │ │ Chunker │ │
│ │ (Qdrant + │ │ (JSONL │ │ (Markdown │ │
│ │ Ollama) │ │ KG) │ │ splitter) │ │
│ └──────┬──────┘ └────┬─────┘ └────────────┘ │
│ │ │ │
│ ┌──────▼──────┐ ┌────▼─────┐ │
│ │ Qdrant │ │ memory │ │
│ │ (Docker) │ │ .jsonl │ │
│ └─────────────┘ └──────────┘ │
└─────────────────────────────────────────────────┘
│
┌──────────▼──────────┐
│ Vault │
│ sessions/ qnotes/ │
│ archive/ outputs/ │
└─────────────────────┘
MCP Tools
Search (5 tools)
| Tool | Description |
|---|---|
search_thoughts |
Hybrid semantic + keyword search across vault |
get_session |
Retrieve full session content by ID |
get_related |
Quick related thoughts lookup (session-level) |
store_thought |
Store a new thought as a qnote |
reindex |
Re-index vault content into Qdrant |
Knowledge Graph (10 tools)
| Tool | Description |
|---|---|
search_nodes |
Search entities by name, type, and observations |
create_entities |
Create/merge entities with epistemic typing |
create_relations |
Create relations (controlled vocabulary) |
add_observations |
Add temporal observations to entities |
invalidate_observations |
Mark observations as invalidated (belief revision) |
delete_entities |
Delete entities and their relations |
delete_observations |
Remove specific observations |
delete_relations |
Remove specific relations |
read_graph |
Read full graph with pagination + summary mode |
open_nodes |
Open specific entities with their relations |
How It's Different
RAWThink is not a codebase analyzer or a simple key-value memory. It's a living knowledge system designed for thinking partnerships.
| RAWThink | Basic MCP Memory | Codebase Analyzers | |
|---|---|---|---|
| Search | Hybrid (dense + BM25 + RRF) | Key lookup | Graph query |
| Knowledge | Temporal KG with belief revision | Flat key-value | Static AST graph |
| Memory decay | Activation decay (~23 days) | None | None |
| Session lifecycle | Full (export, extract, handoff) | None | None |
| Epistemic tracking | assertion/hypothesis/speculation | None | extracted/inferred/ambiguous |
| Designed for | Thinking & decision memory | Simple facts | Code structure |
Why JSONL?
Human-readable, git-diffable, zero dependency. At 120 entities the graph loads in <50ms. At 10K entities ~1.4s. The optimization path is clear: mtime cache for repeated reads, lazy writes for activation updates, SQLite migration at 50K+ entities.
Configuration
All settings via environment variables or rawthink_mcp/config.py:
| Setting | Env Var | Default | Description |
|---|---|---|---|
| Vault path | RAWTHINK_VAULT |
../vault |
Path to vault directory |
| Qdrant URL | QDRANT_URL |
http://localhost:6333 |
Qdrant server URL |
| Qdrant path | QDRANT_PATH |
— | Set for embedded Qdrant (no Docker) |
| Ollama URL | OLLAMA_URL |
http://localhost:11434 |
Ollama server URL |
| Embedding model | OLLAMA_MODEL |
bge-m3 |
Ollama embedding model |
| KG file | MEMORY_FILE_PATH |
../vault/memory.jsonl |
Knowledge graph file path |
| Turkish normalization | RAWTHINK_TURKISH_NORMALIZATION |
false |
Set 1 or true to enable |
| Decay rate | — | 0.03 |
Activation decay (~23-day half-life) |
Session Lifecycle
RAWThink includes a session close command (/rtclose) that:
- Exports the current conversation to clean markdown + full transcript + HTML
- Extracts key entities and relations into the knowledge graph
- Writes a project-specific handoff file for the next session
- Updates MEMORY.md with essential context
A SessionStart hook (rawthink-handoff.js) runs when Claude Code starts and loads the most recent handoff + recent qnotes into context automatically.
Session lifecycle currently requires Claude Code. MCP-native session tools are planned for v0.2.
Customization
CLAUDE.md
Edit CLAUDE.md to customize the thinking companion's behavior:
- Role and tone
- Active modes (free-flow, socratic, debate, synthesis, deep-dive, technical, galaxy-brain)
- Epistemic transparency format
- Meta-commands
THINKING_DIRECTIVES.md
Structured discipline for the thinking partnership. Every rule was born from a violation — don't follow mechanically, understand why each exists. Customize to match your workflow.
Obsidian (optional)
Open your vault directory in Obsidian to browse sessions and qnotes visually. The vault uses standard markdown with YAML frontmatter — no plugins needed.
Entity Types
Default entity type is "concept". Common types: concept, decision, rule, insight, question, project.
Relation Types
Canonical types (non-standard accepted with warning):
supports, contradicts, evolved_into, depends_on, exemplifies, part_of, caused_by, enables, supersedes, related_to
Roadmap
- Interactive knowledge graph visualization
- Wiki export (agent-browsable markdown per entity)
.rawthinkignorefor vault indexing control- Graph traversal queries (unexpected connections, multi-hop)
- SQLite backend for 50K+ entities
- MCP-native session tools (no Claude Code dependency)
- Configurable language normalization
Article
Read the full story: Your AI Doesn't Remember You (coming soon)
License
MIT — see LICENSE.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file rawthink_mcp-0.1.0.tar.gz.
File metadata
- Download URL: rawthink_mcp-0.1.0.tar.gz
- Upload date:
- Size: 61.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fa965ec957111a4b66db9261bd1f43248746d46cb9df903d3922b2f07df43646
|
|
| MD5 |
25d29a890917926587e3b3d2c03cf0d2
|
|
| BLAKE2b-256 |
13a43d4de1ea0db1eb71acc1d6443df18501a7a2a2e6777b0840d88b21e6d942
|
File details
Details for the file rawthink_mcp-0.1.0-py3-none-any.whl.
File metadata
- Download URL: rawthink_mcp-0.1.0-py3-none-any.whl
- Upload date:
- Size: 37.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a2f1fe0b5a52aa6a4cf45510307b32ca95d1bd2ba0bb1560c53bf7d592190d6c
|
|
| MD5 |
c67433c9c47ba9befb8e1277e6f1a071
|
|
| BLAKE2b-256 |
f33786d8f542ee5b226ea41a163e0dbcb11ac1c3f8afdf3b336a58cdf25e74f6
|