Persistent memory MCP server for AI development workflows — hybrid search, knowledge graph, session lifecycle.

These details have not been verified by PyPI

Project links

Project description

RAWThink

Your AI assistant forgets everything between sessions. RAWThink fixes that.

Persistent memory for Claude Code — hybrid search, knowledge graph, session lifecycle. Think of it as a second brain that grows with every conversation.

The Problem

Claude Code is stateless. Every session starts from zero. You explain the same context, re-establish the same decisions, re-discover the same insights. Yesterday's breakthrough is today's cold start.

RAWThink gives your AI a memory that persists, decays naturally, and revises itself when you change your mind. It's a 15-tool MCP server that turns Claude Code into a long-term thinking partner.

Quick Start

Prerequisites

Python 3.10+
Docker (for Qdrant vector database)
Ollama (for local embeddings)
Node.js (for the SessionStart hook)

Install & set up

pip install rawthink-mcp
rawthink-install              # creates vault, configures MCP, installs hook
docker run -d --name rawthink-qdrant -p 6333:6333 -v rawthink_qdrant:/qdrant/storage qdrant/qdrant
ollama pull bge-m3            # download embedding model

If you cloned the repo, you can use docker compose up -d instead.

Restart Claude Code. Done.

rawthink-install handles everything: vault directory, ~/.claude.json MCP config, SessionStart hook, and starter files (CLAUDE.md, THINKING_DIRECTIVES.md). Run rawthink-install --vault ~/my-vault to customize the vault location.

Manual setup (without rawthink-install)

Add to your ~/.claude.json:

{
  "mcpServers": {
    "rawthink": {
      "command": "rawthink-mcp",
      "env": {
        "RAWTHINK_VAULT": "/path/to/your/vault"
      }
    }
  }
}

Create vault structure:

mkdir -p vault/{sessions,qnotes,archive/raw,outputs}

Install from source instead of PyPI:

git clone https://github.com/ygtalp/rawthink-mcp.git
cd rawthink-mcp
pip install -e .

Your First Session

Your vault starts empty — that's the point. Open Claude Code and start talking:

You:   "We're building a REST API. Let's use PostgreSQL with connection pooling."
Claude: [responds with technical discussion]

You:   "/qnote we decided on PostgreSQL + pgBouncer for the API layer"
       → saved to your vault as a searchable quick note

You:   "/rtclose"
       → Session exported, entities extracted, handoff written

Next session, Claude loads the handoff automatically and picks up where you left off. After a few sessions, your vault looks like this:

> search_thoughts("how did we handle the database?")

1. [2025-03-15_002] API Architecture Decisions        score: 0.74
   Decided on PostgreSQL + pgBouncer for connection pooling.
   Key insight: pool_mode=transaction for serverless...

2. [2025-03-22_001] Performance Review                 score: 0.61
   Revisited DB strategy — added read replicas for
   reporting queries, write primary stays single-node...

> read_graph(summary=True)

Knowledge Graph: 42 entities, 38 relations

  decision (5)
  - postgres-connection-pooling (3 obs) act=0.97
  - api-versioning-strategy    (4 obs) act=0.85
  - auth-jwt-vs-session        (2 obs) act=0.72
  concept (8)
  - cqrs-pattern               (3 obs) act=0.91
  - event-sourcing             (5 obs) act=0.88
  ...

What It Does

Three-layer memory

Hot cache (MEMORY.md) — loaded every session, essential context
Knowledge graph (JSONL) — entities, relations, temporal observations with belief revision
Semantic search (Qdrant) — hybrid dense + BM25 across all sessions and qnotes

Session lifecycle

Session start (handoff loads) → Think together → /rtclose →
JSONL export + entity extraction + handoff for next session

Key capabilities

Hybrid search: BGE-M3 dense embeddings + BM25 sparse vectors, fused with RRF
Belief revision: observations can be invalidated, superseded, and tracked over time
Activation decay: unused knowledge fades (~23-day half-life), accessed knowledge stays hot
Epistemic typing: mark entities as assertion, hypothesis, or speculation
Multi-terminal safe: parallel sessions don't overwrite each other's handoffs
Thinking directives: structured discipline for human-AI thinking partnerships
Ollama fallback: embedding cache for when Ollama is temporarily unavailable

Architecture

┌─────────────────────────────────────────────────┐
│                  Claude Code                    │
│              (MCP Client)                       │
└──────────────────┬──────────────────────────────┘
                   │ MCP (stdio)
┌──────────────────▼──────────────────────────────┐
│                rawthink                         │
│           (FastMCP Server)                      │
│                                                 │
│  ┌─────────────┐  ┌──────────┐  ┌────────────┐  │
│  │   Indexer   │  │  Graph   │  │  Chunker   │  │
│  │ (Qdrant +   │  │ (JSONL   │  │ (Markdown  │  │
│  │  Ollama)    │  │  KG)     │  │  splitter) │  │
│  └──────┬──────┘  └────┬─────┘  └────────────┘  │
│         │              │                        │
│  ┌──────▼──────┐  ┌────▼─────┐                  │
│  │   Qdrant    │  │ memory   │                  │
│  │  (Docker)   │  │ .jsonl   │                  │
│  └─────────────┘  └──────────┘                  │
└─────────────────────────────────────────────────┘
                   │
        ┌──────────▼──────────┐
        │       Vault         │
        │  sessions/ qnotes/  │
        │  archive/  outputs/ │
        └─────────────────────┘

MCP Tools

Search (5 tools)

Tool	Description
`search_thoughts`	Hybrid semantic + keyword search across vault
`get_session`	Retrieve full session content by ID
`get_related`	Quick related thoughts lookup (session-level)
`store_thought`	Store a new thought as a qnote
`reindex`	Re-index vault content into Qdrant

Knowledge Graph (10 tools)

Tool	Description
`search_nodes`	Search entities by name, type, and observations
`create_entities`	Create/merge entities with epistemic typing
`create_relations`	Create relations (controlled vocabulary)
`add_observations`	Add temporal observations to entities
`invalidate_observations`	Mark observations as invalidated (belief revision)
`delete_entities`	Delete entities and their relations
`delete_observations`	Remove specific observations
`delete_relations`	Remove specific relations
`read_graph`	Read full graph with pagination + summary mode
`open_nodes`	Open specific entities with their relations

How It's Different

RAWThink is not a codebase analyzer or a simple key-value memory. It's a living knowledge system designed for thinking partnerships.

	RAWThink	Basic MCP Memory	Codebase Analyzers
Search	Hybrid (dense + BM25 + RRF)	Key lookup	Graph query
Knowledge	Temporal KG with belief revision	Flat key-value	Static AST graph
Memory decay	Activation decay (~23 days)	None	None
Session lifecycle	Full (export, extract, handoff)	None	None
Epistemic tracking	assertion/hypothesis/speculation	None	extracted/inferred/ambiguous
Designed for	Thinking & decision memory	Simple facts	Code structure

Why JSONL?

Human-readable, git-diffable, zero dependency. At 120 entities the graph loads in <50ms. At 10K entities ~1.4s. The optimization path is clear: mtime cache for repeated reads, lazy writes for activation updates, SQLite migration at 50K+ entities.

Configuration

All settings via environment variables or rawthink_mcp/config.py:

Setting	Env Var	Default	Description
Vault path	`RAWTHINK_VAULT`	`../vault`	Path to vault directory
Qdrant URL	`QDRANT_URL`	`http://localhost:6333`	Qdrant server URL
Qdrant path	`QDRANT_PATH`	—	Set for embedded Qdrant (no Docker)
Ollama URL	`OLLAMA_URL`	`http://localhost:11434`	Ollama server URL
Embedding model	`OLLAMA_MODEL`	`bge-m3`	Ollama embedding model
KG file	`MEMORY_FILE_PATH`	`../vault/memory.jsonl`	Knowledge graph file path
Turkish normalization	`RAWTHINK_TURKISH_NORMALIZATION`	`false`	Set `1` or `true` to enable
Decay rate	—	`0.03`	Activation decay (~23-day half-life)

Session Lifecycle

RAWThink includes a session close command (/rtclose) that:

Exports the current conversation to clean markdown + full transcript + HTML
Extracts key entities and relations into the knowledge graph
Writes a project-specific handoff file for the next session
Updates MEMORY.md with essential context

A SessionStart hook (rawthink-handoff.js) runs when Claude Code starts and loads the most recent handoff + recent qnotes into context automatically.

Session lifecycle currently requires Claude Code. MCP-native session tools are planned for v0.2.

Customization

CLAUDE.md

Edit CLAUDE.md to customize the thinking companion's behavior:

Role and tone
Active modes (free-flow, socratic, debate, synthesis, deep-dive, technical, galaxy-brain)
Epistemic transparency format
Meta-commands

THINKING_DIRECTIVES.md

Structured discipline for the thinking partnership. Every rule was born from a violation — don't follow mechanically, understand why each exists. Customize to match your workflow.

Obsidian (optional)

Open your vault directory in Obsidian to browse sessions and qnotes visually. The vault uses standard markdown with YAML frontmatter — no plugins needed.

Entity Types

Default entity type is "concept". Common types: concept, decision, rule, insight, question, project.

Relation Types

Canonical types (non-standard accepted with warning): supports, contradicts, evolved_into, depends_on, exemplifies, part_of, caused_by, enables, supersedes, related_to

Roadmap

Interactive knowledge graph visualization
Wiki export (agent-browsable markdown per entity)
.rawthinkignore for vault indexing control
Graph traversal queries (unexpected connections, multi-hop)
SQLite backend for 50K+ entities
MCP-native session tools (no Claude Code dependency)
Configurable language normalization

Article

Read the full story: Your AI Doesn't Remember You (coming soon)

License

MIT — see LICENSE.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.4

Apr 20, 2026

0.1.3

Apr 20, 2026

0.1.2

Apr 14, 2026

0.1.1

Apr 14, 2026

This version

0.1.0

Apr 14, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rawthink_mcp-0.1.0.tar.gz (61.3 kB view details)

Uploaded Apr 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

rawthink_mcp-0.1.0-py3-none-any.whl (37.7 kB view details)

Uploaded Apr 14, 2026 Python 3

File details

Details for the file rawthink_mcp-0.1.0.tar.gz.

File metadata

Download URL: rawthink_mcp-0.1.0.tar.gz
Upload date: Apr 14, 2026
Size: 61.3 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for rawthink_mcp-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`fa965ec957111a4b66db9261bd1f43248746d46cb9df903d3922b2f07df43646`
MD5	`25d29a890917926587e3b3d2c03cf0d2`
BLAKE2b-256	`13a43d4de1ea0db1eb71acc1d6443df18501a7a2a2e6777b0840d88b21e6d942`

See more details on using hashes here.

File details

Details for the file rawthink_mcp-0.1.0-py3-none-any.whl.

File metadata

Download URL: rawthink_mcp-0.1.0-py3-none-any.whl
Upload date: Apr 14, 2026
Size: 37.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.10

File hashes

Hashes for rawthink_mcp-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a2f1fe0b5a52aa6a4cf45510307b32ca95d1bd2ba0bb1560c53bf7d592190d6c`
MD5	`c67433c9c47ba9befb8e1277e6f1a071`
BLAKE2b-256	`f33786d8f542ee5b226ea41a163e0dbcb11ac1c3f8afdf3b336a58cdf25e74f6`

See more details on using hashes here.

rawthink-mcp 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

RAWThink

The Problem

Quick Start

Prerequisites

Install & set up

Your First Session

What It Does

Three-layer memory

Session lifecycle

Key capabilities

Architecture

MCP Tools

Search (5 tools)

Knowledge Graph (10 tools)

How It's Different

Why JSONL?

Configuration

Session Lifecycle

Customization

CLAUDE.md

THINKING_DIRECTIVES.md

Obsidian (optional)

Entity Types

Relation Types

Roadmap

Article

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes