Self-maintaining personal knowledge database — MCP server with figure-level search, auto-wikilinks, and Ebbinghaus-based memory compression

These details have not been verified by PyPI

Project links

Project description

second-brain MCP Server

A self-maintaining personal knowledge database — powered by MCP, DuckDB, and biological memory models.

For anyone who saves more papers, notes, and figures than they could ever re-read. second-brain turns everything you capture into a database that maintains itself — auto-linking related notes, compressing what you stop reading, and keeping every figure searchable by its content. What you saved a year ago is still one query away, at a fraction of the token cost.

Why Does This Exist?

Problem	Solution
📄 You save dozens of papers but can never find the right figure	`search_figures("UMAP melanocyte")` — returns the exact panel, across every paper you've saved
📑 arXiv gives you the abstract; you need the full paper	Auto-upgrades `/abs/` → `/html/` — fetches the complete paper with all sections, not just the abstract
🗂 Notes pile up; older ones never get cleaned up	Vault Sleep: low-access notes compress automatically every Sunday while you sleep (60–90% token reduction)
🔗 New notes stay isolated; you forget what's connected	Auto-wikilinks: every saved note is automatically linked to semantically related notes already in your vault
🔎 Semantic search needs a cloud API or Docker stack	Self-hosted `nomic-embed-text` via llama-server; BM25 fallback when offline
🔒 Every AI memory tool locks you into their format	Pure Markdown vault — sync with Google Drive, iCloud, or git; switch agents anytime
🖼 Figure context is lost when you read a paper	Every figure is downloaded, OCR'd by Claude Vision, and stored in DuckDB — searchable by gene name, p-value, axis label

The One-Command Demo

save_article("https://arxiv.org/abs/2405.01234")
  ↓
• /abs/ auto-upgraded to /html/ — full paper, not just abstract
• Full text converted to Markdown
• All figures downloaded + OCR'd by Claude Vision
• Semantic embeddings computed
• Auto-linked to related notes already in your vault   ← auto-wikilinks
• Stored in 30-resources/ — queryable immediately

search_figures("UMAP cluster batch correction")
  ↓
• Returns the exact figure from the exact paper
• Works across your entire saved literature library

What Makes It Different

flowchart LR
    subgraph input["📥 Any Content Source"]
        A1["arXiv / PubMed paper"]
        A2["Web article / blog"]
        A3["Local PDF / DOCX"]
        A4["Personal note"]
    end

    subgraph core["⚙️ second-brain-mcp"]
        B1["Markdown note<br/>30-resources/"]
        B2["Figure OCR<br/>+ VLM description"]
        B3["Semantic embedding<br/>+ auto-wikilinks"]
        B4["Ebbinghaus score<br/>ranking"]
        B5["PNG snapshots<br/>60–90% token reduction"]
    end

    subgraph query["🔍 Queryable Knowledge"]
        C1["search_figures<br/>'UMAP melanocyte'"]
        C2["search_notes<br/>'batch correction scRNA'"]
        C3["get_context<br/>top-20 relevant notes"]
    end

    input --> core
    B1 --> B2
    B1 --> B3
    B3 --> B4
    B4 --> B5
    B2 --> C1
    B3 --> C2
    B4 --> C3

Eight things most self-hosted memory tools can't do — combined in one:

Most memory tools…	second-brain
Save a link or PDF, then leave you to read and tag it	🔬 One command builds the database — `save_article` fetches any URL/PDF, converts to Markdown, downloads & OCRs every figure with Claude Vision, then semantic-indexes it
Store the arXiv abstract you pasted	📑 Full text, not abstracts — `/abs/` URLs auto-upgrade to `/html/` for the complete paper: methods, results, discussion
Leave new notes isolated until you tag them	🔗 The knowledge graph builds itself — every note is auto-linked to semantically related notes already in your vault
Cost the same whether a note is read daily or never	🧠 Memory that forgets like a brain — Ebbinghaus score ranks by recency × frequency; stale notes compress while you sleep
Search documents, not what's inside the figures	🖼 Figure-level search across your whole library — `search_figures("p < 0.001")` returns the exact panel from the exact paper
Forget your project decisions between sessions	📋 The AI learns your rules — hot notes auto-extract constraints into `memory/rules.md`, injected at every session start
Grow more expensive as the vault grows	📉 Token cost shrinks with age — PNG snapshots replace old text at 60–90% compression; frequently-read papers stay full-fidelity
Lock you into their database format	🔓 Zero lock-in — pure Markdown, any MCP agent, sync via any cloud drive or git

Cross-Session Continuity — Pick Up Where You Left Off

Every project you work on can be resumed in a new session with full context — no re-explaining, no lost progress.

flowchart LR
    A["🟢 Session Start<br/>get_context()"] --> B["AI receives:<br/>• goals.md — current priorities<br/>• Top-20 recent notes<br/>• Extracted rules"]
    B --> C["Work on project<br/>new_note / search / read"]
    C --> D["🔴 Before ending session<br/>update_goals(...)"]
    D --> E["New session<br/>get_context() again"]
    E --> B

How It Works in Practice

End of session — tell the agent to save state:

Update goals: currently working on the scRNA batch correction pipeline.
Completed: harmony integration. Blocked on: choosing n_components for PCA.
Next session: start from the PCA parameter sweep in 20-areas/research/harmony-notes.md

The agent calls update_goals() and optionally new_note("project", ...) for detailed progress.

Start of next session — just say:

Get context and continue where we left off.

The agent calls get_context() and immediately sees:

goals.md with the state you saved
The harmony-notes.md surfaced at the top (recently accessed, high Ebbinghaus score)
Rules auto-extracted from that note, e.g.:

RULE: use n_components=30 for this dataset — tested 20/30/50, 30 minimises batch effect without losing resolution
RULE: exclude sample CRC_04 — library size outlier confirmed by QC

These rules live in memory/rules.md and are injected at every get_context() call — the AI carries your hard-won decisions forward automatically, without you having to repeat them.

What Gets Persisted

What	Where	Always in context?
Current priorities / blocked items	`memory/goals.md`	✅ every session
Project progress notes	`10-projects/` or `20-areas/`	✅ if recently accessed
Decisions and rationale	`decisions/`	via `get_decisions()`
Extracted rules from notes	`memory/rules.md`	✅ every session
Saved papers and figures	`30-resources/`	via `search_notes/figures`

This works across any project — bioinformatics analysis, coding, writing, research. Save state with one sentence at the end of a session; resume instantly at the start of the next.

Example Queries

# Resume a project from last session
get_context()  # → goals + recent notes + rules loaded automatically

# Find a specific figure panel across all saved papers
search_figures("p < 0.001 UMAP cluster")

# Semantic search across all notes
search_notes("single cell integration batch correction")

# Decision records for a specific project
get_decisions("MyProject")

Memory Architecture — Biological Analogy

Biological Brain	This System
Hippocampal consolidation during sleep	Vault Sleep: weekly LLM-compression of old low-access notes
Ebbinghaus forgetting curve	Score-based ranking: `access_count / ln(age_days)`
Visual long-term memory	PNG snapshots — resolution degrades gracefully with age
Associative recall	Semantic search + auto-generated `[[wikilinks]]`
Sleep-dependent consolidation	launchd cron, runs Sunday 02:00 while you sleep

Token Efficiency

Memory that gets cheaper over time — unlike flat-file systems where old notes cost the same forever.

Note age →   fresh (0–3 mo)   3–6 months     6–12 months    1 year+
             ──────────────   ──────────     ───────────    ───────
token cost:  ██████████████   ██████         ████           ██
             ~1,000 tokens    ~400 tokens    ~256 tokens    ~100 tokens
                              ▼ 60%          ▼ 74%          ▼ 90%

Tier assigned by score × age (adaptive). Frequently-accessed notes stay full-text regardless of age.

Search Performance

Measured on Apple Silicon MacBook (20-rep average, BM25-only mode).

Vault    BM25-only p50          Hybrid BM25+semantic p50
──────   ─────────────────      ────────────────────────
10 n     ████░░░░░   21 ms      ████████████   37 ms
50 n     ██████░░░   25 ms      █████████████  39 ms
100 n    ███████░░   27 ms      ██████████████ 45 ms

Vault Size	BM25 p50	Hybrid p50	Recall@1	Recall@5	MRR
10 notes	21 ms	37 ms	30%	60%	0.42
50 notes	25 ms	39 ms	70%	90%	0.78
100 notes	27 ms	45 ms	70%	80%	0.73

Hybrid mode adds ~18 ms for embedding lookup. Both modes scale sub-linearly with vault size.

Recall figures at this scale (10–100 notes) carry high sample variance — a single ambiguous query shifts Recall@1 by 10%. Treat them as directional, not as benchmarks against large corpora; the takeaway is that hybrid consistently beats BM25-only on relevance for a fixed query set.

System Architecture

┌─────────────────────────────────────────────────────┐
│                    AI Agent Layer                    │
│         Claude Code · Gemini CLI · Any MCP           │
└──────────────────────┬──────────────────────────────┘
                       │ MCP Protocol (19 tools)
┌──────────────────────▼──────────────────────────────┐
│               Layer 2 — MCP Server                   │
│                    server.py                         │
│   get_context · search_notes · save_article · …      │
└──────┬───────────────┬────────────────┬─────────────┘
       │               │                │
┌──────▼──────┐ ┌──────▼──────┐ ┌──────▼──────┐
│  vault_sleep│ │  vault_db   │ │  figures    │
│  compress   │ │  DuckDB FTS │ │  PNG snap   │
│  Phase 3–9  │ │  + semantic │ │  OCR · VLM  │
└──────┬──────┘ └──────┬──────┘ └─────────────┘
       │               │
┌──────▼───────────────▼──────────────────────────────┐
│               Layer 0 — Markdown Vault               │
│   00-inbox · 10-projects · 20-areas · 30-resources   │
│   40-archive · decisions · memory · templates        │
│         (syncs via Google Drive / iCloud / git)      │
└─────────────────────────────────────────────────────┘

Vault Sleep — Auto-compression Flow

Every Sunday 02:00 (launchd, no interaction needed)
        │
        ▼
 sync_index + embeddings
        │
        ▼  age > 90d AND Ebbinghaus score ≤ 0.5
 ┌──────────────────────────────────────┐
 │         Adaptive Tier Selection      │
 │  score > 1.5  →  text  (keep full)  │  ← frequently-read: never compressed
 │  score > 0.8  →  large  ~400 tokens │
 │  score > 0.3  →  base   ~256 tokens │
 │  otherwise    →  small  ~100 tokens │
 └────────────────┬─────────────────────┘
                  │
  Gemini CLI → Claude CLI → naive   (auto-fallback, no LLM required)
                  │
    compressed → vault  /  original → 40-archive/  /  snapshot → .png

MCP Tools (19 total)

Tool	Description
`get_context`	Session start: goals + top-20 Ebbinghaus-ranked notes + auto-rules
`save_article`	Fetch URL/PDF → Markdown + auto-extract figures
`search_notes`	Hybrid BM25 + semantic search across all notes
`search_figures`	Search figure OCR text / VLM descriptions
`extract_figures_for`	Manually trigger figure extraction for a saved article
`read_note`	Read note + record access (updates Ebbinghaus score)
`read_note_as_image`	Return PNG snapshot for token-efficient reading
`new_note`	Create note with correct template and folder by type
`get_decisions`	List ADR decision records, optionally filtered by project
`update_goals`	Update `memory/goals.md`
`sync_index`	Rebuild DuckDB index from vault files
`index_stats`	Show note counts by type
`vault_sleep`	Compress old low-activity notes (dry_run=True by default)
`sleep_status`	Show compression candidates without acting
`snapshot_note_tool`	Render note to PNG at chosen resolution tier
`extract_rules_tool`	Extract L3 rules from frequently-accessed notes
`consolidate_tool`	Merge semantically similar notes into one abstract note
`update_links_tool`	Refresh auto-generated `[[wikilinks]]`
`prune_archive_tool`	Delete archived originals that have a PNG snapshot

Test Results

tests/test_figures.py      19 passed   (OCR, snapshots, VLM)
tests/test_server.py       13 passed   (MCP tools, path safety)
tests/test_vault_db.py     39 passed   (FTS, semantic search, embeddings)
tests/test_vault_sleep.py  44 passed   (compression, consolidation, rules, prune)
────────────────────────────────────────
115 passed in 3.37s

Installation

Prerequisites

Dependency	Required	Notes
Python 3.11+	✅
uv	✅	Package manager
Playwright	✅	PNG snapshot rendering
llama-server	Optional	Semantic search; BM25 fallback if absent
nomic-embed-text-v1.5.Q8_0.gguf	Optional	~300 MB embedding model
Gemini CLI or `ANTHROPIC_API_KEY`	Optional	Better compression quality; naive fallback if absent

Quick Start

Step 1 — Clone and install

git clone https://github.com/ddmanyes/second-brain-mcp
cd second-brain-mcp
uv sync
uv run playwright install chromium

Step 2 — Create your vault

mkdir -p ~/second-brain/{00-inbox,10-projects,20-areas,30-resources,40-archive,decisions,memory,templates}

Step 3 — Configure MCP

cp mcp_config.example.json mcp_config.json
# Edit mcp_config.json — set SECOND_BRAIN_PATH to your vault path

Step 4 — Register with your AI agent

Option A: Claude Code (CLI)

claude mcp add --scope user second-brain \
  --env SECOND_BRAIN_PATH=/path/to/your/vault \
  -- uv run --project /path/to/second-brain-mcp python server.py

Option B: Claude Desktop — add to ~/Library/Application Support/Claude/claude_desktop_config.json:

{
  "mcpServers": {
    "second-brain": {
      "command": "uv",
      "args": ["run", "--project", "/path/to/second-brain-mcp", "python", "server.py"],
      "env": { "SECOND_BRAIN_PATH": "/path/to/your/vault" }
    }
  }
}

Step 5 — Index your vault

In Claude Code or Claude Desktop, tell the agent:

Run sync_index to build the initial index.

Environment Variables

Variable	Default	Description
`SECOND_BRAIN_PATH`	`~/second-brain`	Path to your vault directory
`EMBED_URL`	`http://localhost:11435/v1/embeddings`	Embedding server endpoint
`EMBED_MODEL`	`nomic-embed-text`	Embedding model name
`EMBED_PORT`	`11435`	llama-server port

Auto-start (macOS, optional)

# Embedding server — always on, restarts on crash
cp examples/launchd/com.yourname.llama-embed.plist ~/Library/LaunchAgents/
# Edit paths inside the file, then:
launchctl load ~/Library/LaunchAgents/com.yourname.llama-embed.plist

# Weekly vault maintenance — every Sunday 02:00
cp examples/launchd/com.yourname.vault-sleep.plist ~/Library/LaunchAgents/
launchctl load ~/Library/LaunchAgents/com.yourname.vault-sleep.plist

Troubleshooting

Symptom	Likely cause	Fix
Semantic search silently falls back to BM25	llama-server not running on `EMBED_PORT`	Start the embedding server (see Auto-start); verify with `curl localhost:11435/v1/embeddings`
`read_note_as_image` / snapshots fail	Playwright chromium not installed	`uv run playwright install chromium`
`vault_sleep` never compresses anything	No Gemini CLI / `ANTHROPIC_API_KEY` → naive fallback, or no eligible notes	Install Gemini CLI or export `ANTHROPIC_API_KEY`; remember only notes >90 days old with Ebbinghaus score ≤ 0.5 are candidates (`sleep_status` shows them)
Agent sees no notes / empty results	Index not built	Run `sync_index` once after install (and after bulk file changes)
Notes land in the wrong place	`SECOND_BRAIN_PATH` unset or wrong	Set it in your MCP config `env` block; defaults to `~/second-brain`

Vault Structure

vault/
├── 00-inbox/          # Unprocessed captures — clear daily
├── 10-projects/       # Active projects
├── 20-areas/
│   ├── research/      # Ongoing research domains
│   ├── coding/        # Dev tools and workflows
│   └── consolidated/  # Auto-merged similar notes (Phase 8)
├── 30-resources/      # ← Papers and articles (save_article writes here)
├── 40-archive/        # Compressed originals (auto-managed by vault_sleep)
├── decisions/         # Architecture Decision Records (ADR format)
├── memory/
│   ├── goals.md       # Current priorities — injected at every session start
│   ├── index.md       # Vault map
│   └── rules.md       # Auto-extracted L3 rules — injected at every session start
└── templates/         # Note templates (note, decision, project, research)

Running Tests

uv run pytest tests/ -v
uv run python benchmark.py --quick --markdown   # search latency + accuracy report

References & Acknowledgements

Papers That Directly Inspired This Project

Paper	Where Used
Do Language Models Need Sleep? Offline Recurrence for Improved Online Inference (2026)	Phase 3 Vault Sleep — hippocampal replay as batch memory consolidation
Experience Compression Spectrum: Unifying Memory, Skills, and Rules in LLM Agents (2026)	Phase 9 adaptive tier — score × age dual-axis; addresses the "missing diagonal" in existing systems
DeepSeek-OCR: Contexts Optical Compression (2025)	Phase 4 PNG tiers — image as compressed medium, 10× compression at 97% fidelity
MemOCR: Layout-Aware Visual Memory for Efficient Long-Horizon Reasoning (2026)	Phase 4 vision API — Playwright render → VLM reading pipeline
Active Context Compression: Autonomous Memory Management in LLM Agents (2026)	Phase 3 design comparison — session-level vs. nightly batch consolidation
SimpleMem: Efficient Lifelong Memory for LLM Agents (2026)	Phase 8 consolidation — 3-stage semantic compression, 30× token reduction
Memory for Autonomous LLM Agents: Mechanisms, Evaluation, and Emerging Frontiers (2026)	Architecture positioning — mechanisms, evaluation, and frontiers

Cognitive Science Foundations

Ebbinghaus, H. (1885). Über das Gedächtnis. — forgetting curve; basis for access_count / ln(age_days + 1)
Stickgold, R. (2005). Nature, 437, 1272–1278. — sleep-dependent memory consolidation

Built With

MarkItDown · DuckDB · llama.cpp · nomic-embed-text · FastMCP · Playwright · Anthropic Claude API

Contributing

PRs and Issues welcome. Please open an issue first to discuss significant changes.

License

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.1.0

May 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcp_second_brain-0.1.0.tar.gz (173.1 kB view details)

Uploaded May 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

mcp_second_brain-0.1.0-py3-none-any.whl (201.0 kB view details)

Uploaded May 29, 2026 Python 3

File details

Details for the file mcp_second_brain-0.1.0.tar.gz.

File metadata

Download URL: mcp_second_brain-0.1.0.tar.gz
Upload date: May 29, 2026
Size: 173.1 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for mcp_second_brain-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`1599f776d96d3e3a7f8b09921436c5133db5e317de77e3fd611dd18816e5bd57`
MD5	`6a3e3341402c0ea522861544b0978484`
BLAKE2b-256	`5975ce7df754cd1798a16b1bc42c80c94104a7d0b36b63a429176bf2e3a141d7`

See more details on using hashes here.

File details

Details for the file mcp_second_brain-0.1.0-py3-none-any.whl.

File metadata

Download URL: mcp_second_brain-0.1.0-py3-none-any.whl
Upload date: May 29, 2026
Size: 201.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for mcp_second_brain-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f8140e0b308b1f15650d7f93588eac33a30f3025e17b8c08f9a631a9ffb17241`
MD5	`d43cadef666a8a03349b5f88f565aa00`
BLAKE2b-256	`5c0e93b4a2d604da33b05aa0589c629a40facb466e4b2c4221698604e7c79fbf`

See more details on using hashes here.

mcp-second-brain 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

second-brain MCP Server

Why Does This Exist?

The One-Command Demo

What Makes It Different

Cross-Session Continuity — Pick Up Where You Left Off

How It Works in Practice

What Gets Persisted

Example Queries

Memory Architecture — Biological Analogy

Token Efficiency

Search Performance

System Architecture

Vault Sleep — Auto-compression Flow

MCP Tools (19 total)

Test Results

Installation

Prerequisites

Quick Start

Step 1 — Clone and install

Step 2 — Create your vault

Step 3 — Configure MCP

Step 4 — Register with your AI agent

Step 5 — Index your vault

Environment Variables

Auto-start (macOS, optional)

Troubleshooting

Vault Structure

Running Tests

References & Acknowledgements

Papers That Directly Inspired This Project

Cognitive Science Foundations

Built With

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes