Semantic recommendation engine for Obsidian vaults — embeddings + wiki-link graph boosting with MCP server for Claude Code integration.
Project description
vault-recommender
Semantic recommendation engine for Obsidian vaults. Uses sentence-transformer embeddings + wiki-link graph boosting to surface related notes, forgotten knowledge, and missing connections.
Designed as a tool for LLMs — returns context-rich results with explanations, not just ranked paths.
How it works
Your vault (markdown files)
│
Parser ─── extracts frontmatter, body, wiki-links
│
Indexer ─── embeds each note as a 384-dim vector (all-MiniLM-L6-v2)
│
Link Graph ─── builds bidirectional wiki-link adjacency
│
Recommender ─── cosine similarity + graph boost + staleness boost
│
Ranked results with reasons
Three scoring signals:
- Semantic similarity — cosine distance between note embeddings. Catches meaning, not just keywords.
- Link graph boost — notes connected through wiki-links get a bump. 2-hop neighbors (connected through a shared link) surface "bridge" connections.
- Staleness boost — notes untouched for 30+ days get a small boost. Surfaces forgotten-but-relevant knowledge.
Installation
# From PyPI
uv tool install vault-recommender
# Or from source
git clone https://github.com/JoshuaOliphant/vault-recommender.git
cd vault-recommender
uv sync
Usage
CLI
# Build the index (run once, re-run when vault changes significantly)
vault-recommender --vault /path/to/vault index
# Recommend by topic
vault-recommender --vault /path/to/vault recommend --topic "career transition strategies"
# Recommend notes similar to a specific note
vault-recommender --vault /path/to/vault recommend --note "areas/career/plan.md"
# Find missing connections (similar but not linked)
vault-recommender --vault /path/to/vault recommend --note "areas/career/plan.md" --exclude-linked
# Auto-rebuild stale index before querying
vault-recommender --vault /path/to/vault recommend --topic "python testing" --rebuild
# JSON output (for LLM consumption)
vault-recommender --vault /path/to/vault recommend --topic "python testing" --json
The --rebuild flag checks whether any vault file is newer than the index. If so, it rebuilds automatically before querying. If the index is fresh, it skips silently.
HTTP Server (for hooks and fast queries)
The CLI cold-starts the embedding model on topic queries (~13s). For latency-sensitive use cases like Claude Code hooks, run the HTTP server instead:
# Start the server (loads index once, then serves fast queries)
vault-recommender --vault /path/to/vault serve
# Custom host/port
vault-recommender --vault /path/to/vault serve --host 0.0.0.0 --port 8000
Endpoints:
# Health check
curl localhost:7532/health
# Recommend by topic
curl "localhost:7532/recommend?topic=career+transition&top_k=5"
# Recommend by note
curl "localhost:7532/recommend?note=areas/career/plan.md&top_k=3"
# Find missing connections
curl "localhost:7532/recommend?note=areas/career/plan.md&exclude_linked=true"
# Hot-reload index after re-indexing via CLI
curl -X POST localhost:7532/reload
MCP Server (Claude Code integration)
Add to your .mcp.json:
{
"mcpServers": {
"vault-recommender": {
"type": "stdio",
"command": "uv",
"args": [
"run",
"--directory",
"/path/to/vault-recommender",
"python",
"-m",
"vault_recommender.mcp_server"
],
"env": {
"VAULT_PATH": "/path/to/your/vault"
}
}
}
}
This exposes four tools:
recommend_by_topic— open-ended semantic searchrecommend_by_note— "notes like this one"find_missing_connections— similar but unlinked notesreload_index— force-reload the index after re-indexing via CLI
Python API
from pathlib import Path
from vault_recommender.recommender import create_recommender
vault = Path("/path/to/vault")
index_dir = Path(".vault-recommender-index")
rec = create_recommender(vault, index_dir)
# By topic
results = rec.similar_to_topic("career transition")
# By note
results = rec.similar_to_note("areas/career/plan.md")
# Each result has: path, title, score, snippet, tags, reason
for r in results:
print(f"{r.score:.3f} {r.title} — {r.reason}")
Performance
- ~1,500 notes indexed in ~5 seconds (M-series Mac)
- Queries return in <1 second (after model warm-up)
- Index persists as numpy + JSON (~2MB for 1,500 notes)
- Model:
all-MiniLM-L6-v2(~80MB, runs on CPU) --helpresponds instantly (heavy imports deferred until needed)
Requirements
- Python 3.12+
- An Obsidian vault (or any directory of markdown files with
[[wiki-links]])
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vault_recommender-0.3.0.tar.gz.
File metadata
- Download URL: vault_recommender-0.3.0.tar.gz
- Upload date:
- Size: 111.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cd8dab833c62b98a875b6000131e1a6d6e1265e04e0133d50aed0ca2cf31532e
|
|
| MD5 |
b118fea299851d637561b5f0363d303e
|
|
| BLAKE2b-256 |
56e93270079ce39548a88b1034938b23c0b8bc4968a799d32e03eaf1db4c8c8b
|
Provenance
The following attestation bundles were made for vault_recommender-0.3.0.tar.gz:
Publisher:
publish.yml on JoshuaOliphant/vault-recommender
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
vault_recommender-0.3.0.tar.gz -
Subject digest:
cd8dab833c62b98a875b6000131e1a6d6e1265e04e0133d50aed0ca2cf31532e - Sigstore transparency entry: 1238475049
- Sigstore integration time:
-
Permalink:
JoshuaOliphant/vault-recommender@b26d99242353ebeb05cdbb8cd96f3e65bc4eaaab -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/JoshuaOliphant
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@b26d99242353ebeb05cdbb8cd96f3e65bc4eaaab -
Trigger Event:
release
-
Statement type:
File details
Details for the file vault_recommender-0.3.0-py3-none-any.whl.
File metadata
- Download URL: vault_recommender-0.3.0-py3-none-any.whl
- Upload date:
- Size: 17.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1c6a63fac1d1b441169b1577ccb3a6f8cedc12d1a5b6c55d1acf9f959539abc9
|
|
| MD5 |
15caf6ac81dba4fe0806c102367cb1a8
|
|
| BLAKE2b-256 |
fe39a5fcec6d0a7aba7b075a8a38adac5cbfa90612b186cc0939efe716b2ecb5
|
Provenance
The following attestation bundles were made for vault_recommender-0.3.0-py3-none-any.whl:
Publisher:
publish.yml on JoshuaOliphant/vault-recommender
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
vault_recommender-0.3.0-py3-none-any.whl -
Subject digest:
1c6a63fac1d1b441169b1577ccb3a6f8cedc12d1a5b6c55d1acf9f959539abc9 - Sigstore transparency entry: 1238475073
- Sigstore integration time:
-
Permalink:
JoshuaOliphant/vault-recommender@b26d99242353ebeb05cdbb8cd96f3e65bc4eaaab -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/JoshuaOliphant
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@b26d99242353ebeb05cdbb8cd96f3e65bc4eaaab -
Trigger Event:
release
-
Statement type: