Semantic recommendation engine for Obsidian vaults — embeddings + wiki-link graph boosting with MCP server for Claude Code integration.
Project description
vault-recommender
Semantic recommendation engine for Obsidian vaults. Uses sentence-transformer embeddings + wiki-link graph boosting to surface related notes, forgotten knowledge, and missing connections.
Designed as a tool for LLMs — returns context-rich results with explanations, not just ranked paths.
How it works
Your vault (markdown files)
│
Parser ─── extracts frontmatter, body, wiki-links
│
Indexer ─── embeds each note as a 384-dim vector (all-MiniLM-L6-v2)
│
Link Graph ─── builds bidirectional wiki-link adjacency
│
Recommender ─── cosine similarity + graph boost + staleness boost
│
Ranked results with reasons
Three scoring signals:
- Semantic similarity — cosine distance between note embeddings. Catches meaning, not just keywords.
- Link graph boost — notes connected through wiki-links get a bump. 2-hop neighbors (connected through a shared link) surface "bridge" connections.
- Staleness boost — notes untouched for 30+ days get a small boost. Surfaces forgotten-but-relevant knowledge.
Installation
git clone https://github.com/JoshuaOliphant/vault-recommender.git
cd vault-recommender
uv sync
Usage
CLI
# Build the index (run once, re-run when vault changes significantly)
uv run vault-recommender --vault /path/to/vault index
# Recommend by topic
uv run vault-recommender --vault /path/to/vault recommend --topic "career transition strategies"
# Recommend notes similar to a specific note
uv run vault-recommender --vault /path/to/vault recommend --note "areas/career/plan.md"
# Find missing connections (similar but not linked)
uv run vault-recommender --vault /path/to/vault recommend --note "areas/career/plan.md" --exclude-linked
# JSON output (for LLM consumption)
uv run vault-recommender --vault /path/to/vault recommend --topic "python testing" --json
MCP Server (Claude Code integration)
Add to your .mcp.json:
{
"mcpServers": {
"vault-recommender": {
"type": "stdio",
"command": "uv",
"args": [
"run",
"--directory",
"/path/to/vault-recommender",
"python",
"-m",
"vault_recommender.mcp_server"
],
"env": {
"VAULT_PATH": "/path/to/your/vault"
}
}
}
}
This exposes three tools:
recommend_by_topic— open-ended semantic searchrecommend_by_note— "notes like this one"find_missing_connections— similar but unlinked notes
Python API
from pathlib import Path
from vault_recommender.parser import parse_vault
from vault_recommender.indexer import build_index
from vault_recommender.graph import build_graph
from vault_recommender.recommender import VaultRecommender
vault = Path("/path/to/vault")
notes = parse_vault(vault)
index = build_index(notes)
graph = build_graph(notes)
rec = VaultRecommender(index=index, graph=graph, vault_path=vault)
# By topic
results = rec.similar_to_topic("career transition")
# By note
results = rec.similar_to_note("areas/career/plan.md")
# Each result has: path, title, score, snippet, tags, reason
for r in results:
print(f"{r.score:.3f} {r.title} — {r.reason}")
Performance
- ~1,500 notes indexed in ~5 seconds (M-series Mac)
- Queries return in <1 second (after model warm-up)
- Index persists as numpy + JSON (~2MB for 1,500 notes)
- Model:
all-MiniLM-L6-v2(~80MB, runs on CPU)
Requirements
- Python 3.12+
- An Obsidian vault (or any directory of markdown files with
[[wiki-links]])
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vault_recommender-0.1.0.tar.gz.
File metadata
- Download URL: vault_recommender-0.1.0.tar.gz
- Upload date:
- Size: 107.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
dc2eca37a2be9c4fb3d0dba97afba0617ba12d643e79987707be26b58cf2b273
|
|
| MD5 |
637623ec831022f7ed52278baf5e2a32
|
|
| BLAKE2b-256 |
9003238cf7fbf0cf0defa3f66279ad89ccb71863a5e33ce030f910b38e6d1c8d
|
Provenance
The following attestation bundles were made for vault_recommender-0.1.0.tar.gz:
Publisher:
publish.yml on JoshuaOliphant/vault-recommender
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
vault_recommender-0.1.0.tar.gz -
Subject digest:
dc2eca37a2be9c4fb3d0dba97afba0617ba12d643e79987707be26b58cf2b273 - Sigstore transparency entry: 1108046309
- Sigstore integration time:
-
Permalink:
JoshuaOliphant/vault-recommender@cf2d9eb47af19b5c9d4c8c55ed2117dd6f7a7bda -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/JoshuaOliphant
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@cf2d9eb47af19b5c9d4c8c55ed2117dd6f7a7bda -
Trigger Event:
release
-
Statement type:
File details
Details for the file vault_recommender-0.1.0-py3-none-any.whl.
File metadata
- Download URL: vault_recommender-0.1.0-py3-none-any.whl
- Upload date:
- Size: 13.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2d1230d7c1bec919efc632f9e63858e5d8ad253e8f4cd739a6de0f5fc83a3358
|
|
| MD5 |
0b440ab36d84bc8661b16d3cb59f9651
|
|
| BLAKE2b-256 |
0ebced8034e22b2198ac7c216ae0b8437ec8d800a3e67630700e0fcb45038fa4
|
Provenance
The following attestation bundles were made for vault_recommender-0.1.0-py3-none-any.whl:
Publisher:
publish.yml on JoshuaOliphant/vault-recommender
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
vault_recommender-0.1.0-py3-none-any.whl -
Subject digest:
2d1230d7c1bec919efc632f9e63858e5d8ad253e8f4cd739a6de0f5fc83a3358 - Sigstore transparency entry: 1108046310
- Sigstore integration time:
-
Permalink:
JoshuaOliphant/vault-recommender@cf2d9eb47af19b5c9d4c8c55ed2117dd6f7a7bda -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/JoshuaOliphant
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@cf2d9eb47af19b5c9d4c8c55ed2117dd6f7a7bda -
Trigger Event:
release
-
Statement type: