Build, maintain, and search your knowledge vault. CLI + MCP server with stale note detection, semantic search, and neuroscience-grounded memory.
Project description
Long-term memory for AI agents. NeuroStack indexes your Markdown vault into a knowledge graph your AI can search -- structured facts, summaries, or full content, automatically tiered by query complexity. It persists agent memories, detects stale notes, and recovers context across sessions. Your vault files are never modified.
Works with Claude Code, Cursor, Windsurf, Codex, and Gemini CLI via MCP.
Get started
npm install -g neurostack
neurostack init
The wizard walks you through everything: cloud or local, lite or full, Ollama models, vault path, profession pack, and full indexing -- one command.
| Cloud | Local | |
|---|---|---|
| Best for | Zero friction, any machine | Privacy-first, offline, power users |
| Indexing | Gemini API (server-side) | Ollama on your machine |
| Search | Local SQLite | Local SQLite (same DB) |
| GPU required | No | Recommended for Full mode |
| Multi-device | Push once, pull anywhere | Manual DB sync |
| Cost | Free tier / Pro plans | Free (your hardware) |
Cloud
No GPU, no Ollama, no ML dependencies. Gemini indexes your vault server-side and returns a ready-to-use SQLite database. All search runs locally against that DB.
neurostack init # choose "Cloud" → vault setup → login → push
Free tier: 500 queries/month, 200 notes. Dashboard: app.neurostack.sh
Files are uploaded for indexing via HTTPS, processed by Gemini, and not retained after indexing completes.
Local
Run everything on your machine with Ollama. Choose a tier during neurostack init:
- Lite (~130 MB) -- FTS5 search, wiki-link graph, stale detection, MCP server. No GPU or Ollama required.
- Full (~560 MB) -- adds semantic search, AI summaries, knowledge graph triples, and attractor basin community detection via local Ollama. GPU or 6+ core CPU recommended.
Full mode automatically runs the complete indexing pipeline: embeddings, summaries, triples, and community detection.
Non-interactive mode:
neurostack init --mode full --pull-models ~/brain
neurostack init --cloud ~/brain
Alternative install methods
# PyPI
pipx install neurostack
pip install neurostack # inside a venv
uv tool install neurostack
# One-line script
curl -fsSL https://raw.githubusercontent.com/raphasouthall/neurostack/main/install.sh | bash
# Lite mode (no ML deps)
curl -fsSL https://raw.githubusercontent.com/raphasouthall/neurostack/main/install.sh | NEUROSTACK_MODE=lite bash
On Ubuntu 23.04+, Debian 12+, Fedora 38+, bare
pip installoutside a venv is blocked by PEP 668. Usenpm,pipx, oruv tool install.
To uninstall: neurostack uninstall
Connect to your AI
Claude Code (one command)
claude mcp add neurostack -- neurostack serve
Claude Desktop
Download the .mcpb bundle from Releases and double-click to install. Or auto-configure:
neurostack setup-desktop
Remote MCP (no local install)
Connect Claude to your vault via NeuroStack Cloud -- no Python, no Ollama, nothing to install locally:
claude mcp add neurostack --transport http https://mcp.neurostack.sh/mcp
Other MCP clients
Auto-configure Cursor, Windsurf, Gemini CLI, VS Code, or Codex:
neurostack setup-client cursor # or: windsurf, gemini, vscode, claude-code
neurostack setup-client --list # show all supported clients
Manual JSON config
Add to your client's MCP config file:
{
"mcpServers": {
"neurostack": {
"command": "neurostack",
"args": ["serve"],
"env": {}
}
}
}
After connecting, all 21 MCP tools are available. Search your vault, save memories, detect stale notes -- all from your AI chat.
Search
Retrieval is tiered. Most queries resolve at the cheapest tier:
| Tier | Tokens | What your AI gets | Example |
|---|---|---|---|
| Triples | ~15 | Structured facts: Alpha API -> uses -> PostgreSQL 16 |
Quick lookups, factual questions |
| Summaries | ~75 | AI-generated note summary | "What is this project about?" |
| Full content | ~300 | Actual Markdown content | Deep dives, editing context |
| Auto | Varies | Starts at triples, escalates only if coverage is low | Default for most queries |
Full mode adds hybrid semantic + keyword search with neuroscience-grounded ranking: energy landscape convergence, lateral inhibition, and prediction error feedback. Workspace scoping restricts queries to a vault subdirectory.
neurostack search "deployment checklist"
neurostack tiered "auth flow" --top-k 3
neurostack search -w "work/" "query" # workspace scoping
neurostack --json search "query" | jq # machine-readable output
Maintain
Stale note detection. When a note keeps appearing in search contexts where it doesn't belong, NeuroStack flags it as a prediction error. Old decisions, superseded specs, reversed conclusions -- without detection, your AI cites these confidently. Notes with unresolved prediction errors are automatically demoted in future search results.
Excitability decay. Recently accessed notes score higher in search results. Unused notes fade over time. Modeled on CREB-regulated neuronal excitability (Han et al. 2007).
Co-occurrence learning. Notes retrieved together frequently get their connection weights strengthened automatically. The search graph learns your actual workflow, not just your file structure.
Topic clusters. Attractor basin community detection groups notes into thematic clusters for broad "what do I know about X?" queries. Uses Hopfield-style dynamics with a blended similarity matrix (embeddings + co-occurrence + wiki-links). Included in Full mode -- no extra dependencies.
Lateral inhibition. Higher-ranked search results suppress semantically similar competitors, promoting diversity. Prevents five near-identical notes from dominating your results.
neurostack prediction-errors # stale note detection
neurostack decay # excitability report
neurostack communities build # rebuild topic clusters
neurostack watch # auto-index on vault changes
Agent memories
AI assistants can write typed memories back to NeuroStack: observation, decision, convention, learning, context, bug. Memories are stored in SQLite and surfaced automatically in vault_search results.
- Near-duplicate detection with merge support
- Optional TTL for ephemeral memories
- Tag suggestions on save
- Update in place or merge two memories with audit trail
neurostack memories add "postgres 16 requires --wal-level=replica" --type decision --tags "db,postgres"
neurostack memories search "postgres"
neurostack memories merge <target> <source>
neurostack memories prune --expired
Session harvest
Scans Claude Code JSONL session transcripts, extracts insights (observations, decisions, conventions, bugs), and deduplicates against existing memories before saving.
neurostack harvest --sessions 5 # extract from last 5 sessions
neurostack hooks install # install systemd timer for hourly harvest
neurostack sessions search "query" # search raw transcripts
Context recovery
Two modes for rebuilding working context after /clear or starting a new session:
vault_context-- task-anchored. Assembles relevant notes, memories, and triples for a specific task within a token budget.session_brief-- time-anchored. Compact briefing of recent activity, hot notes, and alerts.
neurostack context "migrate auth to OAuth2" --budget 2000
neurostack brief
Build
NeuroStack scaffolds new vaults or onboards existing Markdown directories. Six profession packs provide domain-specific templates, seed notes, and workflow guidance.
neurostack init # interactive setup, offers profession packs
neurostack onboard ~/my-notes # import existing notes with frontmatter generation
neurostack scaffold devops # apply a pack to an existing vault
neurostack scaffold --list # researcher, developer, writer, student, devops, data-scientist
~/your-vault/ # your Markdown files (never modified)
~/.config/neurostack/config.toml # configuration
~/.local/share/neurostack/
neurostack.db # SQLite + FTS5 knowledge graph
sessions.db # session transcript index
MCP tools (21 tools)
| Tool | Description |
|---|---|
vault_search |
Hybrid search with tiered depth (triples, summaries, full, auto) |
vault_ask |
RAG Q&A with inline citations |
vault_summary |
Pre-computed note summary |
vault_graph |
Wiki-link neighborhood with PageRank scores |
vault_related |
Semantically similar notes by embedding distance |
vault_triples |
Knowledge graph facts (subject-predicate-object) |
vault_communities |
GraphRAG queries across topic clusters |
vault_context |
Task-scoped context assembly within token budget |
session_brief |
Compact session briefing |
vault_stats |
Index health, excitability breakdown, memory stats |
vault_record_usage |
Track note hotness |
vault_prediction_errors |
Surface stale notes |
vault_remember |
Store a memory (returns duplicate warnings + tag suggestions) |
vault_update_memory |
Update a memory in place |
vault_merge |
Merge two memories (unions tags, audit trail) |
vault_forget |
Delete a memory |
vault_memories |
List or search memories |
vault_harvest |
Extract insights from session transcripts |
vault_capture |
Quick-capture to vault inbox |
vault_session_start |
Begin a memory session |
vault_session_end |
End session with optional summary and auto-harvest |
CLI reference
# Setup
neurostack init # one-command setup: deps, vault, index
neurostack init --mode full ~/brain # non-interactive full mode
neurostack init --cloud ~/brain # non-interactive cloud mode
neurostack onboard ~/my-notes # import existing Markdown notes
neurostack scaffold researcher # apply a profession pack
neurostack update # pull latest source + re-sync deps
neurostack uninstall # complete removal
# Search & retrieval
neurostack search "query" # hybrid search
neurostack ask "question" # RAG Q&A with citations
neurostack tiered "query" # tiered: triples -> summaries -> full
neurostack triples "query" # knowledge graph triples
neurostack summary "note.md" # AI-generated note summary
neurostack related "note.md" # semantically similar notes
neurostack graph "note.md" # wiki-link neighborhood
neurostack communities query "topic" # GraphRAG across topic clusters
neurostack context "task" --budget 2000 # task-scoped context recovery
neurostack brief # session briefing
# Maintenance
neurostack index # build/rebuild knowledge graph
neurostack watch # auto-index on vault changes
neurostack decay # excitability report
neurostack prediction-errors # stale note detection
neurostack backfill [summaries|triples|all] # fill gaps in AI data
neurostack communities build # rebuild topic clusters
neurostack reembed-chunks # re-embed all chunks
# Memories
neurostack memories add "text" --type observation # store (--ttl 7d)
neurostack memories search "query" # search memories
neurostack memories list # list all
neurostack memories update <id> --content "revised"
neurostack memories merge <target> <source>
neurostack memories forget <id> # remove
neurostack memories prune --expired # clean up
# Sessions
neurostack harvest --sessions 5 # extract session insights
neurostack sessions search "query" # search transcripts
neurostack hooks install # hourly harvest timer
# Cloud
neurostack cloud login # browser OAuth login
neurostack cloud status # auth + vault info
neurostack cloud push # upload + index vault
neurostack cloud pull # download indexed DB
neurostack cloud query "query" # search via cloud API
# Diagnostics
neurostack stats # index health
neurostack doctor # validate all subsystems
neurostack demo # interactive demo with sample vault
FAQ
Does it modify my vault files? No. All data lives in NeuroStack's own SQLite databases. Your Markdown files are strictly read-only.
Do I need a GPU? No. Use NeuroStack Cloud for zero-GPU setup. For local mode, Lite has zero ML dependencies. Full mode runs on CPU but summarization is slow without a GPU.
How large a vault can it handle? Tested with ~5,000 notes. FTS5 search stays fast at any size. Cloud indexing handles 500+ notes in minutes.
Can I use it without MCP? Yes. The CLI works standalone. Pipe output into any LLM.
Is my vault private? In local mode, nothing leaves your machine. In cloud mode, your Markdown files are uploaded for indexing via HTTPS, processed by Gemini, and the indexed DB is returned. Files are not stored after indexing completes.
Requirements
- Linux or macOS
- Cloud mode: just Node.js. No GPU, no Ollama, no Python ML deps.
- Local Full mode: Ollama with
nomic-embed-textand a summary model. GPU or 6+ core CPU recommended.
Neuroscience basis
Each feature is modeled on a specific mechanism from memory neuroscience:
| Feature | Mechanism | Citation |
|---|---|---|
| Stale detection + demotion | Prediction error signals trigger reconsolidation | Sinclair & Bhatt 2022 |
| Excitability decay | CREB-elevated neurons preferentially join new memories | Han et al. 2007 |
| Co-occurrence learning | Hebbian "fire together, wire together" plasticity | Hebb 1949 |
| Topic clusters | Hopfield attractor basin dynamics, inverse temperature | Ramsauer et al. 2020 |
| Convergence confidence | Energy landscape retrieval, basin width = robustness | Krotov & Hopfield 2016 |
| Lateral inhibition | PV+/SOM+ interneuron winner-take-all competition | Rashid et al. 2016 |
| Tiered retrieval | Complementary learning systems | McClelland et al. 1995 |
Full citations: docs/neuroscience-appendix.md
Get involved
- Website: neurostack.sh
- Contributing: CONTRIBUTING.md
- Contact: hello@neurostack.sh
- Sponsor: GitHub Sponsors | Buy me a coffee
License
Apache-2.0 -- see LICENSE. No GPL dependencies.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file neurostack-0.10.3.tar.gz.
File metadata
- Download URL: neurostack-0.10.3.tar.gz
- Upload date:
- Size: 291.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5194edc1f465147e6062531b708270ef6a08cdf9041b7c08c6b3946d891735f7
|
|
| MD5 |
f4a06edf4ade682bdde2e55ed7453173
|
|
| BLAKE2b-256 |
6c879fcaecaba0eb569cea85e998ba5c4ff818c0ba32ed8149618a4c13cf6330
|
Provenance
The following attestation bundles were made for neurostack-0.10.3.tar.gz:
Publisher:
publish.yml on raphasouthall/neurostack
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
neurostack-0.10.3.tar.gz -
Subject digest:
5194edc1f465147e6062531b708270ef6a08cdf9041b7c08c6b3946d891735f7 - Sigstore transparency entry: 1185578418
- Sigstore integration time:
-
Permalink:
raphasouthall/neurostack@24ab0223d23883e10c37804c65fe5f01a2fe3b97 -
Branch / Tag:
refs/tags/v0.10.3 - Owner: https://github.com/raphasouthall
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@24ab0223d23883e10c37804c65fe5f01a2fe3b97 -
Trigger Event:
push
-
Statement type:
File details
Details for the file neurostack-0.10.3-py3-none-any.whl.
File metadata
- Download URL: neurostack-0.10.3-py3-none-any.whl
- Upload date:
- Size: 165.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f5b452968b7c0b9b9ef7df6c1414f0b86b3b1ddcdfbba7b2c97f4772d6665ce1
|
|
| MD5 |
fb12c1a29c4ebe23e56393a9911ae6c2
|
|
| BLAKE2b-256 |
da82e025015ae6809779c744b3be1b7e6456477899963e362abbf32d7f07be59
|
Provenance
The following attestation bundles were made for neurostack-0.10.3-py3-none-any.whl:
Publisher:
publish.yml on raphasouthall/neurostack
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
neurostack-0.10.3-py3-none-any.whl -
Subject digest:
f5b452968b7c0b9b9ef7df6c1414f0b86b3b1ddcdfbba7b2c97f4772d6665ce1 - Sigstore transparency entry: 1185578430
- Sigstore integration time:
-
Permalink:
raphasouthall/neurostack@24ab0223d23883e10c37804c65fe5f01a2fe3b97 -
Branch / Tag:
refs/tags/v0.10.3 - Owner: https://github.com/raphasouthall
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@24ab0223d23883e10c37804c65fe5f01a2fe3b97 -
Trigger Event:
push
-
Statement type: