MCP server for Neruva agent memory + reasoning substrate. v0.21 adds code_kg_callees/callers/class_of/module_of/imports for sub-ms structural code queries on KGs built locally via neruva-record-code-index. Removes agent_postmortem_prompt (transport stall). Plus Records, 5-engine KG, federated agent_remember/recall/context, Pearl do-operator, HD analogy, CBR, ToM, counterfactual rollouts, schema lifting, EFE, continual K-gram, rule induction, replay, .neruva V3. MCP drop-in for Claude Code, Cursor, Codex.
Project description
neruva-mcp
MCP server for Neruva — memory + reasoning substrate for AI agents. Knowledge graph (5 engines), Pearl do-operator, HD analogy, episodic CBR, deterministic replay. Drop into Claude Code / Cursor / Codex / Gemini CLI in one line.
For Claude Code users: see neruva.io/claude-code for the 30-second install + first-queries to try.
What's new in 0.21.0 — code-graph MCP tools + postmortem removed
- 5 new
code_kg_*tools for sub-ms structural code queries against KGs built locally vianeruva-record-code-index:code_kg_callees,code_kg_callers,code_kg_class_of,code_kg_module_of,code_kg_imports. Thin wrappers overhd_kg_querywith "Call this when..." routing nudges. agent_postmortem_prompttool removed. Paired with theneruva-recordv0.10.1 strip ofNERUVA_AUTO_POSTMORTEM— the synchronous fetch in the failure hook was stalling sessions under transport hiccups./v1/agent/postmortem_promptsubstrate endpoint now returns 410 Gone.- Tool-description routing nudges. All high-leverage tools (records_*, agent_recall/context/remember, hd_kg_query, hd_analogy, hd_causal_query, agent_counterfactual_rollout, agent_model_belief(_add), agent_register_action, agent_plan_efe, agent_induce_rule, agent_extract_schema, agent_hierarchical_decode) lead with "Call this when..." so LLMs route into the right substrate primitive without explicit prompting.
What's new in 0.18.3 — depth-unlimited theory of mind + 125× faster cleanup
- Theory of mind is now depth-unlimited (v0.5.4 substrate fix). Position-tagged at every chain index via non-commutative permutation binding. Inner-position swaps correctly reject; recursive self- reference (same agent at multiple chain positions) works natively.
- Cleanup acceleration via FAISS-binary popcount. OPB query stage 2 uses SIMD popcount over sign-quantized atoms with deterministic float32 cosine rerank. Substantially faster on warm queries; replay bit-identical.
- 551× compression on stored OPB pages (rank-12 SVD). Persistence blobs that were >100 MB now fit in under 1 MB at perfect recall on round-trip.
The 9-level cognitive ladder — no LLM vendor ships rows 3-9
The substrate now exposes the full 9-level cognitive ladder. Every primitive runs sub-100ms, deterministic from seed, behind one MCP install.
| # | Capability | MCP tool(s) | Frontier LLM equivalent |
|---|---|---|---|
| 1 | Vector retrieval (OPB pages + spectral routing) | records_query(engine="opb") |
Pinecone/Zep (Level 1 only) |
| 2 | KG + Pearl do-operator + HD analogy + CBR | hd_kg_* · agent_causal_query · hd_analogy · hd_cbr_* |
nobody |
| 3 | Theory of Mind (nested belief) | agent_model_belief_add · agent_model_belief |
hallucinates at depth |
| 4 | Counterfactual rollouts ("what if k → a'?") | agent_counterfactual_rollout |
confabulates |
| 5 | Schema lifting (analogical pattern matching) | agent_extract_schema |
needs fine-tuning |
| 6 | Active Inference planning (Friston EFE) | agent_register_action · agent_plan_efe |
not a primitive |
| 7 | Few-shot rule induction | agent_induce_rule |
fine-tune (many examples) |
| 8 | Persistent rule storage | agent_persist_rule · agent_recall_rule |
re-feed demos every recall |
| 9 | Continual learning, zero forgetting | agent_continual_train · agent_continual_predict |
catastrophic forgetting |
| + | Hierarchical chunking (recursive L^K decode) | agent_hierarchical_add · agent_hierarchical_decode |
not a primitive |
~90 tools across Records, KG, Causal, Analogy, CBR, Blend, Vector memory, federated agent_*, the 9 cognitive primitives above, self-introspection.
Why this is unique
Every primitive in rows 3-9 is a graduated, production-shipped engine. No published memory vendor offers more than rows 1-2. Substrate-augmented small LLMs can match frontier-class agentic capabilities at a fraction of the cost per recall.
Install
# In Claude Code (any directory, user scope):
claude mcp add-json neruva '{"command":"npx","args":["-y","@neruva/mcp@latest"],"env":{"NERUVA_API_KEY":"nv_..."}}'
Or one-line install via npx for any MCP host:
npx -y @neruva/mcp@latest # one-off
npm i -g @neruva/mcp # then `neruva-mcp`
Get an API key at https://app.neruva.io (free tier, no credit card).
Wire into a host
Claude Code
claude mcp add-json neruva '{"command":"npx","args":["-y","@neruva/mcp@latest"],"env":{"NERUVA_API_KEY":"..."}}'
Cursor (~/.cursor/mcp.json)
{
"mcpServers": {
"neruva": {
"command": "npx",
"args": ["-y", "@neruva/mcp@latest"],
"env": { "NERUVA_API_KEY": "..." }
}
}
}
Codex (~/.codex/config.toml)
[mcp_servers.neruva]
command = "npx"
args = ["-y", "@neruva/mcp@latest"]
env = { NERUVA_API_KEY = "..." }
Gemini CLI (~/.gemini/settings.json)
{ "mcpServers": { "neruva": { "command": "npx", "args": ["-y", "@neruva/mcp@latest"], "env": { "NERUVA_API_KEY": "..." } } } }
The substrate, in one paragraph
Five layers, one API. Records = typed agentic events (decisions, mistakes, tool_calls, llm_turns; auto-embedded at D=1024). Knowledge Graph = mutable structured state across 5 engines, sub-ms cosine retrieval, matrix-power N-hop derive. Causal = Pearl's do-operator (observation vs intervention arithmetically distinct). Analogy = a:b::c:? in HD feature space. Concept Blending = provenance-preserving merge of multiple memories. CBR = factored episode store. The new federated agent_* layer (agent_remember / agent_recall / agent_context) routes across all substrates so a single call handles "where does X store, and how do I get it back?"
Deterministic from a seed. Replayable bit-exactly. Portable as .neruva containers — your data is yours.
Three-line LangChain integration
# pip install neruva-langchain
from neruva_langchain import NeruvaChatMessageHistory
history = NeruvaChatMessageHistory(namespace="user_alice")
# wire into any chain that takes BaseChatMessageHistory
Same pattern: neruva-langgraph (BaseCheckpointSaver + BaseStore), neruva-crewai (Storage interface + 3 memory flavors).
Auto-record for Claude Code
pip install neruva-record && neruva-record-install
Every Claude Code session lands in your Neruva account: tool calls, chat turns, secrets-redacted client-side, queryable across sessions.
Why use this over a vector DB or Zep
| Vector DB | Zep | Neruva | |
|---|---|---|---|
| KG engines | 0 | 1 | 5 |
| Causal queries (Pearl do-operator) | ❌ | ❌ | ✅ |
| Provable replay (deterministic snapshot/restore) | ❌ | ❌ | ✅ |
| Anomaly detection (quorum disagreement) | ❌ | ❌ | ✅ |
| Federated context (records+KG one call) | ❌ | partial | ✅ |
| Portable container | ❌ | ❌ | ✅ .neruva |
| p95 latency | varies | varies | <100ms |
| Cost per recall vs context-stuffing | varies | varies | dramatically lower |
Auth
Set NERUVA_API_KEY in env. NERUVA_URL defaults to https://api.neruva.io.
Optional: NERUVA_AUTO_RECORD=namespace[:ttl_days] — every tool call this agent makes auto-records into the named records namespace. Fire-and-forget, never blocks or breaks the call.
Update flow
The startup banner prints when a newer version is available:
[neruva-mcp] update available: you have 0.16.0, latest is 0.16.1.
If registered with @neruva/mcp@latest, a Claude Code restart auto-updates.
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file neruva_mcp-0.21.0.tar.gz.
File metadata
- Download URL: neruva_mcp-0.21.0.tar.gz
- Upload date:
- Size: 29.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c4c8e701444c4f5eaf409709fd69f0a40cda1195a10495ac179731630129566f
|
|
| MD5 |
b02ed5b7c42000b09dfbc1d23cc22fd2
|
|
| BLAKE2b-256 |
2538be15bed13be0052d0c266a5f8d6d813ba00c8fd7de2007276727858feee1
|
File details
Details for the file neruva_mcp-0.21.0-py3-none-any.whl.
File metadata
- Download URL: neruva_mcp-0.21.0-py3-none-any.whl
- Upload date:
- Size: 25.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d042b9a6e6680885cb36214364a3e36b0361cc0ccede87f57cc1b314ca143daa
|
|
| MD5 |
3f57203952e48d2182b8f735bbc1909f
|
|
| BLAKE2b-256 |
9fe5a08d701a0e637dbd75b9341419b88b8a951f1f25e5eec2528e55d4998ef9
|