A local-first, semantic memory MCP server. A 'subconscious' for any MCP-compatible LLM agent.
Project description
subconscious-mcp
A local-first, learning, semantic memory layer for any MCP-compatible LLM agent.
The server runs as an MCP stdio process on your machine and exposes four tools — recall, remember, forget, stats — that let an agent ask "have I seen this task before?" and, if so, get back the previous answer in milliseconds without re-running the work.
Embeddings come from sentence-transformers/all-MiniLM-L6-v2 (384-dim, runs on CPU). Storage is a persistent local ChromaDB collection. No data leaves your machine.
Install
# Once published:
pip install subconscious-mcp
# Local development:
git clone https://github.com/vishaltorc/subconscious-mcp
cd subconscious-mcp
pip install -e ".[dev]"
After install you can run the server from anywhere:
subconscious-mcp --help
The first time a tool is called, the embedding model (~80MB) is downloaded into the local Hugging Face cache. Subsequent starts are fast.
Configure your MCP client
Claude Desktop
Edit your config file:
- macOS:
~/Library/Application Support/Claude/claude_desktop_config.json - Windows:
%APPDATA%\Claude\claude_desktop_config.json
Add an mcpServers entry:
{
"mcpServers": {
"subconscious": {
"command": "subconscious-mcp",
"args": []
}
}
}
If subconscious-mcp isn't on Claude Desktop's PATH, use the absolute path printed by which subconscious-mcp, e.g. "command": "/Users/you/.local/bin/subconscious-mcp".
Then quit and restart Claude Desktop. The new tools appear under the 🔌 indicator.
Claude Code
Option A — register from the CLI (recommended):
claude mcp add subconscious-mcp -- subconscious-mcp
Option B — edit ~/.claude.json (or your project's .mcp.json) and add:
{
"mcpServers": {
"subconscious-mcp": {
"command": "subconscious-mcp",
"args": [],
"type": "stdio"
}
}
}
Reload the Claude Code session and the four tools become available.
A copy-pasteable file is in examples/claude_desktop_config.json and examples/claude_code_config.json.
Tools
recall(task, threshold=0.85, top_k=1)
Semantic search for a previously remembered task.
| arg | type | default | meaning |
|---|---|---|---|
task |
str | — | the task description to look up |
threshold |
float | 0.85 | minimum cosine similarity for a hit |
top_k |
int | 1 | how many candidates to consider |
Returns:
{
"hit": true,
"similarity": 0.91,
"answer": "...",
"task_text": "...",
"entry_id": "uuid",
"stored_at": 1731000000.0,
"tags": ["..."]
}
On a miss, hit is false, answer is null, and similarity is the best similarity observed in top_k — so callers can see how close they came.
remember(task, answer, tags=[], ttl_seconds=null)
Persist a (task, answer) pair. Returns {stored, entry_id, embedding_dim}.
ttl_seconds=null means never expire. Pass an integer to have the entry filtered out of future recalls after that many seconds.
forget(entry_id)
Delete the entry with this id. Returns {"forgotten": true} if it existed, else false.
stats()
Returns {"total_entries", "last_hit_at", "hit_rate_last_100"}. hit_rate_last_100 is a sliding window over the most recent 100 recall calls — useful to see whether memory is actually paying off.
Configuration
Configuration is resolved in priority order:
- Environment variables (highest)
~/.subconscious-mcp/config.json- Built-in defaults
| key | default | env var |
|---|---|---|
storage_dir |
~/.subconscious-mcp/data |
SUBCONSCIOUS_STORAGE_DIR |
embedding_model |
all-MiniLM-L6-v2 |
SUBCONSCIOUS_EMBEDDING_MODEL |
default_threshold |
0.85 |
SUBCONSCIOUS_DEFAULT_THRESHOLD |
default_ttl_seconds |
null |
— |
log_level |
INFO |
SUBCONSCIOUS_LOG_LEVEL |
Inspect the resolved config without starting the server:
subconscious-mcp --print-config
Files written on disk
~/.subconscious-mcp/
├── config.json (optional, user-edited)
├── data/ ChromaDB collection (sqlite + parquet)
└── logs/server.log rotating, 2MB x 3 backups
To wipe your memory: rm -rf ~/.subconscious-mcp/data.
Demo session
See examples/demo_session.md for a worked example of an agent calling recall (miss → remember), then on a later turn calling recall again with a paraphrase and getting a hit.
Architecture
See docs/architecture.md for the layered design (server / tools / memory / config), the rationale behind ChromaDB + cosine similarity, and the TTL strategy.
Troubleshooting
subconscious-mcp: command not found after install
Your shell's PATH doesn't include the install location. Try python -m subconscious_mcp.server --help to confirm the package works, then use the absolute path in your MCP client config.
Claude Desktop says "Server disconnected"
Check ~/.subconscious-mcp/logs/server.log for the traceback. Most common causes:
- The model download failed (offline at first launch) — re-run with network connectivity.
- The
storage_diris on a read-only volume.
First recall is slow The first invocation lazily loads the sentence-transformer model (~5s on a modest CPU). Subsequent calls reuse the loaded model and respond in milliseconds.
Recall keeps missing on obvious paraphrases
Lower the threshold (recall(task=..., threshold=0.7)) or raise top_k to see candidates. all-MiniLM-L6-v2 is small and fast — for higher-quality matching set SUBCONSCIOUS_EMBEDDING_MODEL=all-mpnet-base-v2.
Tests fail with a sentence-transformers download error
You're offline or behind a proxy. Set HF_HUB_OFFLINE=1 once you've pre-downloaded the model, or run python -c "from sentence_transformers import SentenceTransformer; SentenceTransformer('all-MiniLM-L6-v2')" once with connectivity.
License
MIT © 2026 Vishal Jayaprakash
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file subconscious_mcp-0.1.0.tar.gz.
File metadata
- Download URL: subconscious_mcp-0.1.0.tar.gz
- Upload date:
- Size: 15.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fbcff93483b67958361c57368abdc61dbdace5787ed3da0d3d32852e3324c114
|
|
| MD5 |
684c7ef13c03d7bb19b0311a5057893b
|
|
| BLAKE2b-256 |
2b9fa73587ed1af88e976f24e43b4e54911936f156c1a8cd473b96c0bce5ae14
|
File details
Details for the file subconscious_mcp-0.1.0-py3-none-any.whl.
File metadata
- Download URL: subconscious_mcp-0.1.0-py3-none-any.whl
- Upload date:
- Size: 12.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.13
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c65b311ee7d841edb96ea240f04c4ffd131de12ba59fa33495c1bcfe1b638580
|
|
| MD5 |
2955e42721b4698a0c47c475f893981b
|
|
| BLAKE2b-256 |
4992104364a8c76b3fa8f59a007c57d9ceff75066f0a53fe8643d67b27ea6645
|