Local-first Markdown + SQLite memory for LLM agents, with keyword search, optional embeddings, MCP, and bounded citation tools.

These details have not been verified by PyPI

Project description

Vault-for-LLM

Local-first project memory for AI agents.

Vault-for-LLM turns project notes, decisions, bugs, SOPs, Obsidian notes, and agent-written memory candidates into a portable SQLite vault that agents can search, read in bounded ranges, cite, test, back up, and sync when needed.

It is not trying to replace your model, your wiki, or hosted memory systems. It sits between them: a small governed memory layer that helps agents reuse project knowledge without losing source, scope, or reviewability.

The default path is agent-driven: ask your coding agent to install Vault, choose where the database should live, and run a small search/read/propose smoke test. Manual commands are still here, but they are no longer the main story.

Why It Exists

Most agent failures are practical, not mysterious:

a new session forgets why a decision was made
an agent reads the wrong outdated note
useful fixes stay buried in chat history
private observations get mixed with shared project knowledge
a team cannot tell whether retrieval is actually working

Vault-for-LLM is built for that practical gap. It gives agents a place to ask:

What has this project already learned, where did it come from, and am I allowed to use it?

What You Get

Local-first memory - Markdown and SQLite by default. No cloud service is required for core use.
Agent-friendly retrieval - CLI and MCP tools for search, bounded reads, candidate memory, Document Map inspection, and optional remote reads.
Candidate-first writes - agents can propose memory before it becomes active knowledge.
Governance metadata - scope, sensitivity, owner agent, allowed agents, memory type, and expiry travel with each memory.
Obsidian bridge - import existing Obsidian notes into Vault, or export compiled Vault knowledge back into Obsidian-readable Markdown.
Optional remote sharing - Supabase sync and read-only RPC templates let agents on different machines share reviewed memory.
Report-first automation - generated cron, LaunchAgent, and n8n templates can run memory maintenance without silently deleting or promoting memory.
Measurable recall - Search QA and onboarding benchmarks measure whether agents can find the right source, not just sound confident.

When To Use It

Use Vault-for-LLM when:

you work with Claude Code, Codex, Hermes Agent, OpenClaw, OpenCode, n8n, or another agent that needs project context across sessions
you want a shared project memory without giving every agent raw private notes
you already have Markdown or Obsidian notes and want agents to search them with citations
you need local-first storage but optional Supabase sharing for other hosts
you care about retrieval quality enough to test it

Do not start here if you only need a hosted vector database, a personal notes app, or an automatic conversation memory product.

Install

Agent-Driven Install

For most users, the right path is to ask an agent to install it:

Install Vault-for-LLM for this project. Use vault-for-llm[mcp]==0.6.66.
Ask whether the vault should be shared, private, domain-specific, or temporary.
Ask for a stable project directory and generate a stable venv script for
long-lived agent jobs. Ask separately about MCP, semantic search, Supabase,
Obsidian import, Headroom compression, and memory-agent guidance. Install only
the optional dependencies I approve. Finish with a search/read/propose smoke test.

The agent should use the guided installer:

python3 -m venv .venv
source .venv/bin/activate
pip install "vault-for-llm[mcp]==0.6.66"

vault setup-agent

For non-interactive agent installs:

vault setup-agent \
  --non-interactive \
  --agent codex \
  --scope shared \
  --agent-project-dir ~/Vaults/my-project \
  --features core,mcp,supabase,headroom \
  --write-stable-venv-script \
  --supabase-setup simple \
  --remote-reader shell \
  --automation-schedule cron \
  --json

This can generate agent-install/setup-stable-venv.sh, so scheduled jobs and MCP commands do not depend on a disposable /tmp virtualenv.

Manual Quickstart

pip install "vault-for-llm[mcp]==0.6.66"

vault init ~/Vaults/demo
vault add "First lesson" \
  --content "The bug was caused by a missing cache key. The fix was adding provider metadata." \
  --project-dir ~/Vaults/demo
vault compile --project-dir ~/Vaults/demo --no-embed
vault search "cache key" --project-dir ~/Vaults/demo

Daily Agent Flow

The intended loop is simple:

Search first - find likely source notes.
Read bounded ranges - avoid dumping whole documents into context.
Answer with sources - keep citations tied to original Vault content.
Propose memory - let agents suggest new lessons as candidates.
Review before promotion - keep active memory clean and auditable.

Agents can also turn a completed session transcript into reviewable candidates:

vault capture session codex-session.jsonl --project-dir ~/Vaults/my-project --pretty
vault capture session codex-session.jsonl --project-dir ~/Vaults/my-project --write-candidates

The first command is a dry-run preview. The second writes candidate memories only; it does not promote active knowledge.

For MCP-capable runtimes:

vault-mcp --project-dir ~/Vaults/my-project --tool-profile core

Recommended core tools:

vault_search
vault_read_range
vault_memory_propose
vault_stats

MCP guides:

Practical tool reference: docs/mcp_tool_reference.md
Workflow and token-budget guidance: docs/mcp_memory_workflow.md

Memory Model

Vault uses depth layers for how memory is used:

Layer	Purpose
`L0`	identity and project framing
`L1`	stable facts, rules, and preferences
`L2`	active context, summaries, current work
`L3`	detailed knowledge, SOPs, bugs, decisions, source notes

Access is not controlled by layer alone. Use governance metadata for policy:

scope: private, project, shared, public
sensitivity: low, medium, high, restricted
owner_agent
allowed_agents
memory_type
expires_at

Searches record lightweight usage counters (access_count, citation_count, last_accessed_at). The default lightweight reranker uses these signals as a small, saturated boost, so frequently useful memories can rise slightly without overriding source relevance, trust, freshness, or access policy.

Short-lived memories with expires_at can be moved to status: archived instead of deleted:

vault usage stats
vault usage archive-expired --apply

Design notes: docs/memory_governance.md.

Policy-based automation lets agents handle routine maintenance while humans keep ownership of the rules:

vault automation plan --write-policy
vault automation run
vault automation run --apply
vault automation cycle --apply
vault automation inbox --limit 5

vault capture session is the ingestion side of that loop. It scans agent transcripts for reusable decisions, pitfalls, workflows, and source-of-truth signals, then routes them through the normal candidate gates. Automation and Dream can later rank and clean those candidates, but promotion remains explicit.

Balanced automation can pre-fill the memory candidate queue with Dream and Forgetting suggestions when --apply is used, but it still never promotes candidates or hard-deletes memory. Use conservative mode when scheduled jobs should only write reports.

vault automation eval reads promoted/rejected/blocked candidate outcomes and shows which suggestion sources are earning trust over time. The signal guides future curation priority; it does not override review, privacy, or access policy.

Rejected and blocked candidates can be recorded directly:

vault candidate-review mem_123 --outcome rejected --reason "Too vague to keep."

This is useful for agents because "do not keep this" becomes structured feedback instead of disappearing into chat history.

For scheduled agents, vault automation eval --write-learning-policy also writes reports/automation/learning_policy.json. That file contains bounded priority hints, such as preferring a suggestion source that is often promoted or downgrading one that is often rejected. The bounds are deliberately small and never authorize auto-promotion, deletion, or privacy bypass.

Dream and scheduled automation can read that policy on the next run. They use it to annotate and sort candidate suggestions, so reviewers see better-ranked cleanup work first while the formal promote/reject decision remains explicit. When Dream finds duplicate groups, it can also write a consolidation_suggestion candidate that asks for a reviewed merge/archive decision without changing active knowledge by itself. vault automation cycle runs that feedback-to-curation loop in one command: evaluate reviewed outcomes, write the bounded learning policy, then run safe automation so Dream can consume the latest hints.

vault automation inbox is the short review surface for that loop. It does not mutate memory. It ranks privacy-blocked, sensitive, duplicate, weak-quality, and automation-generated candidates, hides raw content by default, and shows only the smallest useful queue for a human or trusted agent to review. Scheduled automation templates write the same view to reports/automation/inbox-latest.json after each successful run.

Agent installers can generate cron, LaunchAgent, or n8n templates with vault setup-agent --automation-schedule cron|launchagent|n8n|all. Scheduled templates default to vault automation cycle, so long-running agents can learn from reviewed outcomes before the next maintenance pass. The schedule is still report-first unless the user explicitly opts into --automation-apply; pass --automation-command run for a simpler maintenance-only schedule.

Automation details: docs/automation.md.

Memory Maintenance Agents

Vault can generate guidance for Profile, Dream, and Forgetting agents. These agents are conservative by default: Dream runs produce reports first, cleanup checks look for stale entries, duplicates, and weak metadata, and apply_safe paths should create backups so rollback remains possible. Promotion, deletion, archive, or expiry actions should stay candidate-only until a user-approved policy allows stronger automation.

Setup guide: docs/agent_install.md. Governance details: docs/memory_governance.md.

Integrations

System	Path
Claude Code / Codex / OpenCode	CLI or local stdio MCP
Hermes Agent / OpenClaw	CLI, MCP, generated agent install files
n8n	generated Supabase sync and remote-reader workflow templates
Coze or hosted agents	Supabase read-only RPC and OpenAPI template
Obsidian	import existing notes, export compiled Vault knowledge
Headroom	optional compression after Vault has narrowed context

Start here: docs/agent_integrations.md.

Optional Supabase Sharing

SQLite remains the source of truth. Supabase is optional.

Use it when agents on different machines or hosted platforms need to read a shared, filtered copy of reviewed project memory. Remote readers should pass the search result id directly into map/read; it may be an integer or a Supabase UUID.

pip install "vault-for-llm[supabase]==0.6.66"
python -m scripts.sync_to_supabase --db ~/Vaults/my-project/vault.db --document-map --health

Setup guide: docs/supabase_setup.md. Read policy template: docs/supabase_read_policy.sql.

Obsidian

Import an existing Obsidian vault:

vault import obsidian --vault ~/Documents/ObsidianVault --project-dir ~/Vaults/my-project --dry-run
vault import obsidian --vault ~/Documents/ObsidianVault --project-dir ~/Vaults/my-project --compile

Export compiled Vault knowledge back into Obsidian-readable notes:

vault export obsidian --project-dir ~/Vaults/my-project --vault ~/Documents/ObsidianVault

The importer skips .obsidian, .trash, .git, and generated Vault export folders by default.

Retrieval Quality

Vault includes lightweight QA tools so retrieval can be tested instead of trusted by intuition alone.

vault search-qa run \
  --qa-file benchmarks/search_qa/basic.en.json \
  --mode keyword \
  --output /tmp/vault-searchqa.json

Current evidence is intentionally described as retrieval evidence, not final answer quality:

project onboarding benchmark: Vault found source-backed project memory across 28/28 tasks in local proof runs
LoCoMo retrieval probes: hierarchical session + local evidence-window retrieval reached high evidence recall in official-scored categories
official answerer/judge scores are separate and require model-provider runs

More detail: docs/agent_onboarding_benchmark.md and docs/search_qa_benchmarking.md.

Maturity

Area	Status
local SQLite, Markdown compile, keyword search	stable
CLI setup, candidate memory, bounded reads	usable
MCP tools	usable, profile selection recommended
Obsidian import/export	usable
Supabase sync and remote read templates	advanced optional
policy-based memory automation	usable-alpha
semantic search, API/local embedding providers, rerank, benchmark adapters	evolving
Profile / Dream / Forgetting agents	guidance-first, not autonomous deletion

Vault-for-LLM is still pre-1.0. The core local path is intentionally conservative; advanced integrations are powerful, but should be enabled deliberately.

Documentation Map

Agent install runbook: docs/agent_install.md
CLI reference: docs/cli_reference.md
Agent integrations: docs/agent_integrations.md
Memory automation: docs/automation.md
Memory governance: docs/memory_governance.md
Supabase setup: docs/supabase_setup.md
MCP tool reference: docs/mcp_tool_reference.md
MCP workflow: docs/mcp_memory_workflow.md
PageIndex / Headroom comparison: docs/comparisons/pageindex_headroom.md
Vision notes: docs/vision.md

Development

git clone https://github.com/zycaskevin/Vault-for-LLM.git
cd Vault-for-LLM
python3 -m venv .venv
source .venv/bin/activate
pip install -e ".[dev,mcp]"
pytest -q

License

Apache-2.0. See LICENSE.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.7.14

Jun 26, 2026

0.7.13

Jun 26, 2026

0.7.12

Jun 26, 2026

0.7.10

Jun 25, 2026

0.7.9

Jun 25, 2026

0.7.8

Jun 25, 2026

0.7.7

Jun 25, 2026

0.7.6

Jun 25, 2026

0.7.5

Jun 25, 2026

0.7.4

Jun 25, 2026

0.7.3

Jun 25, 2026

0.7.2

Jun 25, 2026

0.7.1

Jun 25, 2026

0.7.0

Jun 25, 2026

0.7.0rc2 pre-release

Jun 25, 2026

0.7.0rc1 pre-release

Jun 25, 2026

0.6.124

Jun 25, 2026

0.6.123

Jun 25, 2026

0.6.122

Jun 25, 2026

0.6.121

Jun 25, 2026

0.6.120

Jun 25, 2026

0.6.119

Jun 25, 2026

0.6.118

Jun 25, 2026

0.6.117

Jun 25, 2026

0.6.116

Jun 25, 2026

0.6.115

Jun 25, 2026

0.6.114

Jun 25, 2026

0.6.113

Jun 25, 2026

0.6.112

Jun 25, 2026

0.6.111

Jun 25, 2026

0.6.110

Jun 25, 2026

0.6.109

Jun 25, 2026

0.6.108

Jun 25, 2026

0.6.107

Jun 24, 2026

0.6.106

Jun 24, 2026

0.6.105

Jun 24, 2026

0.6.104

Jun 24, 2026

0.6.103

Jun 24, 2026

0.6.102

Jun 24, 2026

0.6.101

Jun 24, 2026

0.6.100

Jun 24, 2026

0.6.99

Jun 24, 2026

0.6.98

Jun 24, 2026

0.6.97

Jun 24, 2026

0.6.96

Jun 24, 2026

0.6.95

Jun 24, 2026

0.6.94

Jun 24, 2026

0.6.93

Jun 24, 2026

0.6.92

Jun 24, 2026

0.6.91

Jun 24, 2026

0.6.90

Jun 24, 2026

0.6.89

Jun 24, 2026

0.6.88

Jun 24, 2026

0.6.87

Jun 24, 2026

0.6.86

Jun 24, 2026

0.6.85

Jun 24, 2026

0.6.84

Jun 24, 2026

0.6.83

Jun 24, 2026

0.6.82

Jun 24, 2026

0.6.81

Jun 24, 2026

0.6.80

Jun 24, 2026

0.6.79

Jun 24, 2026

0.6.78

Jun 24, 2026

0.6.77

Jun 24, 2026

0.6.76

Jun 24, 2026

0.6.75

Jun 24, 2026

0.6.74

Jun 24, 2026

0.6.73

Jun 24, 2026

0.6.72

Jun 24, 2026

0.6.71

Jun 24, 2026

0.6.70

Jun 24, 2026

0.6.69

Jun 24, 2026

0.6.68

Jun 24, 2026

0.6.67

Jun 23, 2026

This version

0.6.66

Jun 23, 2026

0.6.65

Jun 23, 2026

0.6.64

Jun 23, 2026

0.6.63

Jun 23, 2026

0.6.62

Jun 23, 2026

0.6.61

Jun 23, 2026

0.6.60

Jun 23, 2026

0.6.59

Jun 23, 2026

0.6.58

Jun 23, 2026

0.6.57

Jun 23, 2026

0.6.56

Jun 23, 2026

0.6.55

Jun 23, 2026

0.6.54

Jun 23, 2026

0.6.53

Jun 23, 2026

0.6.52

Jun 23, 2026

0.6.51

Jun 23, 2026

0.6.50

Jun 23, 2026

0.6.49

Jun 23, 2026

0.6.48

Jun 23, 2026

0.6.45

Jun 23, 2026

0.6.44

Jun 22, 2026

0.6.43

Jun 22, 2026

0.6.42

Jun 22, 2026

0.6.41

Jun 22, 2026

0.6.40

Jun 22, 2026

0.6.39

Jun 22, 2026

0.6.38

Jun 22, 2026

0.6.37

Jun 22, 2026

0.6.36

Jun 22, 2026

0.6.35

Jun 22, 2026

0.6.34

Jun 22, 2026

0.6.33

Jun 22, 2026

0.6.32

Jun 22, 2026

0.6.31

Jun 22, 2026

0.6.30

Jun 22, 2026

0.6.29

Jun 22, 2026

0.6.28

Jun 22, 2026

0.6.27

Jun 22, 2026

0.6.26

Jun 21, 2026

0.6.25

Jun 21, 2026

0.6.24

Jun 21, 2026

0.6.23

Jun 21, 2026

0.6.22

Jun 20, 2026

0.6.21

Jun 17, 2026

0.4.2

May 17, 2026

0.4.1

May 17, 2026

0.4.0

May 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vault_for_llm-0.6.66.tar.gz (493.5 kB view details)

Uploaded Jun 23, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vault_for_llm-0.6.66-py3-none-any.whl (312.4 kB view details)

Uploaded Jun 23, 2026 Python 3

File details

Details for the file vault_for_llm-0.6.66.tar.gz.

File metadata

Download URL: vault_for_llm-0.6.66.tar.gz
Upload date: Jun 23, 2026
Size: 493.5 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for vault_for_llm-0.6.66.tar.gz
Algorithm	Hash digest
SHA256	`f6e49c23ee88b9dd7c3972c9477b4c68736a1cb9fc6ef0e3a993e7b9d07cd1a0`
MD5	`680753f3d4f84187aef0e4119c8c795e`
BLAKE2b-256	`18239be790a68c8d7cfa55206d4e7689f2b04ee86af9167cf55c0d08a46b8400`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vault_for_llm-0.6.66.tar.gz:

Publisher: publish.yml on zycaskevin/Vault-for-LLM

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vault_for_llm-0.6.66.tar.gz
- Subject digest: f6e49c23ee88b9dd7c3972c9477b4c68736a1cb9fc6ef0e3a993e7b9d07cd1a0
- Sigstore transparency entry: 1932402186
- Sigstore integration time: Jun 23, 2026
Source repository:
- Permalink: zycaskevin/Vault-for-LLM@07e6fd6a6c04c2ba57c4074fc39b77131dc0e43a
- Branch / Tag: refs/tags/v0.6.66
- Owner: https://github.com/zycaskevin
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@07e6fd6a6c04c2ba57c4074fc39b77131dc0e43a
- Trigger Event: release

File details

Details for the file vault_for_llm-0.6.66-py3-none-any.whl.

File metadata

Download URL: vault_for_llm-0.6.66-py3-none-any.whl
Upload date: Jun 23, 2026
Size: 312.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for vault_for_llm-0.6.66-py3-none-any.whl
Algorithm	Hash digest
SHA256	`476ef729c4f5f2efa3b2c16faf459088f7a48d078a3ece2f250648c95ffec530`
MD5	`4a66f08900ca87596165c648d1659c54`
BLAKE2b-256	`a8a91eb934a40727bfb1d97a01cbc0c07199f9f5b17055ea17a2a0ff204d1796`

See more details on using hashes here.

Provenance

The following attestation bundles were made for vault_for_llm-0.6.66-py3-none-any.whl:

Publisher: publish.yml on zycaskevin/Vault-for-LLM

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: vault_for_llm-0.6.66-py3-none-any.whl
- Subject digest: 476ef729c4f5f2efa3b2c16faf459088f7a48d078a3ece2f250648c95ffec530
- Sigstore transparency entry: 1932402286
- Sigstore integration time: Jun 23, 2026
Source repository:
- Permalink: zycaskevin/Vault-for-LLM@07e6fd6a6c04c2ba57c4074fc39b77131dc0e43a
- Branch / Tag: refs/tags/v0.6.66
- Owner: https://github.com/zycaskevin
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@07e6fd6a6c04c2ba57c4074fc39b77131dc0e43a
- Trigger Event: release

vault-for-llm 0.6.66

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

Vault-for-LLM

Why It Exists

What You Get

When To Use It

Install

Agent-Driven Install

Manual Quickstart

Daily Agent Flow

Memory Model

Memory Maintenance Agents

Integrations

Optional Supabase Sharing

Obsidian

Retrieval Quality

Maturity

Documentation Map

Development

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance