agentvault-memory

Unified memory layer that consolidates history from all your AI coding agents — searchable by humans (Obsidian) and by AI (MCP).

These details have not been verified by PyPI

Project links

Project description

Unified memory layer for AI coding agents

Searchable by humans (Obsidian) and by AI (MCP)

Every conversation you have across Claude Code, OpenCode, Codex, Cursor — all siloed. Start a new session and your agent has zero context from any of them. AgentVault Memory fixes that.

The Problem

Developers now use 3-4 AI coding tools daily — Claude Code in one terminal, Cursor in the IDE, Codex for quick tasks, OpenCode for another project. Every decision, every debugging session, every architecture discussion happens in these conversations. Then the session ends and it's gone.

6 months of daily AI use = ~19.5 million tokens of conversations. That's every decision, every "we tried X and it failed because Y", every debugging session. All trapped in separate tools that don't talk to each other.

You open a new Claude Code session and ask "how did we handle auth?" — it has no idea. The answer is in a Cursor session from last month. Or a Codex session from last week. Or three different Claude Code sessions across two projects.

Why AgentVault Memory

There are many AI memory tools — MemPalace, Mem0, Zep, Letta, Pieces. None of them solve this specific problem.

	AgentVault Memory	MemPalace	Mem0	Zep	Pieces	Claude Auto Dream
Reads history from Claude Code	Yes	No	No	No	No	Yes
Reads history from Cursor	Yes	No	No	No	No	No
Reads history from Codex	Yes	No	No	No	No	No
Reads history from OpenCode	Yes	No	No	No	No	No
Cross-tool semantic search	Yes	—	—	—	—	No
Obsidian output	Yes	No	No	No	No	No
MCP server for AI querying	Yes	Yes	No	No	No	—
Zero API keys	Yes	Yes	No	No	No	Yes
Fully local	Yes	Yes	No	No	No	Yes
Free	Yes	Yes	Free tier	$25/mo+	$10/mo	Yes

The core difference: Other tools store what the AI decides to remember. AgentVault Memory reads the raw conversation history files that your tools already store on disk — every word, every decision, every debugging session — and makes it all searchable. Nothing is lost because nothing is summarized away.

Token Efficiency

The obvious question: if you have 19.5M tokens of history, how do you use it without blowing up the context window?

You don't load it. AgentVault Memory uses vector search — it only retrieves what's relevant to your question. And it optimizes every response to minimize tokens.

Approach	Tokens per search	Annual cost	What you lose
Paste everything into context	19.5M (impossible)	Doesn't fit	—
LLM summarization (Mem0, etc.)	~650K	~$507	Nuance, exact quotes, reasoning
AgentVault Memory (full search)	~800	$0	Nothing
AgentVault Memory (lite search)	~200	$0	Nothing — summaries first, full on demand

Built-in Token Optimizations

Every MCP search response goes through 5 optimizations before reaching your AI:

Optimization	What It Does	Token Savings
Summary-first search	`vault_search_lite` returns one-line summaries, not full content. AI fetches full content only for what it needs	~80%
Tool noise stripping	Removes `[Used tools: Read]`, `[Tools used: Edit]` artifacts	10-15%
Code block truncation	Long code blocks trimmed to 4 lines + `(truncated)`	20-30%
Result deduplication	Near-identical results from the same session are merged	15-20%
Compact metadata	One-line format instead of 4 separate lines per result	75%

How It Works

You: "How did we handle rate limiting?"
      │
      ▼ AI calls vault_search_lite (summaries only, ~200 tokens)
      │
      ▼ Sees: "#1 78% — my-saas-app | claude-code | 2026-03-15"
      │        "    Implemented rate limiting using upstash..."
      │
      ▼ AI calls vault_search for full details on result #1 (~300 tokens)
      │
      ▼ Total: ~500 tokens (vs ~1,500 without optimization)

10 searches in a session = ~5,000 tokens. That's 97% of a 200K context window still free for actual work.

Everything runs locally — ChromaDB + HNSW index on your machine, embedding model (~80MB, downloaded once), no data ever leaves your machine.

How It Works

Terminal 1: Claude Code (project A)  ─┐
Terminal 2: Claude Code (project B)  ─┤
Terminal 3: OpenCode                 ─┼──→ AgentVault Memory ──→ ChromaDB (AI search)
Terminal 4: Codex                    ─┤                └──→ Obsidian (you browse)
IDE: Cursor                          ─┘

New session starts → MCP tools → semantic search → relevant context returned

No context window bloat — loads ~250 tokens on startup, searches on demand (~500-2000 tokens per query)
Fast — local ChromaDB with HNSW index, ~30-50ms per search
Private — everything stays on your machine, zero API calls, zero cloud
Obsidian-native — browsable markdown files with frontmatter, daily digests, per-project folders
Auto-save — new Claude Code sessions are automatically ingested via Stop hook

Quick Start

# Install
pip install agentvault-memory

# Initialize — auto-detects AI tools, Obsidian, installs MCP + auto-save hook
agentvault init

# One-time bulk import of all history
agentvault ingest

# Search anything
agentvault search "why did we switch to GraphQL"
agentvault search "auth bug" --project my-saas-app
agentvault search "rate limiting" --source claude-code

# View decisions extracted from your conversations
agentvault decisions
agentvault decisions --project my-saas-app

# Incremental sync — only new sessions since last run
agentvault sync

# Delete data you don't want
agentvault forget --project old-project
agentvault forget --source cursor

# Export your data
agentvault export backup.json
agentvault export report.md --format markdown --project my-saas-app

# Check status (with per-tool and per-project breakdown)
agentvault status

That's it. Two commands to set up, then it runs automatically.

What `init` does

Detects AI tools — scans for Claude Code, OpenCode, Codex, Cursor history
Auto-detects Obsidian — finds your vault by looking for .obsidian/ in common locations
Installs MCP server — for every detected tool that supports MCP (Claude Code, Cursor, OpenCode)
Installs auto-save hook — Claude Code Stop hook that ingests new sessions automatically

No manual --obsidian flag or mcp-install needed. Everything is auto-detected.

What Gets Indexed

From each session, AgentVault Memory extracts:

Field	Source
Conversations	User messages + AI responses
Project	Working directory
Git branch	Active branch during session
Files touched	Files read/edited/written by the AI
Timestamps	Session start/end, per-message
Tool usage	Which tools the AI called

Secrets (API keys, tokens, passwords, private keys, connection strings) are automatically redacted before storage.

MCP Tools

After init, your AI tools have these search tools available via MCP:

Tool	What It Does	Tokens
`vault_search_lite`	Start here — returns one-line summaries, not full content	~200
`vault_search`	Full semantic search with project/source/branch filters	~800
`vault_project_context`	"What have I done on project X recently?"	~800
`vault_cross_reference`	"Did I solve this problem before in another project?"	~800
`vault_decisions`	"What decisions did I make about auth?"	~500
`vault_status`	Overview of indexed sessions and projects	~100

Your AI calls these automatically when you ask questions like:

"Remember that auth bug we fixed last week?"
"How did we handle rate limiting in the other project?"
"What decisions did I make about the database?"

Obsidian Integration

If an Obsidian vault is detected (or provided via --obsidian), AgentVault Memory writes browsable markdown:

obsidian-vault/
  agent-history/
    2026-04-09.md                      ← daily digest
    my-saas-app/
      2026-04-09-4f66f16f.md           ← session transcript
    my-api-server/
      2026-04-08-aa20d038.md

Each session file has YAML frontmatter (source, project, date, branch, tags) — searchable and linkable in Obsidian. No Obsidian? No problem — it's optional. ChromaDB search works without it.

Supported Tools

Tool	History	MCP	Auto-Save Hook
Claude Code	`~/.claude/projects/` (JSONL)	Yes	Yes
OpenCode	`~/.local/state/opencode/` (JSONL)	Yes	—
Codex (OpenAI)	`~/.codex/sessions/` (JSONL)	—	—
Cursor	`~/Library/Application Support/Cursor/` (SQLite)	Yes	—
ChatGPT	Planned (manual export)	—	—

Auto-Save (Future Sessions)

After init, a Claude Code Stop hook is installed that runs agentvault ingest --source claude-code after every session ends. New conversations are automatically indexed — no manual ingest needed.

For other tools, run agentvault ingest periodically or after significant work.

Session Summaries

Every session is auto-summarized during ingestion using keyword extraction (no LLM needed). Summaries appear in Obsidian files and daily digests, making it easy to scan what each session was about without reading the full transcript.

Decision Log

AgentVault Memory automatically extracts decisions from your conversations — "decided to use X", "chose Y over Z", "switching to W". These are surfaced via:

CLI: agentvault decisions — view all extracted decisions, filter by project
MCP: vault_decisions tool — your AI can query past decisions mid-session
Obsidian: ## Key Decisions section added to each session file

Forget (Data Control)

Delete any data you don't want in the vault:

agentvault forget --session <id>     # one session
agentvault forget --project old-app  # all sessions for a project
agentvault forget --source cursor    # all sessions from a tool
agentvault forget --all              # wipe everything (with confirmation)

Adding a New Adapter

Each adapter is one file implementing 3 methods:

from agentvault.adapters.base import BaseAdapter

class MyToolAdapter(BaseAdapter):
    name = "my-tool"
    description = "My AI tool"

    def default_history_path(self) -> Path:
        return Path.home() / ".my-tool" / "history"

    def detect(self) -> bool:
        return self.history_path.exists()

    def discover_sessions(self) -> list[Path]:
        return list(self.history_path.glob("*.json"))

    def parse_session(self, path: Path) -> AgentSession | None:
        # Convert native format → AgentSession schema
        ...

See agentvault/adapters/claude_code.py for a full example.

Architecture

agentvault/
  cli.py                  ← CLI (click + rich)
  config.py               ← ~/.agentvault/config.json
  mcp_server.py           ← MCP server (stdio transport)
  core/
    schema.py             ← AgentSession, Exchange, Chunk
    store.py              ← ChromaDB wrapper
    ingester.py           ← Session → chunks (with secret redaction)
    redactor.py           ← Secret pattern detection (15 patterns)
  adapters/
    base.py               ← BaseAdapter interface
    claude_code.py        ← Claude Code JSONL parser
    opencode.py           ← OpenCode prompt history parser
    codex.py              ← Codex event-driven JSONL parser
    cursor.py             ← Cursor SQLite DB parser
  writers/
    obsidian.py           ← Markdown + daily digests
    chromadb_writer.py    ← Batch ingestion + dedup

Security

Secret redaction (API keys, tokens, passwords, private keys, connection strings) on all content before storage
MCP server input validation (query length limits, top_k capped at 50, type checking)
Path traversal protection in Obsidian writer
ChromaDB + vault directories set to 0700 (owner-only)
Obsidian files written with 0600 permissions
Atomic config file writes with backups
No telemetry, no cloud, no data leaves your machine

Requirements

Python 3.9+
No API keys
No Docker
No internet after install (embedding model downloaded once, ~80MB)

Inspiration

Inspired by MemPalace — which proved that raw ChromaDB retrieval achieves 96.6% recall on LongMemEval with zero API calls. AgentVault Memory applies this to the specific problem of multi-agent session consolidation for developers.

License

MIT

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.6.0

Apr 14, 2026

This version

0.5.2

Apr 13, 2026

0.5.1

Apr 13, 2026

0.5.0

Apr 13, 2026

0.4.1

Apr 13, 2026

0.4.0

Apr 13, 2026

0.3.2

Apr 10, 2026

0.3.1

Apr 9, 2026

0.3.0

Apr 9, 2026

0.2.1

Apr 9, 2026

0.2.0

Apr 9, 2026

0.1.0

Apr 9, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentvault_memory-0.5.2.tar.gz (317.9 kB view details)

Uploaded Apr 13, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentvault_memory-0.5.2-py3-none-any.whl (42.5 kB view details)

Uploaded Apr 13, 2026 Python 3

File details

Details for the file agentvault_memory-0.5.2.tar.gz.

File metadata

Download URL: agentvault_memory-0.5.2.tar.gz
Upload date: Apr 13, 2026
Size: 317.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agentvault_memory-0.5.2.tar.gz
Algorithm	Hash digest
SHA256	`c6f76d69c83241c089153ae7337ea614fc969c2afde345cd3d988c6ce21468a0`
MD5	`81ee91575d8b08f68d28e2b48f2911c0`
BLAKE2b-256	`865481512c9ff4c7329a36f72f635720e8f84bb1762891a18f704ed1a5e80398`

See more details on using hashes here.

File details

Details for the file agentvault_memory-0.5.2-py3-none-any.whl.

File metadata

Download URL: agentvault_memory-0.5.2-py3-none-any.whl
Upload date: Apr 13, 2026
Size: 42.5 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agentvault_memory-0.5.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8d75eddc3e86ef7e033eb04c0c0db6a5e568f2fc6fbb4c264621f6141cfc541e`
MD5	`45ace2bd1e65c5c8231df8f412563ce1`
BLAKE2b-256	`0d9b226e1adc74f5aa78a67badd7ba63abba4db10c13c9b039169f48db1fbf7d`

See more details on using hashes here.

agentvault-memory 0.5.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Unified memory layer for AI coding agents

The Problem

Why AgentVault Memory

Token Efficiency

Built-in Token Optimizations

How It Works

How It Works

Quick Start

What init does

What Gets Indexed

MCP Tools

Obsidian Integration

Supported Tools

Auto-Save (Future Sessions)

Session Summaries

Decision Log

Forget (Data Control)

Adding a New Adapter

Architecture

Security

Requirements

Inspiration

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

What `init` does