Local-first agent memory: an Obsidian markdown vault as source of truth, with a rebuildable DuckDB index.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

cfigueir

These details have not been verified by PyPI

Project links

Homepage

Project description

🪨 agentcairn

Local-first memory for AI agents — that you can actually read, edit, and own.

cairn /kɛən/ · noun — a stack of stones raised to mark a trail or a place worth remembering, left for whoever comes next.

agentcairn gives your coding agent durable, high-quality memory — but instead of locking it in an opaque database or a cloud service, your memories live as plain Markdown in an Obsidian vault you own. A fast, rebuildable DuckDB index sits on top for retrieval. Open your vault, read what the agent remembered, fix a wrong fact by hand, or drop in your own notes — and the agent picks it all up.

Why agentcairn is different

Most agent-memory systems make a database or cloud store the source of truth and treat files (if any) as a one-way export. agentcairn inverts that:

📂 Your vault is the source of truth — not an export. Memory is human-readable Markdown with frontmatter and [[wikilinks]]. Edit it in Obsidian; the index honors your edits.
♻️ The index is disposable. DuckDB is a rebuildable cache (cairn reindex). Your memory survives a model upgrade, a corrupted index, a schema change, or uninstalling the tool — zero data loss, because the truth is just files on disk.
🧠 Non-lossy by construction. The full note is always retained. Distillation only adds derived notes that link back to the source — it never silently drops facts it didn't think to extract at write time.
🔒 Redaction before every write. Secrets are scrubbed (regex + entropy + URL-credential detection) before anything — body, title, or tags — reaches the plaintext vault. We write files you can read, so we treat a leaked credential as the worst failure mode.
🕸️ A free, deterministic knowledge graph. Your [[wikilinks]] and frontmatter are the graph — no LLM extraction, no hallucinated entities.
🪶 Daemonless, zero external DB. One embedded DuckDB file does semantic vector search, BM25 full-text, and graph traversal. No always-on server, no Neo4j/Postgres/Qdrant, no required cloud key — just a cairn CLI and an on-demand MCP server.
🔍 Honestly measured. A reproducible LongMemEval-S + LoCoMo harness ships in benchmarks/ — with real numbers, ablations, and explicit caveats instead of one cherry-picked headline (see below).

How it works

flowchart LR
    T["Session transcripts<br/>(out-of-band)"]
    H["You · Obsidian<br/>(hand edits)"]
    V["📂 Obsidian vault<br/>Markdown + frontmatter + wikilinks<br/><b>source of truth</b>"]
    I["♻️ DuckDB index<br/>vector + BM25 + graph<br/><b>rebuildable cache</b>"]
    M["MCP tools<br/>remember · recall · search · build_context · recent"]

    T -- "redact → dedup → distill" --> V
    H -- "edit" --> V
    V -- "parse / reconcile-on-spawn" --> I
    I -- "READ_ONLY hybrid recall" --> M
    M -. "remember (redacted write)" .-> V

    classDef truth fill:#eaf1ff,stroke:#317cff,color:#191919;
    classDef cache fill:#f5f5f3,stroke:#999999,color:#191919;
    class V truth
    class I cache

Capture reads your agent harness's session transcripts (append-only, already on disk) out-of-band — robust by design, with no fragile live hooks — then redacts → dedups → importance-gates → distills into the vault, non-lossily. Plus an agent-driven remember tool for curated, high-value memories.
Retrieval fuses BM25 + semantic vectors with Reciprocal Rank Fusion, applies an optional graph-boost, and degrades gracefully down to keyword-only when no embedding model is available — so recall is never silently dead. An optional cross-encoder reranker adds precision.
Hybrid intelligence: offline local embeddings (FastEmbed / nomic-embed-text-v1.5 by default) out of the box — strong on its own and in the hybrid fusion (with nomic, vector-only edges out BM25 even on short turns; see the benchmark). Set CAIRN_EMBED_MODEL to pick another FastEmbed model, or run CAIRN_EMBEDDER=ollama / a cloud tier to go further.
Temporal memory: notes may carry valid_from/valid_until/superseded_by frontmatter. Recall is validity-aware — it soft-demotes superseded and expired facts (the current fact wins) without ever hiding them (non-lossy), and annotates each result's status (current/superseded/expired/not_yet_valid) plus an as_of anchor so the agent can reason over time. Inert for notes with no validity fields.

CLI

uvx agentcairn                                       # on-demand MCP server for your agent harness
cairn ingest --vault ~/vault                         # distill recent agent sessions into the vault
cairn sweep  --vault ~/vault                          # ingest + reindex in one pass (cron-friendly)
cairn recall "how did we fix the auth bug?"          # hybrid recall from the CLI
cairn reindex ~/vault                                # rebuild the index from Markdown (always safe)
cairn doctor                                         # health-check the index

Honestly measured

We benchmark agentcairn the way we'd want a memory system measured — reproducibly, with ablations, and without a single cherry-picked headline number. The harness (benchmarks/) runs LongMemEval-S and LoCoMo through a version-pinned downloader (datasets are never vendored), scores retrieval deterministically (recall/nDCG@k, MRR — no API key needed, runs in CI on a synthetic fixture), and offers an opt-in LLM-judged QA layer.

Retrieval ablation on the full LoCoMo set (turn-level, macro-avg, FastEmbed nomic-embed-text-v1.5 — the default):

arm	recall@5	recall@10	MRR
BM25 only	0.527	0.604	0.459
vector only	0.536	0.637	0.433
hybrid (RRF)	0.562	0.648	0.477
hybrid + graph-boost	0.562	0.648	0.477
hybrid + reranker	0.662	0.735	0.608

What we read from this — and say out loud:

Hybrid beats either arm alone — RRF fusion is worth it.
The cross-encoder reranker is the biggest lever (+0.10 recall@5 over hybrid); the "ms-marco domain-shift might hurt" worry didn't materialize on conversational data.
The embedder default now pulls its weight — with nomic, vector-only edges out BM25 (0.536 vs 0.527); switching from the old bge-small default (which trailed at 0.483) closed the gap. A 5-model FastEmbed sweep settled the pick — nomic (768-d) wins on quality-per-dim; bigger 1024-d models don't beat it. Full table: benchmarks/README.md.
graph-boost is inert on these corpora — LoCoMo/LongMemEval have no native [[wikilink]] graph, so the boost has nothing to fire on. It's for real interlinked vaults, not chat logs, and we don't pretend otherwise.

LongMemEval-S (50-instance sample) is an easier retrieval task with well-separated evidence sessions. At session level (the granularity prior work reports) retrieval is essentially perfect — recall@5 = 1.00 for every arm (hybrid+reranker nDCG@10 0.993 / MRR 0.990); at the finer turn level, hybrid+reranker reaches 0.96 recall@5. Two caveats we say out loud: session-recall@5 saturates at 1.0 here (even BM25 hits it), so it isn't a discriminating metric on this corpus; and it's a 10% sample — comparable for relative signal, not a leaderboard claim.

Context efficiency. On LongMemEval-S's ~136k-token sessions, agentcairn answers from the ~2,500 tokens it recalls (top-10) rather than the full history — a ~55× reduction in what the model has to read (estimate, ~4 chars/token; 20-query sample). It measures context size, independent of retrieval quality.

QA-accuracy numbers (LLM-judged) are available too, but use an Anthropic judge rather than the papers' GPT-4o, so they are not comparable to published leaderboards — valid for relative ablation signal only. See benchmarks/README.md for how to run it and how to read the numbers.

Roadmap

v1 — done. The core loop: transcript ingestion → redaction → Markdown → rebuildable DuckDB index → hybrid recall; MCP server + CLI; secret redaction; local embeddings; reproducible benchmark harness.
v1.1 — next, prioritized by the benchmark above:
- ✅ Reranker on by default — the largest measured retrieval lever; CAIRN_RERANK=0 to disable. (shipped)
- Ollama embedding tier — ✅ local models via CAIRN_EMBEDDER=ollama (CAIRN_EMBED_MODEL/OLLAMA_HOST); cloud (OpenAI/Voyage) still pending.
- ✅ Bi-temporal validity — frontmatter valid_from/valid_until/superseded_by; recall soft-demotes superseded/expired facts (non-lossy — never hidden) and annotates each result's currency + an as_of anchor, so the current fact wins and the agent can reason over time. (shipped)
- In-memory HNSW for large-vault retrieval latency.
v2 — Obsidian plugin surface, MotherDuck cloud sync, optional LLM entity extraction.

Prior art & thanks

Learning from basic-memory (Markdown-as-memory + rebuildable index), Simon Späti's Obsidian RAG on DuckDB, and DuckDB's VSS + FTS extensions. Benchmarks use LongMemEval (MIT) and LoCoMo (CC BY-NC 4.0).

Development

agentcairn uses uv exclusively for dependency management and tooling.

Do not use pip, poetry, or global virtual environments.

# First-time setup
uv sync                         # create .venv and install all deps (including dev)
uv run pre-commit install       # install git hooks (ruff + pytest run on every commit)

# Daily use
uv run pytest                   # run the test suite
uv run cairn --help             # run the CLI
uvx agentcairn                  # run the installed tool ephemerally (as the MCP server does)

# Formatting and linting
uv run ruff format .            # format all Python files
uv run ruff check --fix .       # lint with auto-fix
uv run pre-commit run --all-files

# Benchmarks (offline retrieval metrics need no API key)
uv run pytest benchmarks/tests/                                      # offline synthetic-fixture suite
PYTHONPATH=benchmarks uv run --group bench python -m cairn_bench.run --dataset locomo

The MCP server is launched via uvx agentcairn — no global install required.

License

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

cfigueir

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.9.5

Jun 13, 2026

0.9.4

Jun 12, 2026

0.9.3

Jun 12, 2026

0.9.2

Jun 12, 2026

0.9.1

Jun 12, 2026

0.9.0

Jun 12, 2026

0.8.0

Jun 12, 2026

0.7.2

Jun 12, 2026

0.7.1

Jun 12, 2026

0.7.0

Jun 11, 2026

0.6.2

Jun 11, 2026

0.6.1

Jun 11, 2026

0.6.0

Jun 11, 2026

0.5.0

Jun 11, 2026

0.4.0

Jun 11, 2026

This version

0.3.0

Jun 11, 2026

0.2.0

Jun 10, 2026

0.1.0

Jun 10, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

agentcairn-0.3.0.tar.gz (471.9 kB view details)

Uploaded Jun 11, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

agentcairn-0.3.0-py3-none-any.whl (56.7 kB view details)

Uploaded Jun 11, 2026 Python 3

File details

Details for the file agentcairn-0.3.0.tar.gz.

File metadata

Download URL: agentcairn-0.3.0.tar.gz
Upload date: Jun 11, 2026
Size: 471.9 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agentcairn-0.3.0.tar.gz
Algorithm	Hash digest
SHA256	`372e7a43288127da82c0bf2b4803418deb3514bfb17ef6ed9106a009f78bd9b2`
MD5	`a2910506ff2133fa3698592a7144396c`
BLAKE2b-256	`aa411228cfd803cac6b72aad85d2d6ac5d7b4ed657e237f1ca0cefa3abf5892c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agentcairn-0.3.0.tar.gz:

Publisher: release.yml on ccf/agentcairn

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agentcairn-0.3.0.tar.gz
- Subject digest: 372e7a43288127da82c0bf2b4803418deb3514bfb17ef6ed9106a009f78bd9b2
- Sigstore transparency entry: 1785619393
- Sigstore integration time: Jun 11, 2026
Source repository:
- Permalink: ccf/agentcairn@f5b805900dab672056737874a172d4b4a4b8146d
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/ccf
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@f5b805900dab672056737874a172d4b4a4b8146d
- Trigger Event: push

File details

Details for the file agentcairn-0.3.0-py3-none-any.whl.

File metadata

Download URL: agentcairn-0.3.0-py3-none-any.whl
Upload date: Jun 11, 2026
Size: 56.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for agentcairn-0.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`1f574e276f14fbda1a3ef0da56fad9c8fa37cb3ef48ea725f4712c1826fa68bc`
MD5	`ee5ebcdf1236c69a7a23eba6f594b295`
BLAKE2b-256	`e985a704dc7fa40226440e445f79ef0b063fb1ff9a11b88f48f0e7f9a7932d24`

See more details on using hashes here.

Provenance

The following attestation bundles were made for agentcairn-0.3.0-py3-none-any.whl:

Publisher: release.yml on ccf/agentcairn

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: agentcairn-0.3.0-py3-none-any.whl
- Subject digest: 1f574e276f14fbda1a3ef0da56fad9c8fa37cb3ef48ea725f4712c1826fa68bc
- Sigstore transparency entry: 1785619525
- Sigstore integration time: Jun 11, 2026
Source repository:
- Permalink: ccf/agentcairn@f5b805900dab672056737874a172d4b4a4b8146d
- Branch / Tag: refs/tags/v0.3.0
- Owner: https://github.com/ccf
- Access: private
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@f5b805900dab672056737874a172d4b4a4b8146d
- Trigger Event: push

agentcairn 0.3.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

🪨 agentcairn

Why agentcairn is different

How it works

CLI

Honestly measured

Roadmap

Prior art & thanks

Development

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance