Local-first documentation graph for AI agents. CodeGraph for docs, exposed through MCP.

These details have not been verified by PyPI

Project links

Project description

Cairn

The DocsGraph for AI agents. CodeGraph helps agents navigate code; Cairn helps them navigate docs. Install it as docsgraph; keep the cairn name for the product and compatibility alias.

Cairn demo: repository documentation graph and MCP tools

Cairn is a local-first, MCP-native DocsGraph for software repositories and large structured documents. It turns README files, specs, ADRs, docs folders, PDFs, and optional MarkItDown-converted Office/data/web files into a navigable map: document catalog, hierarchical sections, multi-granularity summaries, entity mentions, cross-reference edges, and a semantic vector overlay.

Instead of dumping whole docs into context or relying on anonymous chunks, an agent can ask Cairn to list_documents, search_documents, inspect an outline, and drill into exact sections with stable cairn:// anchors. The same engine also works for standalone handbooks, papers, and PDFs.

The result: better retrieval accuracy, lower token spend, and a practical MCP tool layer between your project documentation and every AI coding agent you use. Local-first. Vendor-neutral. Designed for open-source repos.

🚀 Alpha — 0.1.0a2. Markdown + PDF ingest, all eight MCP tools, the full structure-aware index (tree + summaries + entities + xrefs + vectors), repo-level init/sync/status, repo-scoped MCP with list_documents, search_documents, repo_context, repo_graph, and repo_impact, failure-isolated sync, static graph inspector, Doubao multimodal embeddings, and a benchmark harness with headline numbers. See CHANGELOG.md for what's in this release and ROADMAP.md for what's next.

Why Cairn?

Today	With Cairn
AI coding agents guess from README snippets or grep.	Agent gets a repo-level documentation map with stable section anchors.
Dump the whole document into context. Burns tokens, dilutes attention.	Agent fetches only what it needs, at the granularity it needs.
Naive RAG splits structure into context-free chunks.	The document's own structure is the index.
Cross-references and entities are lost in chunking.	They are first-class objects.
Locked into one vendor's embeddings / vector DB.	Pluggable everything. Local-first defaults.
Different tool stacks for Claude / Cursor / Cline / Goose.	One MCP server. Any compliant agent works.

For the in-depth motivation, see PRODUCT.md. For the technical design, see ARCHITECTURE.md. For the public documentation quality contract Cairn optimizes for, see docs/golden-docs-standard.md.

How It Works (90 seconds)

Discover. docsgraph init -y writes .cairn/config.toml; docsgraph sync discovers README, Markdown docs, ADRs, specs, and PDFs from conservative repo globs.
Index. Each document becomes a normal Cairn index: structural tree (T), multi-level summaries (S), entity index (E), cross-reference graph (X), and vector overlay (V). A bad source file is isolated instead of breaking the whole repo sync.
Serve. docsgraph serve exposes repo-scoped MCP tools: list_documents, search_documents, plus outline, get_section, expand, search_semantic, search_keyword, find_mentions, get_related, and read_range routed by optional doc.
Navigate. Your agent searches across the repo, picks a document, drills into promising sections, and only fetches full text when justified. Every result carries stable anchors for verification.

A visual explainer comparing Cairn's approach to RAPTOR, BookRAG, and A-RAG lives at docs/canvas.html. Open it in any browser.

Quickstart

The fastest way to see Cairn work is to index this repo's own documentation. Zero API keys, zero model downloads — the --fake flag uses deterministic in-process plugins so the whole thing runs offline.

The PyPI distribution is docsgraph; the primary CLI command is docsgraph. The older cairn command is installed as a compatibility alias:

pip install docsgraph

Or run it without installing:

uvx docsgraph --help

Repository Workflow

Inside any repository:

docsgraph init -y
docsgraph sync --fake
docsgraph status
docsgraph query repo "where are docs indexed?" --fake
docsgraph doctor
docsgraph mcp config --client claude --fake
docsgraph serve --fake

docsgraph doctor checks repo config, index freshness, primary-doc routing, and model settings. docsgraph mcp config prints copy-pasteable stdio snippets for Claude, Cursor, Codex, and Goose:

docsgraph mcp config --client claude
docsgraph mcp config --client cursor
docsgraph mcp config --client codex
docsgraph mcp config --client goose

For local development from source:

git clone https://github.com/jokeuncle/cairn.git
cd cairn

python3.11 -m venv .venv
.venv/bin/pip install -e ".[dev]"

# 1. Create .cairn/config.toml with conservative documentation globs.
.venv/bin/docsgraph init -y

# 2. Index README, Markdown docs, and PDFs.
.venv/bin/docsgraph sync --fake

# 3. Inspect freshness and indexed document ids.
.venv/bin/docsgraph status

# 4. Search across all indexed repository docs.
.venv/bin/docsgraph query repo "where are docs indexed?" --fake

# 5. Start the repo-scoped MCP stdio server for Claude Code / Cursor / Cline / Goose.
.venv/bin/docsgraph serve --fake

Repo mode writes a shareable config plus ignored runtime data:

.cairn/
  config.toml       # commit this if you want a stable repo docs policy
  manifest.json     # generated
  documents/        # generated per-document Cairn indexes
    readme/
    architecture/
    docs-specs-mcp-tools/

Repo-scoped MCP adds:

Tool	Use it for
`list_documents`	See every indexed doc, its source path, freshness, and section count.
`search_documents`	Search across all indexed docs and get globally ranked, explainable section hits with `doc` ids, skipped docs, and stale-doc warnings.
`repo_context`	Get a ready-to-read context pack: ranked hits, selected section text, hit explanations, and a relationship map.
`repo_graph`	Inspect the repo documentation graph: document, section, entity, contains, xref, and mention edges. Cross-document links are exposed through shared entity nodes.
`repo_impact`	Estimate documentation surfaces affected by a document or section change.
normal Cairn tools + `doc`	Drill into a chosen document with `outline`, `get_section`, `search_semantic`, `get_related`, etc.

Repo behavior is intentionally configurable in .cairn/config.toml:

Setting	Default	Impact
`include`	README, top-level Markdown/PDF, `docs/**` Markdown/PDF, one-level nested README	Expands or narrows what Cairn treats as repository documentation. Broader globs improve coverage but can index noisy generated files.
`exclude`	`.git`, `.cairn`, `.codegraph`, caches, virtualenvs, build output, `node_modules`	Keeps generated or tool-owned docs out of search. Simple `name/**` directory excludes match at any depth, so `frontend/node_modules/...` and `apps/web/dist/...` are skipped. Add project-specific generated doc folders here.
`enable_markitdown`	`false`	Enables non-Markdown/PDF conversion when the `markitdown` extra is installed. Useful for DOCX/PPTX/XLSX/HTML-heavy repos, slower and less deterministic than native Markdown/PDF parsing.
`primary_doc`	`readme`	Chooses the default document for normal tools when `doc` is omitted in repo mode.
`search_sections_per_doc`	`1`	Default diversity for `search_documents`. `1` helps agents find the right doc first; raise it when a repo has a few long docs and you want deeper hits from each doc by default.
`preferred_locales`	`[]`	Optional locale preference for repo search, for example `["en"]` or `["zh"]`. When omitted, English queries prefer English or locale-neutral docs without hiding other languages.

MarkItDown integration is local-file only and optional. Cairn uses it as a conversion layer, then feeds the generated Markdown into the same canonical Markdown parser. This expands coverage to formats such as DOCX, PPTX, XLSX, HTML, CSV, JSON, XML, and EPUB without making the base install heavy:

.venv/bin/pip install -e ".[markitdown]"
.venv/bin/docsgraph init -y --force --markitdown
.venv/bin/docsgraph sync --fake

Generate a standalone graph inspector for the primary repo doc:

docsgraph inspect --out /tmp/cairn-repo-inspector.html

Single Document Workflow

Cairn still works as a focused index for one large document:

# Index Cairn's own architecture document.
.venv/bin/docsgraph index ARCHITECTURE.md --out /tmp/cairn-arch --fake

# Get the map — gists only, never full text.
.venv/bin/docsgraph outline /tmp/cairn-arch --depth 2

# Keyword search: every section that mentions "LanceDB".
.venv/bin/docsgraph query keyword /tmp/cairn-arch LanceDB

# Multi-term keyword search with mode=all.
.venv/bin/docsgraph query keyword /tmp/cairn-arch progressive disclosure --mode all

# Generate a standalone graph inspector for the built index.
.venv/bin/docsgraph inspect /tmp/cairn-arch --out /tmp/cairn-arch/inspector.html

# Start a single-document MCP stdio server.
.venv/bin/docsgraph serve /tmp/cairn-arch --fake

A walkthrough with full output and an MCP-client config snippet is in examples/hero-demo.md.

Benchmarks

Cairn ships with cairn-bench, a small framework that compares Cairn against a naive 512-word-chunk vector-RAG baseline (both backed by LanceDB and the same embedder, so the comparison is apples-to-apples).

Running the starter suite (10 hand-curated questions over Cairn's own ARCHITECTURE.md) with deterministic in-process plugins:

docsgraph bench benchmarks/architecture.toml --fake

metric	naive vector RAG	Cairn
mean recall@8	25%	25%
mean tokens returned	3,670	1,388 (37.8% of naive)

Caveat — these numbers come from the deterministic FakeEmbedder (a bag-of-words hash with no semantic understanding). Recall ties because neither system has semantics; the 2.6× token efficiency win is independent of the embedder: it comes from progressive disclosure and section-aware retrieval, not from vector quality. Cairn now returns a short evidence snippet with every semantic hit by default, which raises the token count but makes ranking errors easier to inspect. Reproduce these numbers in under a second on any machine — and re-run with Ollama (nomic-embed-text) or Doubao for the real-semantics version. See benchmarks/README.md for caveats and how to author your own suites.

Repo-level smoke tests are also public and reproducible:

python scripts/eval_repos.py --repo all --refresh --strict
python scripts/smoke_many_repos.py --limit 37 --strict

The labeled eval set covers astral-sh/uv, pydantic/pydantic-ai, modelcontextprotocol/python-sdk, and fastapi/full-stack-fastapi-template. The broad smoke matrix currently spans 37 public repositories across Python, JavaScript/TypeScript, Rust, and Go ecosystems. It is not an accuracy leaderboard; it verifies clone/discovery/sync/search/drilldown robustness and latency across different documentation shapes.

Latest fake-plugin runs on this machine:

suite	result
`pydantic-ai` labeled eval	178/178 docs indexed, 8/8 top1, 8/8 top5, 8/8 drilldown
`uv` labeled eval	89/89 docs indexed, 15/16 top1, 16/16 top3/top5, 16/16 drilldown
`mcp-python-sdk` labeled eval	17/17 docs indexed, 4/4 top1, 4/4 drilldown
`fastapi-template` labeled eval	7/7 docs indexed, 4/4 top1, 4/4 drilldown
37-repo smoke matrix	2931 docs indexed, 0 sync failures, 185/185 searches with hits, 185/185 drilldowns

search_documents uses a general hybrid ranker: dense vector similarity, BM25-style sparse evidence, structure-aware field support, weighted query-term coverage, path/title identity prior, and local graph-neighborhood propagation. Repo search builds a process-local cache and scores dense vectors in batches so large documentation sets stay warm-query friendly. On large section sets it uses a two-stage path: dense seeds, cheap lexical/path seeds, and graph neighbors form a wide shortlist, then the full BM25/graph/explanation ranker scores only that candidate set. Cold cache construction loads per-document indexes concurrently while preserving per-document failure isolation. Search responses expose ranker.mode, total_sections, and scored_sections so the performance path is visible to clients and benchmarks. Each hit includes a score breakdown and short explanation so agents and humans can see whether dense, lexical, sparse, or graph evidence dominated the result. Changelog, release-note, and migration-history documents are intent-gated: they stay first-class results for release/version/change queries, but broad topic queries prefer guides, API docs, and README-style docs when comparable evidence exists. Search candidates are freshness-aware: repo status records a file-level fingerprint, and query responses expose stale_documents when source files have changed since the last sync. repo_context composes search, section content, and local relationships into one agent-ready payload; repo_graph and repo_impact expose the documentation graph without reimplementing source-code analysis. Pair Cairn with CodeGraph when you need AST symbols, callers/callees, or code impact. The ranker does not special-case repository names, document ids, or benchmark answers.

Real LLM + real embeddings

The --fake plugins are great for offline reproducibility but they have no semantic understanding. For production indexing, point Cairn at any OpenAI-compatible endpoint. The defaults target a local Ollama so you keep the local-first promise without paying for API tokens:

ollama serve
ollama pull llama3.2:3b
ollama pull nomic-embed-text

.venv/bin/docsgraph index ARCHITECTURE.md --out /tmp/cairn-arch   # no --fake

OpenAI, vLLM, Together, Anyscale, …all of them work the same way; override CAIRN_LLM_* and CAIRN_EMBED_* environment variables.

For Doubao's vision embedding model, use the dedicated provider because the model is served through Volcengine's /embeddings/multimodal endpoint:

export CAIRN_LLM_BASE_URL=https://ark.cn-beijing.volces.com/api/v3
export CAIRN_LLM_MODEL=doubao-seed-2-0-code-preview-260215
export CAIRN_LLM_API_KEY=...

export CAIRN_EMBED_PROVIDER=doubao-vision
export CAIRN_EMBED_MODEL=doubao-embedding-vision-251215
export CAIRN_EMBED_API_KEY=...

docsgraph index ARCHITECTURE.md --out /tmp/cairn-arch

To run the public-repo eval with the real provider configured by your environment instead of the deterministic fake plugins:

python scripts/eval_repos.py --repo pydantic-ai \
  --provider env \
  --workdir /tmp/cairn-repo-eval-real \
  --refresh

The eval report includes provider mode, model names, and vector dimension, but never prints API keys. Cairn also invalidates old indexes when the summarizer, embedder, vector dimension, entity extractor, or xref extractor changes, so switching from --fake to Doubao rebuilds the affected documents instead of quietly reusing incompatible vectors.

Useful operational knobs when running against hosted APIs:

variable	default	purpose
`CAIRN_LLM_TIMEOUT`	`60`	per-request summary timeout in seconds
`CAIRN_LLM_MAX_RETRIES`	`2`	retries for 429/5xx and transport errors
`CAIRN_EMBED_TIMEOUT`	`60`	per-request embedding timeout in seconds
`CAIRN_EMBED_MAX_RETRIES`	`2`	retries for embedding 429/5xx and transport errors
`CAIRN_SUMMARY_CONCURRENCY`	`4`	concurrent summary calls during indexing and benchmarks
`CAIRN_EMBED_BATCH_SIZE`	`32`	sections/chunks per embedding batch

Inspiration and Lineage

Cairn synthesizes two strands of recent research and ships them as a real, agent-ready tool:

BookRAG (Dec 2025): structure-aware index combining a hierarchical tree with an entity graph, queried via an Information-Foraging-Theory-inspired agent. Cairn implements this vision in production-grade form.
A-RAG (Feb 2026): clean agent loop with hierarchical retrieval tools (keyword/semantic/chunk). Cairn borrows the agent-tool philosophy and replaces A-RAG's chunk-based index with a structure-first one.
RAPTOR (ICLR 2024): the seminal recursive-summarization tree. Cairn's summary layer takes inspiration from it while anchoring summaries to the document's own structure instead of clustered chunks.

We are deeply grateful to these authors; see ADRs for the specific design choices we adopted, modified, or declined.

Status & Roadmap

Phase	Status	What
0 — Foundation	☑	Authoritative docs in place (PRODUCT, ARCHITECTURE, CLAUDE, ROADMAP, ADR-0001)
1 — v0.1 walking skeleton	☑	Markdown ingest, Tree + Summaries + Vectors indexes, 5 MCP tools, stdio server, CLI, hero demo
2 — v0.2 structure-aware retrieval	☑	Entities, cross-references, PDF ingest, digest summaries, incremental rebuild, static inspector, `cairn-bench`
3 — v0.3 repo docs graph	◐	Repo `init/sync/status`, repo-scoped MCP, `list_documents`, `search_documents`, `repo_context`, `repo_graph`, `repo_impact`, shareable `.cairn/config.toml`; hosted inspector and telemetry still next
4 — v0.4 polish for production	☐	DOCX/RTF/EPUB, VSCode extension, security review
v1.0 GA	☐	All `PRODUCT.md` §7 success criteria met

Full plan: ROADMAP.md. Current test suite: 436 passing, mypy strict clean, ruff clean.

Maintainer release gate: docs/release-checklist.md.

Contributing

Cairn is opinionated. Before opening a PR, please read:

PRODUCT.md — especially the non-goals.
ARCHITECTURE.md — the end-state design we're building toward.
CONTRIBUTING.md — workflow and PR expectations.
docs/decisions/ — existing ADRs.

If you're an AI agent helping a contributor, you'll find your session anchor in CLAUDE.md.

License

Apache 2.0. See LICENSE.

A cairn is a small stack of stones marking a trail through difficult terrain. This project is one for AI agents lost in large documents.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.0a6 pre-release

Jun 26, 2026

0.1.0a5 pre-release

Jun 25, 2026

0.1.0a4 pre-release

Jun 23, 2026

0.1.0a3 pre-release

Jun 22, 2026

This version

0.1.0a2 pre-release

Jun 22, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docsgraph-0.1.0a2.tar.gz (241.8 kB view details)

Uploaded Jun 22, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

docsgraph-0.1.0a2-py3-none-any.whl (140.2 kB view details)

Uploaded Jun 22, 2026 Python 3

File details

Details for the file docsgraph-0.1.0a2.tar.gz.

File metadata

Download URL: docsgraph-0.1.0a2.tar.gz
Upload date: Jun 22, 2026
Size: 241.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.11 {"installer":{"name":"uv","version":"0.11.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for docsgraph-0.1.0a2.tar.gz
Algorithm	Hash digest
SHA256	`461de849a3d9544d95a888606f1f7b259199d6572e18968066c698311aff7062`
MD5	`05b4f0d251ea99b7013fd01bd89c8eab`
BLAKE2b-256	`3d0108ebadf4d843ba3854ea30a413a21c060059bcf8a58418b512181f3bbf48`

See more details on using hashes here.

File details

Details for the file docsgraph-0.1.0a2-py3-none-any.whl.

File metadata

Download URL: docsgraph-0.1.0a2-py3-none-any.whl
Upload date: Jun 22, 2026
Size: 140.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: uv/0.11.11 {"installer":{"name":"uv","version":"0.11.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for docsgraph-0.1.0a2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`88080b93d7cd012319caa5a274d1ec2f43b0d4f0b2e7cc82bc7d1460208b9854`
MD5	`17ad959bd6edf268003164d34c213994`
BLAKE2b-256	`709b7b0420514096f45fd6226358eaddd80d59043b7a840a83ebf31bc0141ada`

See more details on using hashes here.

docsgraph 0.1.0a2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Cairn

Why Cairn?

How It Works (90 seconds)

Quickstart

Repository Workflow

Single Document Workflow

Benchmarks

Real LLM + real embeddings

Inspiration and Lineage

Status & Roadmap

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes