docmancer

Compress local documentation context for coding agents.

These details have not been verified by PyPI

Project description

Your agents' memory, unified, local, and yours.

Install | First run | What you get | Wiki

Your coding agents (Claude Code, Codex, Cursor, Gemini, OpenCode, Cline, Windsurf, and more) already write memory, instructions, and rules all over this machine, each locked inside its own tool. Docmancer discovers all of it, syncs it into one local hybrid (lexical + dense) index, and lets you recall any past decision instantly and offline. The full memory loop is four steps:

Sync (docmancer memory sync): discover and index every agent's memory, instructions, and rules into one local SQLite index. Local, offline, no keys.
Recall (docmancer memory query): hybrid search across everything your agents have ever written, with source provenance.
Consolidate (docmancer memory consolidate): use Mistral AI to turn the scattered, duplicated memory into one coherent, review-only master-memory draft.
Apply (docmancer memory apply): materialize the reviewed draft into an agent's always-loaded file, so the context loads every session with no tool call.

The same engine also does docs RAG as a secondary capability: point it at a folder of Markdown / PDF / DOCX / RTF / HTML or a docs URL (GitBook, Mintlify, generic web, GitHub) and query it the same way. A fresh install ships everything you need for the local path: SQLite FTS5 for lexical search, a static embedding model (potion-base-8M) vendored in the package so there is no large model download and no network at runtime, and sqlite-vec for dense vectors in a single local file with no daemon.

Install

pipx install docmancer    # Python 3.11, 3.12, or 3.13

If pipx picks an unsupported interpreter, pin one: pipx install docmancer --python python3.13.

First run

Two commands take you from a fresh install to recalling your agents' memory:

docmancer setup                                  # discovers and indexes the agent memory already on this machine
docmancer memory query "why did we pick Railway" # recall a past decision, offline

setup creates ~/.docmancer/ with the config and SQLite database, syncs the memory, instructions, and rules your coding agents already wrote (Claude Code, Codex, Cursor, Gemini, OpenCode, Cline, Windsurf, and more) plus repo-level CLAUDE.md / AGENTS.md / GEMINI.md, auto-detects installed agents, and installs their skill files. There is no large model download and no network at runtime: the static embedding model is vendored in the package.

Re-sync any time and see exactly what was indexed and from where:

docmancer memory sync                # discover, redact, and (re)index everything
docmancer memory sources             # provenance: agent, type, scope, title, path, char count
docmancer memory sources --preview   # live re-harvest (what WOULD index) without writing

Want docs RAG too? The same engine indexes documentation:

docmancer ingest ./docs                             # index local files
docmancer add https://docs.pytest.org               # or a docs URL
docmancer query "How do I parametrize a fixture?"   # hybrid search across the docs index

Consolidate and carry memory across agents (Mistral AI)

Syncing gives you one searchable index. Consolidation turns that pile into a single coherent memory. docmancer memory consolidate sends your retrieved local memory (privacy-redacted first) to Mistral AI by default and gets back a review-only master-memory draft: deduplicated, grouped into compact sections, with conflicts surfaced as warnings instead of silently resolved.

export MISTRAL_API_KEY=...    # the only extra step; the local commands never need a key
docmancer memory consolidate \
  --query "deployment and infra decisions" \
  --output master-memory-draft.md \
  --draft-quality fast \
  --timeout 180

Once you have reviewed the draft, materialize it into an agent's always-loaded file so the context loads every session with no tool call and, crucially, so memory written in one agent shows up in the others:

docmancer memory apply --agent codex   # uses master-memory-draft.md by default
docmancer memory apply --agent codex --dry-run   # preview the diff first

apply is local and keyless. It writes only inside a clearly delimited managed block, takes a timestamped backup first, and never touches your own surrounding content. --remove strips the block for a clean uninstall. This is the only command that writes consolidated memory into agent-owned files, and it is never automatic. (docmancer install codex / claude-code also inject a short recall instruction into the same files, in their own managed block.)

Mistral is used directly through the official mistralai client: Mistral structured outputs extract durable memory facts, and a Mistral chat model (mistral-small-2506 by default) produces the review-only consolidated draft. Pick any Mistral model your account provisions with --model, or set DOCMANCER_MISTRAL_MODEL to change the default once. Consolidation uses smaller bounded batches by default, --max-output-tokens caps generated output per request, and --draft-quality fast uses more aggressive compression.

OpenRouter is available as an explicit fallback for consolidation when you want another hosted model:

export OPENROUTER_API_KEY=...
docmancer memory consolidate \
  --provider openrouter \
  --model openai/gpt-4.1-nano \
  --output master-memory-draft.md \
  --yes

With OpenRouter, --model accepts any OpenRouter chat model id your account can use, and DOCMANCER_OPENROUTER_MODEL changes the default. Use --timeout, DOCMANCER_MISTRAL_TIMEOUT_SECONDS, or DOCMANCER_OPENROUTER_TIMEOUT_SECONDS to bound each provider request, with a finite 180 second default and 0 for the provider default. Optionally, mistral-embed-2312 can build the local vector index (docmancer init --embedding-provider mistral). Every cloud-backed command fails gracefully with a clear message when the provider key is not set or the API call fails, prints a cloud-use notice before the first call, sends a tiny preflight chat request before large memory payloads, logs each request before sending it, and runs secret redaction before any text leaves your machine. See the Configuration and Commands pages for details.

What you get

Your agents' memory, unified. docmancer memory sync discovers and indexes the memory, instructions, and rules your coding agents already wrote (Claude Code, Codex, Cursor, Gemini, OpenCode, Cline, Windsurf, and more, plus repo-level CLAUDE.md / AGENTS.md / GEMINI.md), then answers questions about them through one local index. docmancer memory sources shows exact provenance per file. The local path uploads nothing.

Consolidate with Mistral AI. docmancer memory consolidate turns the scattered index into one review-only master-memory draft via direct Mistral by default, and docmancer memory apply bakes the reviewed result into an agent's always-loaded file so context carries across agents. OpenRouter is available as an explicit fallback with --provider openrouter --model <model-id>. Key-gated, privacy-redacted, and review-only.

Callable over MCP. The packaged docmancer-mcp stdio server exposes local memory and docs search to MCP clients. docmancer mcp install codex (or claude-code, claude-desktop) wires it up; optional Mistral tools appear when MISTRAL_API_KEY is set. Requires the mcp extra.

Hybrid search by default. query and memory query fan out across SQLite FTS5 (lexical, BM25-reranked) and dense vectors from a vendored static model (potion-base-8M) in sqlite-vec, then fuse results with Reciprocal Rank Fusion. Sparse (SPLADE) signals are available on the optional heavy Qdrant backend. The token budget keeps responses small so your agent has room for actual work:

Context pack: ~900 tokens vs ~4800 raw docs tokens (81.2% less docs overhead, 5.33x agentic runway)

No large model download, offline at runtime. The static embedding model ships inside the wheel, so there are no API keys and no network needed to embed or query. Optional OpenAI / Voyage / Cohere providers exist if you want them; a heavier FastEmbed + Qdrant backend is available via pipx install "docmancer[embeddings-heavy]".

Where your data lives and how to remove it

The local memory index is stored in SQLite-backed files under ~/.docmancer/ (override the main database with DOCMANCER_MEMORY_DB). Sync, query, status, sources, apply, and clear run locally. Mistral-backed commands are optional, key-gated, and send selected memory text only after privacy redaction and a cloud-use confirmation. You can preview exactly what would be indexed with docmancer memory sync --dry-run, scope the harvest with --include / --exclude globs, and delete the local memory index files with docmancer memory clear. There is no telemetry and no phone-home.

Inspectable. Every section is written to ~/.docmancer/extracted/ as Markdown plus JSON. docmancer inspect shows index stats. docmancer query --explain shows which signal (lexical / dense / sparse) placed each result.

Agent integration built in. docmancer setup drops skill files for Claude Code, Cursor, Codex, Cline, Claude Desktop, Gemini, GitHub Copilot, and OpenCode. For Claude Code and Codex it also injects a memory-recall instruction into the always-loaded CLAUDE.md / ~/.codex/AGENTS.md (a managed block), so the agent reliably calls docmancer memory query before answering questions about past work.

Where to next

The wiki is the authoritative reference for everything else. Pick a page based on what you need:

Page	When to read it
Commands	Core docs commands and Qdrant lifecycle commands
Configuration	All YAML keys, env vars, and the API-key reference
Architecture	How ingest, retrieval, and Qdrant lifecycle work
Supported Sources	What file formats and URL providers are covered
Install Targets	Where each agent's skill file lands
Troubleshooting	Common errors and fixes

Wiki home | Changelog | PyPI

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.6.5

Jun 21, 2026

0.6.4

Jun 19, 2026

0.6.3

Jun 19, 2026

0.6.2

Jun 19, 2026

0.6.1

Jun 18, 2026

0.6.0

Jun 18, 2026

0.5.2

May 16, 2026

0.5.1

May 16, 2026

0.5.0

May 15, 2026

0.4.9

Apr 28, 2026

0.4.8

Apr 28, 2026

0.4.7

Apr 28, 2026

0.4.6

Apr 27, 2026

0.4.5

Apr 21, 2026

0.4.4

Apr 21, 2026

0.4.3

Apr 21, 2026

0.4.2

Apr 21, 2026

0.4.1

Apr 21, 2026

0.4.0

Apr 21, 2026

0.3.4

Apr 15, 2026

0.3.3

Apr 15, 2026

0.3.2

Apr 14, 2026

0.3.1

Apr 14, 2026

0.3.0

Apr 12, 2026

0.2.3

Apr 10, 2026

0.2.2

Apr 8, 2026

0.2.1

Apr 7, 2026

0.2.0

Apr 7, 2026

0.1.11

Apr 3, 2026

0.1.9

Apr 1, 2026

0.1.8

Apr 1, 2026

0.1.7

Apr 1, 2026

0.1.6

Mar 31, 2026

0.1.5

Mar 30, 2026

0.1.4

Mar 30, 2026

0.1.3

Mar 30, 2026

0.1.2

Mar 30, 2026

0.1.1

Mar 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docmancer-0.6.5.tar.gz (29.9 MB view details)

Uploaded Jun 21, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

docmancer-0.6.5-py3-none-any.whl (28.6 MB view details)

Uploaded Jun 21, 2026 Python 3

File details

Details for the file docmancer-0.6.5.tar.gz.

File metadata

Download URL: docmancer-0.6.5.tar.gz
Upload date: Jun 21, 2026
Size: 29.9 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for docmancer-0.6.5.tar.gz
Algorithm	Hash digest
SHA256	`48976d3e4ce52096cb5605a9393ca385b7b6c6f0719ba79ac3e92bc48d417617`
MD5	`5ee5692b37ed40159a426c0e8c7d0ca4`
BLAKE2b-256	`cd8980bdd3628f33c78ca016b807be13be23361a8dd1e2a10e17955a21ea02fb`

See more details on using hashes here.

Provenance

The following attestation bundles were made for docmancer-0.6.5.tar.gz:

Publisher: publish.yml on docmancer/docmancer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: docmancer-0.6.5.tar.gz
- Subject digest: 48976d3e4ce52096cb5605a9393ca385b7b6c6f0719ba79ac3e92bc48d417617
- Sigstore transparency entry: 1897899568
- Sigstore integration time: Jun 21, 2026
Source repository:
- Permalink: docmancer/docmancer@15b884ec7a72faa6fd30cde3d926e5141a322f1f
- Branch / Tag: refs/tags/v0.6.5
- Owner: https://github.com/docmancer
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@15b884ec7a72faa6fd30cde3d926e5141a322f1f
- Trigger Event: push

File details

Details for the file docmancer-0.6.5-py3-none-any.whl.

File metadata

Download URL: docmancer-0.6.5-py3-none-any.whl
Upload date: Jun 21, 2026
Size: 28.6 MB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for docmancer-0.6.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a75f87436b3a9c1d1687a3e157183c2b59800ca4452dcbc7ab6408ad705773e1`
MD5	`722ed3b6aee3a49d36fc06068a8419f0`
BLAKE2b-256	`b600c0b25ce27f8bbb3f69ba72cb3a58489b3143632a69b20b3c1159018d66b6`

See more details on using hashes here.

Provenance

The following attestation bundles were made for docmancer-0.6.5-py3-none-any.whl:

Publisher: publish.yml on docmancer/docmancer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: docmancer-0.6.5-py3-none-any.whl
- Subject digest: a75f87436b3a9c1d1687a3e157183c2b59800ca4452dcbc7ab6408ad705773e1
- Sigstore transparency entry: 1897899800
- Sigstore integration time: Jun 21, 2026
Source repository:
- Permalink: docmancer/docmancer@15b884ec7a72faa6fd30cde3d926e5141a322f1f
- Branch / Tag: refs/tags/v0.6.5
- Owner: https://github.com/docmancer
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@15b884ec7a72faa6fd30cde3d926e5141a322f1f
- Trigger Event: push

docmancer 0.6.5

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Install

First run

Consolidate and carry memory across agents (Mistral AI)

What you get

Where your data lives and how to remove it

Where to next

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance