Fetch docs, embed locally, expose to AI agents via skills.

Project description

docmancer

Stop your AI from hallucinating APIs. Ground it in real docs, locally.

✅ Up-to-date, version-specific documentation
✅ Only the chunks your agent needs, not the whole doc site
✅ Completely free. No rate limits, no quotas, no tiers.
✅ Private and internal docs work out of the box
✅ No MCP server. Installs as a skill, runs as a CLI.
✅ 100% local. Embeddings, storage, retrieval all on your machine.
✅ Works offline once ingested

pipx install docmancer --python python3.13

The Problem · How It Works · Why Local? · Install · Quickstart · Commands · Configuration · Troubleshooting

Quickstart

# 1. Install pipx
brew install pipx
pipx ensurepath

# 2. Open a new shell, then install docmancer
pipx install docmancer --python python3.13

# 3. Ingest a docs source
docmancer ingest https://docs.example.com

# 4. Install the skill into your agents
docmancer install claude-code
docmancer install cursor
docmancer install codex

# 5. Query from the CLI
docmancer query "How do I authenticate?"

No server to start. Config and the default vector store are created under ~/.docmancer/ on first use.

The Problem

Claude Code sessions have a context limit. Every time you paste docs into a session, or let the agent browse and re-fetch the same pages, you're burning that budget on setup instead of actual work. Once the session gets noisy enough, the agent starts guessing: made-up CLI flags, stale API shapes, behaviors from old versions.

The obvious fix (dumping whole doc sites into context) makes it worse. You burn thousands of tokens on irrelevant text and bury the one paragraph that actually matters.

Cloud-based doc tools add rate limits, privacy exposure, and server dependencies you don't need.

Docmancer solves this differently. You ingest docs once, they're chunked and indexed locally, and the agent retrieves only the matching sections when it needs them: a few hundred tokens instead of tens of thousands.

Works With Every Agent

Docmancer installs a skill file into each agent that teaches it to call the CLI directly. One local index, one ingest step, every agent covered.

Agent	Install command
Claude Code	`docmancer install claude-code`
Codex	`docmancer install codex`
Cursor	`docmancer install cursor`
Gemini CLI	`docmancer install gemini`
OpenCode	`docmancer install opencode`
Claude Desktop	`docmancer install claude-desktop`

Skills are plain markdown files. No background daemon, no MCP server, no ports.

How Docmancer Fixes It

Chunk and embed locally. Docmancer splits docs into 800-token chunks and embeds them with FastEmbed, fully on your machine. No embedding API costs, no data leaving your system.

Hybrid retrieval. Queries run dense + sparse (BM25) retrieval in parallel and merge results with reciprocal rank fusion. Dense vectors catch semantic meaning; BM25 catches exact terms like flag names, error codes, and method signatures.

Return only what matches. A query returns 5 chunks by default (a few hundred tokens). The whole site stays indexed; only the relevant slice lands in context.

Concurrent-safe. Multiple CLI calls from parallel agents or different terminals are serialized with a file lock. No corruption.

How It Works

┌──────────────────────────────────────────────────────────────────────────┐
│  DOCMANCER FLOW                                                          │
├──────────────────────────────────────────────────────────────────────────┤
│                                                                          │
│  INGEST                 INDEX                      RETRIEVE              │
│  ┌────────────┐         ┌────────────┐         ┌──────────────────────┐  │
│  │ GitBook    │         │ Chunk text │         │ docmancer query      │  │
│  │ Mintlify   │   ─►    │ FastEmbed  │   ─►    │ e.g. how to auth?    │  │
│  │ Web docs   │         │ vectors on │         │                      │  │
│  │ Local docs │         │ disk Qdrant│         │ → top matching       │  │
│  │ .md / .txt │         │            │         │   chunks only        │  │
│  └────────────┘         └────────────┘         │                      │  │
│       │                       ▲                └──────────────────────┘  │
│       └───────────────────────┴── dense + sparse (BM25); file lock       │
│                                                                          │
│  SKILL INSTALL                           AGENT                           │
│  ┌──────────────────────────┐            ┌──────────────────────────┐    │
│  │ docmancer install        │            │ Claude Code, Cursor,     │    │
│  │ claude-code, cursor, …   │    ─►      │ Codex, … run the CLI     │    │
│  └──────────────────────────┘            │ via installed SKILL.md   │    │
│                                          └──────────────────────────┘    │
│                                                                          │
└──────────────────────────────────────────────────────────────────────────┘

docmancer ingest: fetches docs from GitBook, Mintlify, generic web docs, or local files. Chunks and embeds them locally with FastEmbed. Stores vectors in on-disk Qdrant.
docmancer install: drops a SKILL.md into your agent's skills directory. The skill teaches the agent when and how to call the CLI.
Agent queries automatically: when your agent needs docs, it runs docmancer query and gets back only the relevant chunks.

Why Local?

	DocMancer
Cost	Free, always. No tiers, no quotas.
Rate limits	None. Query as much as you want.
Private docs	Supported free. No paid plan required.
Data privacy	Nothing leaves your machine.
Infrastructure	No server. CLI + local storage.
Offline use	Yes, after ingestion.
Embedding	Local FastEmbed. No API key needed.

Commands

Command	What it does
`docmancer ingest <url-or-path>`	Fetch, chunk, embed, and index docs locally
`docmancer query <text>`	Retrieve relevant chunks from the local index
`docmancer install <agent>`	Install skill file for a supported agent
`docmancer list`	List ingested sources with timestamps
`docmancer fetch <url>`	Download GitBook docs as markdown (no embedding)
`docmancer remove <source>`	Remove an ingested source from the index
`docmancer inspect`	Show collection stats and config
`docmancer doctor`	Health check: PATH, config, Qdrant, installed skills
`docmancer init`	Create a project-local `docmancer.yaml`

Use --full with docmancer query to return the entire chunk body (default truncates at 1500 characters). Use --limit N to change how many chunks are returned.

Install

Recommended: install pipx with Homebrew, then install docmancer with an explicit supported Python version.

brew install pipx
pipx ensurepath

Open a new shell, then install docmancer:

pipx install docmancer --python python3.13

Supports Python 3.11-3.13. Pass the version explicitly: pipx may pick the wrong interpreter on some machines.

On Apple Silicon, prefer the native Homebrew Python:

pipx install docmancer --python /opt/homebrew/bin/python3.13

Upgrade

pipx upgrade docmancer

If you want to keep a specific Python version, reinstall explicitly:

pipx reinstall docmancer --python python3.13

Install Targets

Command	Where the skill lands
`docmancer install claude-code`	`~/.claude/skills/docmancer/SKILL.md`
`docmancer install codex`	`~/.codex/skills/docmancer/SKILL.md` (also mirrors to `~/.agents/skills/docmancer/SKILL.md`)
`docmancer install cursor`	`~/.cursor/skills/docmancer/SKILL.md` + marked block in `~/.cursor/AGENTS.md` when needed
`docmancer install opencode`	`~/.config/opencode/skills/docmancer/SKILL.md`
`docmancer install gemini`	`~/.gemini/skills/docmancer/SKILL.md`
`docmancer install claude-desktop`	`~/.docmancer/exports/claude-desktop/docmancer.zip`: upload via Customize → Skills

Use --project with claude-code or gemini to install under .claude/skills/... or .gemini/skills/... in the current working directory.

Configuration

Resolution order: --config → ./docmancer.yaml in the current directory → ~/.docmancer/docmancer.yaml (auto-created on first use).

Configuration Reference

Section	Key	Default	What it controls
`embedding`	`provider`	`fastembed`	Embedding provider
`embedding`	`model`	`BAAI/bge-small-en-v1.5`	Embedding model name
`embedding`	`batch_size`	`256`	Chunks embedded per local batch
`embedding`	`parallel`	`0`	FastEmbed worker count (`0` = all cores)
`embedding`	`lazy_load`	`true`	Defer model loading for worker processes
`vector_store`	`provider`	`qdrant`	Vector store backend
`vector_store`	`local_path`	`~/.docmancer/qdrant`	On-disk storage path
`vector_store`	`url`	(unset)	Remote Qdrant URL (overrides `local_path`)
`vector_store`	`collection_name`	`knowledge_base`	Qdrant collection name
`vector_store`	`retrieval_limit`	`5`	Max chunks returned per query
`vector_store`	`score_threshold`	`0.35`	Minimum similarity score
`ingestion`	`chunk_size`	`800`	Tokens per chunk
`ingestion`	`chunk_overlap`	`120`	Overlap between chunks
`ingestion`	`bm25_model`	`Qdrant/bm25`	Sparse retrieval model

Example `docmancer.yaml`

embedding:
  provider: fastembed
  model: BAAI/bge-small-en-v1.5
  batch_size: 256
  parallel: 0
  lazy_load: true

vector_store:
  provider: qdrant
  local_path: .docmancer/qdrant # resolved relative to this file's directory
  collection_name: knowledge_base
  retrieval_limit: 5
  score_threshold: 0.35

ingestion:
  chunk_size: 800
  chunk_overlap: 120
  bm25_model: Qdrant/bm25

embedding.batch_size must be at least 1. For large local ingests, increase batch_size cautiously and use parallel: 0 to let FastEmbed use all available CPU cores.

Supported Sources

Source	Strategy
GitBook sites	`--provider gitbook`: `/llms-full.txt` → `/llms.txt`
Mintlify sites	`--provider mintlify` or `auto`: `/llms-full.txt` → `/llms.txt` → `/sitemap.xml`
Generic web docs	`--provider web`: generic crawler for non-GitBook / non-Mintlify sites
Local `.md` / `.txt`	Read from disk

Troubleshooting

`pip install` succeeds, but `docmancer` is `command not found`

This usually means the scripts directory is not on your PATH. The install output will show the path:

WARNING: The script docmancer is installed in '/Users/your-user/Library/Python/3.13/bin' which is not on PATH.

Recommended fix:

brew install pipx
pipx ensurepath
pipx install docmancer --python python3.13

Or confirm the install by running the script directly:

~/Library/Python/3.13/bin/docmancer doctor

`pipx install docmancer` says `No matching distribution found`

This means pipx picked an unsupported Python version. docmancer requires Python 3.11–3.13.

pipx install docmancer --python python3.13

If Python 3.13 is not installed:

brew install python@3.13
pipx install docmancer --python python3.13

`pipx install` fails: Apple Silicon / architecture mismatch

On macOS, pipx and Python can end up on different architectures (arm64 vs x86_64). Use the native Homebrew Python explicitly:

pipx install docmancer --python /opt/homebrew/bin/python3.13

If needed:

arch -arm64 pipx install docmancer --python /opt/homebrew/bin/python3.13

`docmancer doctor` crashes with `pydantic_core` or architecture error

The virtualenv was created with the wrong architecture. Recreate it:

deactivate
rm -rf .venv
arch -arm64 /opt/homebrew/bin/python3.13 -m venv .venv
source .venv/bin/activate
pip install -e .[dev]

Contributing

For development setup and contributing, see CONTRIBUTING.md.

License

MIT License. See LICENSE for details.

Your agents are guessing. Fix that in two commands.

Project details

Release history Release notifications | RSS feed

0.4.9

Apr 28, 2026

0.4.8

Apr 28, 2026

0.4.7

Apr 28, 2026

0.4.6

Apr 27, 2026

0.4.5

Apr 21, 2026

0.4.4

Apr 21, 2026

0.4.3

Apr 21, 2026

0.4.2

Apr 21, 2026

0.4.1

Apr 21, 2026

0.4.0

Apr 21, 2026

0.3.4

Apr 15, 2026

0.3.3

Apr 15, 2026

0.3.2

Apr 14, 2026

0.3.1

Apr 14, 2026

0.3.0

Apr 12, 2026

0.2.3

Apr 10, 2026

0.2.2

Apr 8, 2026

0.2.1

Apr 7, 2026

0.2.0

Apr 7, 2026

0.1.11

Apr 3, 2026

0.1.9

Apr 1, 2026

This version

0.1.8

Apr 1, 2026

0.1.7

Apr 1, 2026

0.1.6

Mar 31, 2026

0.1.5

Mar 30, 2026

0.1.4

Mar 30, 2026

0.1.3

Mar 30, 2026

0.1.2

Mar 30, 2026

0.1.1

Mar 29, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docmancer-0.1.8.tar.gz (93.0 kB view details)

Uploaded Apr 1, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

docmancer-0.1.8-py3-none-any.whl (70.2 kB view details)

Uploaded Apr 1, 2026 Python 3

File details

Details for the file docmancer-0.1.8.tar.gz.

File metadata

Download URL: docmancer-0.1.8.tar.gz
Upload date: Apr 1, 2026
Size: 93.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for docmancer-0.1.8.tar.gz
Algorithm	Hash digest
SHA256	`3ad0bb0ee323c48558139dd17fe3b82dfccb2b531128d59156bb98ddc8947b9b`
MD5	`1f1c74e92475ea7628c855e0eab0f1f2`
BLAKE2b-256	`e35e31d17999beff6e47e64b2d828cdeecb9ed1f709f3d013a078c4b2f9d4c23`

See more details on using hashes here.

Provenance

The following attestation bundles were made for docmancer-0.1.8.tar.gz:

Publisher: publish.yml on docmancer/docmancer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: docmancer-0.1.8.tar.gz
- Subject digest: 3ad0bb0ee323c48558139dd17fe3b82dfccb2b531128d59156bb98ddc8947b9b
- Sigstore transparency entry: 1208668254
- Sigstore integration time: Apr 1, 2026
Source repository:
- Permalink: docmancer/docmancer@55c396b39169cd7de0ef9e8a7cb06826e3f6f846
- Branch / Tag: refs/tags/v0.1.8
- Owner: https://github.com/docmancer
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@55c396b39169cd7de0ef9e8a7cb06826e3f6f846
- Trigger Event: push

File details

Details for the file docmancer-0.1.8-py3-none-any.whl.

File metadata

Download URL: docmancer-0.1.8-py3-none-any.whl
Upload date: Apr 1, 2026
Size: 70.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for docmancer-0.1.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`e85bf299c0cdcfaace42e7a7b6f4cf33e6a9f1b35b2b59e04891921f9e00b43e`
MD5	`d195141f3480f0beb8a67aad66276a1f`
BLAKE2b-256	`a9cd3334d9d96fdd67943fb44e38ee450395c6a0a4eb1636faea8df3faa910eb`

See more details on using hashes here.

Provenance

The following attestation bundles were made for docmancer-0.1.8-py3-none-any.whl:

Publisher: publish.yml on docmancer/docmancer

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: docmancer-0.1.8-py3-none-any.whl
- Subject digest: e85bf299c0cdcfaace42e7a7b6f4cf33e6a9f1b35b2b59e04891921f9e00b43e
- Sigstore transparency entry: 1208668359
- Sigstore integration time: Apr 1, 2026
Source repository:
- Permalink: docmancer/docmancer@55c396b39169cd7de0ef9e8a7cb06826e3f6f846
- Branch / Tag: refs/tags/v0.1.8
- Owner: https://github.com/docmancer
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@55c396b39169cd7de0ef9e8a7cb06826e3f6f846
- Trigger Event: push

docmancer 0.1.8

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

docmancer

Quickstart

The Problem

Works With Every Agent

How Docmancer Fixes It

How It Works

Why Local?

Commands

Install

Upgrade

Install Targets

Configuration

Configuration Reference

Example docmancer.yaml

Supported Sources

Troubleshooting

pip install succeeds, but docmancer is command not found

pipx install docmancer says No matching distribution found

pipx install fails: Apple Silicon / architecture mismatch

docmancer doctor crashes with pydantic_core or architecture error

Contributing

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance

Example `docmancer.yaml`

`pip install` succeeds, but `docmancer` is `command not found`

`pipx install docmancer` says `No matching distribution found`

`pipx install` fails: Apple Silicon / architecture mismatch

`docmancer doctor` crashes with `pydantic_core` or architecture error