A structure-aware chapter search MCP server.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

marcomq

These details have not been verified by PyPI

Project description

chapter-mcp

chapter-mcp gives coding agents a compact way to search a project by useful sections instead of opening whole files.

It indexes a workspace locally, splits files into "chapters", and exposes MCP tools for searching, listing, and reading those chapters. The goal is simple: help an assistant find the right part of a repo before it burns context on raw file reads.

It is intentionally boring in a good way: local SQLite FTS5, deterministic full-text search, no embeddings, no vector database, and no external service.

What it is good for

Use chapter-mcp when an assistant needs project context but does not yet know which file or section matters. Good fits:

project README sections
AGENTS.md, CLAUDE.md, instructions, and local notes
ADRs and design docs
Markdown/TXT documentation
source files where top-level classes or functions make useful chapters
focused reads after a search result points to the right line range

Not a good fit:

exact code matching with punctuation, operators, routes, or config keys
symbol references, definitions, diagnostics, or safe edits
fuzzy semantic search across very different wording

For those, keep using the sharper tool: rg for exact text, Serena or another language-aware tool for symbols, raw file reads for known ranges, and vector/RAG tooling for real semantic retrieval.

How chapters work

chapter-mcp turns files into smaller units: Markdown by headings, Python by top-level classes/functions, and other text files by paragraphs. Search results point to chapter names and line ranges. Reads can return a whole chapter or a limited slice of chapter content.

By default, if no explicit config is provided, chapter-mcp discovers visible top-level project entries and indexes them. In a Git repo it uses tracked files, skips hidden top-level entries and .chapter-mcp, respects .gitignore, and applies .aiignore rules.

That means the happy path is: add the MCP server to your assistant, start the assistant in a repo, and let discovery do the first pass.

Install

chapter-mcp is available on PyPI. The quickest way to run it is with uvx:

uvx chapter-mcp --root .

For a persistent install, use:

uv tool install chapter-mcp

or:

pip install chapter-mcp

For local development from this checkout, see Development.

Use Case 1: Local Development Project

For a normal local project, do not start chapter-mcp manually in a terminal. Add it to the MCP config for the assistant you use in that project.

When the assistant starts the server, chapter-mcp indexes the project root automatically unless you override it. No project config is required for the first run.

Codex

Add a project-local Codex MCP entry, for example in .codex/config.toml:

[mcp_servers.chapter-mcp]
command = "uvx"
args = ["chapter-mcp", "--root", "."]
cwd = "."
startup_timeout_sec = 20
required = false

Then add instructions in AGENTS.md so Codex actually reaches for the tool:

## Project context

Use `chapter-mcp` before broad file reads when looking for project docs,
instructions, README sections, ADRs, or chapterized source sections.

Prefer this flow:
1. Search with `chapter-mcp` to find the relevant section.
2. Read the matching chapter or a small slice of it.
3. Use Serena for symbol definitions, references, diagnostics, and safe edits.
4. For literal code/config queries, pass `exact_code_matches=true` to
   `search` or `read_search`.
5. Use `rg` when you need exhaustive exact matching, absence checks, or
   verification after edits.

Use `chapter-mcp` for discovery and focused reads. Use `rg` as the final source
of truth for repo-wide exact matching.

Claude Code

For Claude Code, add a project .mcp.json:

{
  "mcpServers": {
    "chapter-mcp": {
      "command": "uvx",
      "args": ["chapter-mcp", "--root", "${CLAUDE_PROJECT_DIR:-.}"]
    }
  }
}

Or add it through Claude's MCP command:

claude mcp add-json chapter-mcp '{"command":"uvx","args":["chapter-mcp","--root","${CLAUDE_PROJECT_DIR:-.}"]}'

Then add instructions in CLAUDE.md:

## Project context

Use the `chapter-mcp` MCP server before reading large files. It is best for
finding relevant README sections, docs, local instructions, ADRs, and
chapterized source sections.

Use `chapter-mcp` for discovery and focused chapter reads. Use normal file
reads only after the relevant file or line range is known.

For literal code/config queries, pass `exact_code_matches=true` to `search` or
`read_search`. Use exact search tools such as `rg` for exhaustive repo-wide
matching, absence checks, and verification after edits. Use language-aware tools
for symbols.

Optional project config

Auto-discovery is the default. Add .chapter-mcp/config.json only when you want to pin categories, exclude noisy top-level folders, or index a specific set of paths:

{
  "paths": [
    "docs=docs",
    "src=src",
    "tests=tests",
    "readme=README.md"
  ]
}

Supported config fields are root, db, paths, watch, watch_interval, and sync_startup. paths is required when using config. CLI flags override project config when both are present.

Useful server flags:

chapter-mcp --root .
chapter-mcp --root . --path docs --path src
chapter-mcp --root . --path knowledge=.serena/memories
chapter-mcp --root . --no-watch
chapter-mcp --root . --no-sync-startup
chapter-mcp --root . --watch-interval 0.5

Use Case 2: Assistants and Agents

The agent use case is similar, but the emphasis is different: chapter-mcp becomes a context-routing layer. When a task starts vague, the first move can be a small search over indexed chapters instead of a broad file read.

Recommended flow:

Search local instructions and docs with chapter-mcp.
Read the best chapter with read_search(..., content_limit=40) or read_chapter(..., content_limit=40).
Switch to symbol tools if the task becomes code-aware.
Use rg for exact verification.
If you know a file and line, try read_chapter_at before a raw range read.
Use raw reads only after the path and range are clear.

This is especially useful when pairing chapter-mcp with tools such as Serena:

chapter-mcp answers "which section should I inspect?"
Serena answers "which symbol is this, and who references it?"
rg answers "where does this exact text occur?"
raw reads answer "what is the exact source in this known range?"

Minimal agent instruction block

If you only want a small instruction, this is enough:

Use `chapter-mcp` before broad file reads for indexed project context:
README sections, docs, instructions, ADRs, Markdown/TXT files, and chapterized
source sections.

Use `read_chapter` with `content_limit` for focused reads. Use `rg` for exact
literals and Serena/language tools for symbols and references. If you want code
or config hits to contain the raw query exactly, pass `exact_code_matches=true`
to `search` or `read_search`. If you know a file and approximate line, use
`read_chapter_at` before reading a raw line range.

Compared with file-read-mcp

file-read-mcp is useful when the assistant already knows the file or range it needs. It is direct and close to the filesystem.

chapter-mcp is useful one step earlier. It helps the assistant discover the right section before choosing what to read.

In practice:

use chapter-mcp to search and shortlist relevant sections
use read_chapter to inspect the matching chapter without flooding context
use file-read-mcp, sed, or editor reads for exact raw ranges

The distinction is small but important. file-read-mcp is about access. chapter-mcp is about choosing what is worth accessing.

MCP tools

The server exposes search, search_chapter, read_search, read_chapter_at, read_chapter, list_chapters, list_chapters_as_columns, list_files, stats, and reindex.

search and read_search also accept exact_code_matches=false. When set to true, docs keep normal FTS behavior, but code/config chapters must contain the raw query as an exact substring.

read_search, read_chapter_at, and read_chapter accept content_limit for small first reads. read_chapter_at is useful when the alternative would be a raw line-range read and you want the indexed chapter around that line first.

Example calls:

stats()
list_files()
search("config handling", limit=3, include_snippet=True)
search("extract-codex-session", exact_code_matches=True)
search_chapter("Style guide", category="docs", limit=3)
read_search("config handling", category="docs", content_limit=40)
read_chapter_at(file="docs/style-guide.md", line=25, content_limit=40)
read_chapter("Style guide", file="docs/style-guide.md", content_limit=40)
list_chapters_as_columns(fields=["name"])

Benchmark basics

chapter-mcp does not measure savings inside the server. The benchmark helper summarizes observed tool traffic from outside the server so you can compare a baseline run with a chapter-first run.

For Codex sessions, extract a neutral JSONL log from the latest session:

uv run chapter-mcp benchmark extract-codex-session --out baseline.jsonl

Or pass a specific session log:

uv run chapter-mcp benchmark extract-codex-session \
  ~/.codex/sessions/2026/05/25/example.jsonl \
  --out chapter-first.jsonl

Summarize one run:

uv run chapter-mcp benchmark summarize baseline.jsonl

Compare two runs:

uv run chapter-mcp benchmark compare baseline.jsonl chapter-first.jsonl

The input format is deliberately simple JSONL:

{"tool":"sed","cmd":"rtk sed -n '1,220p' src/foo.py","chars_out":12340,"lines_out":220}
{"tool":"rg","cmd":"rtk rg column src","chars_out":1840,"lines_out":35}
{"tool":"chapter-mcp","call_name":"search","chars_out":2300,"lines_out":40}

Required fields are tool and chars_out. Optional fields include bytes_out, lines_out, cmd, path, range, and timestamp.

The report is meant as a rough comparison, not a scientific token meter. It is most useful for spotting repeated broad reads and checking whether chapter-first workflows reduce raw file output.

Limits

chapter-mcp is full-text search, not semantic search. Literal wording matters. If users ask fuzzy conceptual questions and the source uses very different phrasing, use a vector or RAG tool instead.

It is also not an exact source-code matcher. SQLite FTS tokenization is not the right tool for punctuation-heavy code, operators, or paths. Use rg for that.

The intended tradeoff is narrow and practical: fast local chapter lookup that keeps an assistant oriented before it reaches for heavier tools.

Development

uv sync
uv run pytest

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

marcomq

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.1.1

May 26, 2026

0.1.0

May 25, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

chapter_mcp-0.1.1.tar.gz (137.4 kB view details)

Uploaded May 26, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

chapter_mcp-0.1.1-py3-none-any.whl (30.8 kB view details)

Uploaded May 26, 2026 Python 3

File details

Details for the file chapter_mcp-0.1.1.tar.gz.

File metadata

Download URL: chapter_mcp-0.1.1.tar.gz
Upload date: May 26, 2026
Size: 137.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for chapter_mcp-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`1e503d1f1444ad4a7694c97b32861d1d7c275b7f02c70901f16359b88997d055`
MD5	`5c8cbe2307410f3fcc55e06236c68acb`
BLAKE2b-256	`564582c3d5382f07a65fd43800ba4f3ae955c4c7fefe985f68d92d0423489687`

See more details on using hashes here.

Provenance

The following attestation bundles were made for chapter_mcp-0.1.1.tar.gz:

Publisher: publish.yml on marcomq/chapter-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: chapter_mcp-0.1.1.tar.gz
- Subject digest: 1e503d1f1444ad4a7694c97b32861d1d7c275b7f02c70901f16359b88997d055
- Sigstore transparency entry: 1635387235
- Sigstore integration time: May 26, 2026
Source repository:
- Permalink: marcomq/chapter-mcp@ca37c1c38c54fd3aa42945f0671f2ea36f55083b
- Branch / Tag: refs/tags/v0.1.1
- Owner: https://github.com/marcomq
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ca37c1c38c54fd3aa42945f0671f2ea36f55083b
- Trigger Event: release

File details

Details for the file chapter_mcp-0.1.1-py3-none-any.whl.

File metadata

Download URL: chapter_mcp-0.1.1-py3-none-any.whl
Upload date: May 26, 2026
Size: 30.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for chapter_mcp-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`639473404a6e56ad8bf0ad296f265c6551b6adecab72a4982fc3d661f230e3eb`
MD5	`26a3633a23d7885d6149d7e7e564d967`
BLAKE2b-256	`7b5edce67c6084501d8a2debe8d624a49b40f802acab59be11b6a63c98cfb24c`

See more details on using hashes here.

Provenance

The following attestation bundles were made for chapter_mcp-0.1.1-py3-none-any.whl:

Publisher: publish.yml on marcomq/chapter-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: chapter_mcp-0.1.1-py3-none-any.whl
- Subject digest: 639473404a6e56ad8bf0ad296f265c6551b6adecab72a4982fc3d661f230e3eb
- Sigstore transparency entry: 1635387256
- Sigstore integration time: May 26, 2026
Source repository:
- Permalink: marcomq/chapter-mcp@ca37c1c38c54fd3aa42945f0671f2ea36f55083b
- Branch / Tag: refs/tags/v0.1.1
- Owner: https://github.com/marcomq
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@ca37c1c38c54fd3aa42945f0671f2ea36f55083b
- Trigger Event: release

chapter-mcp 0.1.1

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

chapter-mcp

What it is good for

How chapters work

Install

Use Case 1: Local Development Project

Codex

Claude Code

Optional project config

Use Case 2: Assistants and Agents

Minimal agent instruction block

Compared with file-read-mcp

MCP tools

Benchmark basics

Limits

Development

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance