AI-powered documentation generation and drift detection tool

Dokken - Documentation Drift Detection

Why Dokken?

In the era of AI coding assistants and agents, documentation matters more than ever, just not for the reasons you might think.

Here's the paradox: your AI pair programmer can read every line of code in milliseconds, but it still needs docs to understand why your system works the way it does. The architectural decisions. The boundaries between modules. The things you can't grep for. Without that context, even the best AI will suggest changes that technically work but architecturally regress your codebase.

And let's be honest: documentation has always sucked at its job. Not because developers can't write, but because docs have a shelf life measured in commits. They rot. They lie. Nobody updates them because nobody trusts them, and nobody trusts them because nobody updates them. It's the software equivalent of heat death.

Dokken breaks this cycle. It detects when your docs drift from reality and regenerates them automatically. No more archaeological digs through git history to figure out if that README is from 2019 or 2023. No more "the code is the documentation" excuses (we both know that's a cop-out).

Here's how it works: Dokken captures the stuff code can't express through an interactive questionnaire, asking you about architectural decisions, design trade-offs, and why things are the way they are. Then it generates docs optimized for how both humans and AI agents actually consume them: through grep and search. Because let's face it, nobody reads docs cover-to-cover. We all jump straight to the section we need. Dokken's docs are structured so you (or your AI) can find what you're looking for in seconds.

And here's the best part: Dokken preserves manually written sections. Write an intro by hand (like this one), add custom examples, or craft specific sections yourself. Dokken will leave them untouched and only regenerate the parts it manages. You get the control of manual documentation with the freshness of automated generation.

But here's what matters: Dokken writes documentation for humans, not just machines. Because at the end of the day, humans are the ones who need to understand the overall system architecture to make good decisions, whether they're coding manually or instructing an AI to do it for them. Your AI assistant might be able to implement a feature, but you need to decide if that feature belongs in the auth module or the API layer. That's a human judgment call, and it requires human-level understanding.

What Dokken does:

Generates documentation from scratch when you don't have any (or when you're starting fresh)
Detects documentation drift automatically (new functions, changed signatures, architectural shifts)
Regenerates docs that are actually useful (architectural patterns, design decisions, module boundaries)
Works in CI/CD pipelines (exit code 1 if docs are stale)
Captures human intent through interactive questionnaires (the "why" that code can't express)
Generates search-optimized docs (because grep is how we all find things anyway)

The result? Documentation you can trust. Documentation your AI can use. Documentation that doesn't make you cringe when you read it six months later.

⚠️ Early Development Warning

Dokken is in early alpha development. Expect breaking changes, rough edges, and occasional surprises. If you're using it in production, pin your versions and test thoroughly. Bug reports and feedback are welcome!

Quick Start

# Check for drift in a module
dokken check src/module_name

# Check all modules configured in .dokken.toml
dokken check --all

# Generate module documentation
dokken generate src/module_name

# Generate project README
dokken generate . --doc-type project-readme

# Generate style guide
dokken generate . --doc-type style-guide

Installation

Prerequisites: mise and an API key for one provider (Anthropic, OpenAI, or Google)

git clone https://github.com/mortenkrane/dokken.git
cd dokken
mise install         # Python 3.13.7 + uv
uv sync --all-groups # Dependencies + dev tools

# API key (choose one)
export ANTHROPIC_API_KEY="sk-ant-..."  # Recommended
export OPENAI_API_KEY="sk-..."
export GOOGLE_API_KEY="AIza..."

Commands

`dokken check <module>`

Detect documentation drift. Exit code 1 if drift detected (CI/CD-friendly).

Options:

--all - Check all modules configured in .dokken.toml
--fix - Auto-generate documentation for modules with drift
--doc-type <type> - Type of documentation (module-readme, project-readme, style-guide)
--depth <n> - Directory depth to traverse (0=root only, 1=root+1 level, -1=infinite)
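
The depth values can be illustrated with a short traversal sketch. This is not Dokken's implementation: `collect_files` and its parameters are hypothetical names, and the sketch assumes that depth counts directory levels below the path passed on the command line.

```python
from pathlib import Path


def collect_files(root: Path, depth: int, file_types: tuple[str, ...] = (".py",)) -> list[Path]:
    """Return matching files under root, descending at most `depth` levels.

    depth=0 means root only, depth=1 means root plus one level,
    depth=-1 means no limit (assumed semantics, taken from the option text).
    """
    found: list[Path] = []

    def walk(directory: Path, level: int) -> None:
        for entry in sorted(directory.iterdir()):
            if entry.is_file() and entry.suffix in file_types:
                found.append(entry)
            elif entry.is_dir() and (depth == -1 or level < depth):
                # Only descend while the current level is below the limit.
                walk(entry, level + 1)

    walk(root, 0)
    return found
```

With a layout of `a.py`, `sub/b.py`, and `sub/deep/c.py`, depth 0 yields only `a.py`, depth 1 adds `b.py`, and depth -1 returns all three.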

Examples:

dokken check src/auth                    # Check single module
dokken check --all                       # Check all configured modules
dokken check --all --fix                 # Check and auto-fix drift
dokken check . --doc-type project-readme # Check project README
dokken check . --doc-type style-guide    # Check style guide

`dokken generate <module>`

Generate or update documentation.

Options:

--doc-type <type> - Type of documentation to generate:
- module-readme (default) - Module architectural docs in <module>/README.md
- project-readme - Project README in README.md
- style-guide - Code conventions guide in docs/style-guide.md
--depth <n> - Directory depth to traverse (defaults: module=0, project=1, style-guide=-1)

Process:

Analyze code (depth varies by doc type)
Interactive questionnaire (captures human intent)
Generate documentation with LLM
Write to appropriate location

Examples:

dokken generate src/auth                    # Generate module docs
dokken generate . --doc-type project-readme # Generate project README
dokken generate . --doc-type style-guide    # Generate style guide
dokken generate src/auth --depth 2          # Custom depth

Key Concepts

Drift: Documentation that is out of sync with the code. Dokken flags drift on changes such as:

New/removed functions or classes
Changed function signatures
Modified exports
Major architectural changes
See DRIFT_CHECK_PROMPT in src/llm/prompts.py for full criteria
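
Dokken leaves the actual judgment to an LLM prompt, so the following is only an intuition aid, not how Dokken detects drift. It approximates the first two criteria (new or removed functions, changed signatures) structurally with Python's `ast` module. The helper names `function_signatures` and `structural_drift` are hypothetical.

```python
import ast


def function_signatures(source: str) -> dict[str, str]:
    """Map each top-level function name to a string describing its arguments."""
    tree = ast.parse(source)
    return {
        node.name: ast.unparse(node.args)
        for node in tree.body
        if isinstance(node, (ast.FunctionDef, ast.AsyncFunctionDef))
    }


def structural_drift(old_source: str, new_source: str) -> dict[str, list[str]]:
    """Compare two versions of a module and report function-level changes."""
    old = function_signatures(old_source)
    new = function_signatures(new_source)
    return {
        "added": sorted(new.keys() - old.keys()),
        "removed": sorted(old.keys() - new.keys()),
        "changed": sorted(
            name for name in old.keys() & new.keys() if old[name] != new[name]
        ),
    }
```

A purely structural diff like this cannot see architectural shifts or changes in intent, which is presumably why the real criteria live in a prompt.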

Documentation Types: Dokken generates three types of documentation:

module-readme: Architectural docs for a specific module (depth=0 by default)
project-readme: Top-level project README (depth=1, analyzes the repository root plus one directory level)
style-guide: Code conventions and patterns guide (depth=-1, full recursion)

Module: A Python package or directory; the target of dokken check and dokken generate.

Human Intent Questions: Interactive questionnaire during generation (questions vary by doc type):

Module: Problems solved, core responsibilities, boundaries, system context
Project: Project type, target audience, key problem, setup notes
Style Guide: Unique conventions, organization, patterns to follow/avoid

Interactive Questionnaire

The questionnaire shows a preview of all questions before starting, displays questions on separate lines for readability, and supports multiline answers (press Enter for new lines).

Keyboard shortcuts:

Ctrl+C - Skip the current question, or skip the entire questionnaire when pressed at the preview or on the first question
Esc+Enter or Ctrl+D - Submit answer (most reliable across terminals)
Meta+Enter - Submit answer (may work depending on your terminal)
Leave blank - Skip if no relevant information

Configuration

Quick reference: See examples/.dokken.toml for a comprehensive configuration example with all available options.

API Keys (Environment Variables)

# Choose one provider
export ANTHROPIC_API_KEY="sk-ant-..."  # Claude (Haiku) - Recommended
export OPENAI_API_KEY="sk-..."         # OpenAI (GPT-4o-mini)
export GOOGLE_API_KEY="AIza..."        # Google (Gemini Flash)
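
A preflight check for these variables might look like the sketch below. The variable names come from the list above; the `detect_provider` helper and the lookup order (Anthropic first) are assumptions for illustration, not Dokken's documented behavior.

```python
import os
from collections.abc import Mapping

# Variable names are taken from this README; the priority order is assumed.
PROVIDER_ENV_VARS = {
    "anthropic": "ANTHROPIC_API_KEY",
    "openai": "OPENAI_API_KEY",
    "google": "GOOGLE_API_KEY",
}


def detect_provider(env: Mapping[str, str] = os.environ) -> str:
    """Return the first provider whose API key is set and non-empty."""
    for provider, variable in PROVIDER_ENV_VARS.items():
        if env.get(variable):
            return provider
    raise RuntimeError(
        "No API key found; set one of: " + ", ".join(PROVIDER_ENV_VARS.values())
    )
```

Running such a check before `dokken check` in CI surfaces a missing secret as a clear error rather than a failed API call.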

Configuration Options

Create a .dokken.toml file at your repository root to configure Dokken. Available options:

Multi-Module Projects:

modules = ["src/auth", "src/api", "src/database"]

dokken check --all              # Check all configured modules
dokken check --all --fix        # Check and auto-fix drift

File Types:

file_types = [".py"]           # Python (default)
# file_types = [".js", ".ts"]  # JavaScript/TypeScript
# file_types = [".py", ".js"]  # Multiple languages

Exclusions:

[exclusions]
files = ["__init__.py", "*_test.py", "conftest.py"]
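
How Dokken matches these patterns is not specified here. A reasonable reading is shell-style globbing against the file name, which the sketch below illustrates with the standard `fnmatch` module; `is_excluded` is a hypothetical helper.

```python
from fnmatch import fnmatch
from pathlib import PurePath


def is_excluded(path: str, patterns: list[str]) -> bool:
    """True when the file name (not the full path) matches any pattern."""
    name = PurePath(path).name
    return any(fnmatch(name, pattern) for pattern in patterns)


patterns = ["__init__.py", "*_test.py", "conftest.py"]
candidates = ["src/auth/__init__.py", "src/auth/login.py", "src/auth/login_test.py"]

# Keep only the files that no exclusion pattern matches.
kept = [path for path in candidates if not is_excluded(path, patterns)]
```

Under this assumption only `src/auth/login.py` would remain for analysis.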

Custom Prompts:

[custom_prompts]
global_prompt = "Use British spelling and active voice."
module_readme = "Focus on architectural patterns."

Drift Detection Cache:

[cache]
file = ".dokken-cache.json"  # Default location
max_size = 100               # Max entries (default)

The cache automatically saves drift detection results to avoid redundant LLM API calls (80-95% token reduction in CI). Add .dokken-cache.json to .gitignore.

See examples/.dokken.toml for complete configuration details and all available options.

CI/CD Integration

Use dokken check --all in CI pipelines to enforce documentation hygiene:

Exit code 0: Documentation is up-to-date
Exit code 1: Drift detected (fails the build)
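
Outside of a CI system, the same exit codes can be consumed from a script. The sketch below wraps a command with `subprocess.run` and translates the result; `describe_exit_code` and `run_check` are hypothetical helpers, and only codes 0 and 1 are documented here, so anything else is reported as unexpected.

```python
import subprocess


def describe_exit_code(code: int) -> str:
    """Translate the documented exit codes into a readable status."""
    if code == 0:
        return "up-to-date"
    if code == 1:
        return "drift detected"
    return f"unexpected exit code {code}"


def run_check(command: list[str]) -> str:
    """Run a command without raising on failure and describe its exit code."""
    completed = subprocess.run(command, check=False)
    return describe_exit_code(completed.returncode)


# Example call (requires dokken on PATH):
# status = run_check(["dokken", "check", "--all"])
```

Passing `check=False` matters here because a non-zero exit is an expected outcome, not an error to raise on.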

Minimal GitHub Actions workflow:

- name: Check documentation drift
  run: dokken check --all
  env:
    ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}

With caching (recommended):

- name: Restore drift cache
  uses: actions/cache@v4
  with:
    path: .dokken-cache.json
    key: dokken-drift-${{ hashFiles('src/**/*.py', '.dokken.toml') }}
    restore-keys: dokken-drift-

- name: Check documentation drift
  run: dokken check --all
  env:
    ANTHROPIC_API_KEY: ${{ secrets.ANTHROPIC_API_KEY }}

Caching reduces LLM token consumption by 80-95% for unchanged code.

See examples/dokken-drift-check.yml for a complete workflow with setup instructions, auto-fix options, and multi-platform support.

Features

Three Documentation Types: Module READMEs, project READMEs, and style guides
Configurable Depth: Control code analysis depth (0=root only, -1=infinite recursion)
Drift Detection: Criteria-based detection (see src/llm/prompts.py)
Multi-Module Check: Check all modules with --all flag
Custom Prompts: Inject preferences into generation (see Configuration)
Exclusion Rules: Filter files via .dokken.toml
Multi-Provider LLM: Claude (Haiku), OpenAI (GPT-4o-mini), Google (Gemini Flash)
Cost-Optimized: Fast, budget-friendly models
Human-in-the-Loop: Interactive questionnaire for context AI can't infer
Deterministic: Temperature=0.0 to keep output as reproducible as the provider allows

Development

Dev setup:

# Using mise tasks (recommended)
mise run dev                  # Set up development environment
mise run check                # Run all checks (format, lint, type, test)
mise run test                 # Run tests with coverage
mise run fix                  # Auto-fix formatting and linting
mise tasks                    # List all available tasks

# Or using uv directly
uv sync --all-groups          # Install dependencies + dev tools
uv run pytest tests/ --cov=src  # Run tests with coverage
uv run ruff format            # Format code
uv run ruff check --fix       # Lint and auto-fix
uvx ty check                  # Type checking

Full documentation:

docs/style-guide.md - Architecture, code style, testing, git workflow
CONTRIBUTING.md - Contribution guidelines

Troubleshooting

Q: ModuleNotFoundError when running dokken
A: Run uv sync --all-groups to install dependencies

Q: Drift detection too sensitive
A: Adjust criteria in DRIFT_CHECK_PROMPT in src/llm/prompts.py

Q: How to skip questionnaire?
A: Press Ctrl+C during the question preview or on the first question

Q: Configuration questions (exclusions, custom prompts, multi-module setup)?
A: See examples/.dokken.toml for comprehensive configuration examples

License

MIT License - see LICENSE

Project details

Release history

0.1.7

Apr 5, 2026

0.1.6

Mar 24, 2026

0.1.5

Mar 17, 2026

0.1.4

Feb 12, 2026

This version

0.1.1

Jan 23, 2026

Download files

Source Distribution

dokken-0.1.1.tar.gz (90.2 kB view details)

Uploaded Jan 23, 2026 Source

Built Distribution

dokken-0.1.1-py3-none-any.whl (53.6 kB view details)

Uploaded Jan 23, 2026 Python 3

File details

Details for the file dokken-0.1.1.tar.gz.

File metadata

Download URL: dokken-0.1.1.tar.gz
Upload date: Jan 23, 2026
Size: 90.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dokken-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`c2c37ec765ef328bcbe9505872b6e7c9d57c68c959cc0b61f1fb49ba5906fee4`
MD5	`d172f9a49586a9932ac0946f4e5024b7`
BLAKE2b-256	`9b9cd331c8b054c999590e839fb237a92de82370b53dbaee20bba0a892567ef8`

Provenance

The following attestation bundles were made for dokken-0.1.1.tar.gz:

Publisher: publish-pypi.yml on mortenkrane/dokken

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: dokken-0.1.1.tar.gz
- Subject digest: c2c37ec765ef328bcbe9505872b6e7c9d57c68c959cc0b61f1fb49ba5906fee4
- Sigstore transparency entry: 848931480
- Sigstore integration time: Jan 23, 2026
Source repository:
- Permalink: mortenkrane/dokken@a77a95875e57671ff49958ab814a2cc402cfb1e1
- Branch / Tag: refs/tags/dokken-v0.1.1
- Owner: https://github.com/mortenkrane
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@a77a95875e57671ff49958ab814a2cc402cfb1e1
- Trigger Event: release

File details

Details for the file dokken-0.1.1-py3-none-any.whl.

File metadata

Download URL: dokken-0.1.1-py3-none-any.whl
Upload date: Jan 23, 2026
Size: 53.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for dokken-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f3e560240e4ab119d7b1edc32ca9ad2a1f6cf892c7a5ac31c0a58a51b439546b`
MD5	`d2c1f8fae847e9385a14c6ec2bb51122`
BLAKE2b-256	`0e08e085a9311bde16e80c33c5c090c4340d940a7240e131e0ff1f23b10ee78a`

Provenance

The following attestation bundles were made for dokken-0.1.1-py3-none-any.whl:

Publisher: publish-pypi.yml on mortenkrane/dokken

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: dokken-0.1.1-py3-none-any.whl
- Subject digest: f3e560240e4ab119d7b1edc32ca9ad2a1f6cf892c7a5ac31c0a58a51b439546b
- Sigstore transparency entry: 848931485
- Sigstore integration time: Jan 23, 2026
Source repository:
- Permalink: mortenkrane/dokken@a77a95875e57671ff49958ab814a2cc402cfb1e1
- Branch / Tag: refs/tags/dokken-v0.1.1
- Owner: https://github.com/mortenkrane
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-pypi.yml@a77a95875e57671ff49958ab814a2cc402cfb1e1
- Trigger Event: release
