Skip to main content

Documentation Code Extractor with MCP integration for documentation crawling and code search

Project description

CodeDox - AI-Powered Documentation Search & Code Extraction

Transform any documentation site into a searchable code database - CodeDox crawls documentation websites, intelligently extracts code snippets with context, and provides lightning-fast search via PostgreSQL full-text search and MCP (Model Context Protocol) integration for AI assistants.

📚 Documentation

For full documentation, installation guides, API reference, and more, visit:

https://chriswritescode-dev.github.io/codedox/

Quick Start

Docker Setup (Recommended)

# Clone the repository
git clone https://github.com/chriswritescode-dev/codedox.git
cd codedox

# Configure environment
cp .env.example .env
# Edit .env to add your CODE_LLM_API_KEY (optional for AI-enhanced extraction)

# Run the automated setup
./docker-setup.sh

# Access the web UI at http://localhost:5173
# MCP tools available at http://localhost:8000/mcp

Manual Installation

See the full installation guide for detailed instructions.

Key Features

  • Intelligent Web Crawling: Depth-controlled crawling with URL pattern filtering and domain restrictions
  • Smart Code Extraction: Dual-mode extraction (Automatic Title / Description or LLM Generated Titles and Descriptions)
  • Enhanced Search Modes: Standard code search with intelligent markdown fallback for comprehensive results
  • Lightning-Fast Search: PostgreSQL full-text search with fuzzy matching
  • GitHub Repository Processing: Clone and extract documentation from GitHub repositories with full path support (e.g., /tree/main/docs)
  • HTTP-First MCP Integration: MCP tools via HTTP endpoints with Streamable HTTP transport support (MCP 2025-03-26 spec)
  • Full Documentation Access: Get complete markdown content from any documentation page for full context
  • Modern Web Dashboard: React + TypeScript UI for visual management
  • Version Support: Track multiple versions of documentation
  • Real-time Monitoring: Live crawl progress and health monitoring
  • Upload Support: Upload documentation directly or from GitHub repositories (useful for repos with doc sites)

Demo - MCP Integration Example - OpenCode TUI

CodeDox Demo

Screenshots

Dashboard

CodeDox Dashboard

Markdown Search with Highlighting

CodeDox Markdown Search

Source Detail View

CodeDox Source Detail

Documentation

Contributing

See our Contributing Guide for details on how to contribute to CodeDox.

Author

Chris Scott - chriswritescode.dev

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iflow_mcp_chriswritescode_dev_codedox-0.3.8.tar.gz (132.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file iflow_mcp_chriswritescode_dev_codedox-0.3.8.tar.gz.

File metadata

  • Download URL: iflow_mcp_chriswritescode_dev_codedox-0.3.8.tar.gz
  • Upload date:
  • Size: 132.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.27 {"installer":{"name":"uv","version":"0.9.27","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Debian GNU/Linux","version":"13","id":"trixie","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for iflow_mcp_chriswritescode_dev_codedox-0.3.8.tar.gz
Algorithm Hash digest
SHA256 c5c2965e4aaf600e79bdbe17ae9bc904551d66074c4083eba8a5117357850dee
MD5 37fad1547c0fb2b83b2c169335ea996c
BLAKE2b-256 7a411a80c20a93853dfaf1804992a970a783e6b71ddf3fbd2461cdcb5c1d98d4

See more details on using hashes here.

File details

Details for the file iflow_mcp_chriswritescode_dev_codedox-0.3.8-py3-none-any.whl.

File metadata

  • Download URL: iflow_mcp_chriswritescode_dev_codedox-0.3.8-py3-none-any.whl
  • Upload date:
  • Size: 170.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.27 {"installer":{"name":"uv","version":"0.9.27","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Debian GNU/Linux","version":"13","id":"trixie","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for iflow_mcp_chriswritescode_dev_codedox-0.3.8-py3-none-any.whl
Algorithm Hash digest
SHA256 725517f90f8d91b63c272cbca62eb28b129b9a50d91ea4fb4d528c815052ca8a
MD5 fa74876a213eada95d944160b4ec7d03
BLAKE2b-256 a312ad87e431e5729070e1a69268442607dec76dd1b3398baf10ddf9591e6418

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page