Skip to main content

Documentation Code Extractor with MCP integration for documentation crawling and code search

Project description

CodeDox - AI-Powered Documentation Search & Code Extraction

Transform any documentation site into a searchable code database - CodeDox crawls documentation websites, intelligently extracts code snippets with context, and provides lightning-fast search via PostgreSQL full-text search and MCP (Model Context Protocol) integration for AI assistants.

📚 Documentation

For full documentation, installation guides, API reference, and more, visit:

https://chriswritescode-dev.github.io/codedox/

Quick Start

Docker Setup (Recommended)

# Clone the repository
git clone https://github.com/chriswritescode-dev/codedox.git
cd codedox

# Configure environment
cp .env.example .env
# Edit .env to add your CODE_LLM_API_KEY (optional for AI-enhanced extraction)

# Run the automated setup
./docker-setup.sh

# Access the web UI at http://localhost:5173
# MCP tools available at http://localhost:8000/mcp

Manual Installation

See the full installation guide for detailed instructions.

Key Features

  • Intelligent Web Crawling: Depth-controlled crawling with URL pattern filtering and domain restrictions
  • Smart Code Extraction: Dual-mode extraction (Automatic Title / Description or LLM Generated Titles and Descriptions)
  • Enhanced Search Modes: Standard code search with intelligent markdown fallback for comprehensive results
  • Lightning-Fast Search: PostgreSQL full-text search with fuzzy matching
  • GitHub Repository Processing: Clone and extract documentation from GitHub repositories with full path support (e.g., /tree/main/docs)
  • HTTP-First MCP Integration: MCP tools via HTTP endpoints with Streamable HTTP transport support (MCP 2025-03-26 spec)
  • Full Documentation Access: Get complete markdown content from any documentation page for full context
  • Modern Web Dashboard: React + TypeScript UI for visual management
  • Version Support: Track multiple versions of documentation
  • Real-time Monitoring: Live crawl progress and health monitoring
  • Upload Support: Upload documentation directly or from GitHub repositories (useful for repos with doc sites)

Demo - MCP Integration Example - OpenCode TUI

CodeDox Demo

Screenshots

Dashboard

CodeDox Dashboard

Markdown Search with Highlighting

CodeDox Markdown Search

Source Detail View

CodeDox Source Detail

Documentation

Contributing

See our Contributing Guide for details on how to contribute to CodeDox.

Author

Chris Scott - chriswritescode.dev

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

iflow_mcp_chriswritescode_dev_codedox-0.3.7.tar.gz (132.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file iflow_mcp_chriswritescode_dev_codedox-0.3.7.tar.gz.

File metadata

  • Download URL: iflow_mcp_chriswritescode_dev_codedox-0.3.7.tar.gz
  • Upload date:
  • Size: 132.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.27 {"installer":{"name":"uv","version":"0.9.27","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Debian GNU/Linux","version":"13","id":"trixie","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for iflow_mcp_chriswritescode_dev_codedox-0.3.7.tar.gz
Algorithm Hash digest
SHA256 3a6eb8f666bda23e96650d107d3ccbb8d46afdbe01439152c2328e3d4486a3a7
MD5 5434272c4d50a3ac2dd3829c79a53b33
BLAKE2b-256 3ad80b3e88390840697420909dd5902fafefd9f4aaec781221ed7104dbf827aa

See more details on using hashes here.

File details

Details for the file iflow_mcp_chriswritescode_dev_codedox-0.3.7-py3-none-any.whl.

File metadata

  • Download URL: iflow_mcp_chriswritescode_dev_codedox-0.3.7-py3-none-any.whl
  • Upload date:
  • Size: 170.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.9.27 {"installer":{"name":"uv","version":"0.9.27","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Debian GNU/Linux","version":"13","id":"trixie","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for iflow_mcp_chriswritescode_dev_codedox-0.3.7-py3-none-any.whl
Algorithm Hash digest
SHA256 26ced23d389a6b91a2713a1225b010c0e1015e598d623a83f1ea9f22001fbbab
MD5 980b233a418ea87df98a03bee0069b7f
BLAKE2b-256 d5d993aca01414dba967aa95e23ffbf5476dcb5348f660318062c616df11c584

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page