AI-powered document perception and analysis MCP server with intelligent provider selection
Project description
๐ Docsray MCP Server
Docsray is a powerful Model Context Protocol (MCP) server that gives AI assistants like Claude advanced document perception capabilities. Extract text, navigate pages, analyze structure, and understand any document with ease.
โ Status: Published to PyPI and TestPyPI - Working in Cursor, Claude Desktop, and other MCP clients
โจ Features
๐ฏ Five Powerful Tools
docsray_peek- Quick document overview with format detection and provider capabilitiesdocsray_map- Generate comprehensive document structure maps with cachingdocsray_xray- AI-powered deep analysis extracting entities, relationships, and insightsdocsray_extract- Extract content in multiple formats (markdown, text, JSON, tables)docsray_seek- Navigate to specific pages, sections, or search for content
๐ Multi-Provider Architecture
-
PyMuPDF4LLM - Lightning-fast PDF processing (โ Implemented)
- Fast markdown extraction
- Basic table detection
- Multi-page support
- Always enabled as fallback
-
LlamaParse - Deep document understanding with LLMs (โ Implemented)
- AI-powered entity extraction
- Custom analysis instructions
- Comprehensive caching in .docsray directories
- Rich format preservation (markdown, images, tables)
-
PyTesseract - OCR for scanned documents (๐ Planned)
-
Mistral OCR - AI-powered OCR and analysis (๐ Planned)
๐ Key Benefits
- Universal Input Support - Local files (./path, ../path, /absolute) and URLs (https://)
- Intelligent Provider Selection - Automatically chooses the best tool for each task
- Smart Caching - LlamaParse results cached in .docsray directories for instant access
- Dynamic Discovery - Tools report actual capabilities based on what's enabled
- Production Ready - Comprehensive error handling, logging, and 56 tests
- Self-Documenting - Built-in resources for discovery by MCP clients
๐ฆ Installation
Quick Start with uvx (Recommended)
# Run directly without installation
uvx docsray-mcp start
# Or install globally
uv tool install docsray-mcp
# Then run with:
docsray start
# or
docsray-mcp start
Alternative: Install with pip
# Basic installation (PyMuPDF4LLM only)
pip install docsray-mcp
# With LlamaParse for AI analysis
pip install "docsray-mcp[ai]"
# Development installation
pip install -e ".[dev]"
๐ Quick Start
1. Set up API Keys (Optional but Recommended)
Create a .env file in your project:
# For AI-powered analysis with LlamaParse
LLAMAPARSE_API_KEY=llx-your-key-here
# Or use environment variables
export LLAMAPARSE_API_KEY=llx-your-key-here
Get your free LlamaParse API key at cloud.llamaindex.ai
2. Configure with Your MCP Client
For Cursor
Add to your Cursor settings:
{
"mcpServers": {
"docsray": {
"command": "uvx",
"args": ["docsray-mcp"],
"env": {
"LLAMAPARSE_API_KEY": "llx-your-key-here"
}
}
}
}
For Claude Desktop
Add to ~/Library/Application Support/Claude/claude_desktop_config.json:
{
"mcpServers": {
"docsray": {
"command": "uvx",
"args": ["docsray-mcp"],
"env": {
"LLAMAPARSE_API_KEY": "llx-your-key-here"
}
}
}
}
๐ Usage Examples
Basic Document Overview
Peek at ./document.pdf to see its structure and available formats
Extract Entities from Contracts
Xray ./contract.pdf and extract all parties, dates, payment terms, and obligations
Navigate Documents
Map the complete structure of ./manual.pdf including all sections and subsections
Extract Specific Content
Extract pages 10-20 from ./report.pdf as markdown
Analyze Web Documents
Analyze https://arxiv.org/pdf/2301.00234.pdf for methodology and key findings
Compare Providers
Extract text from document.pdf with provider pymupdf4llm (fast)
Xray document.pdf with provider llama-parse (AI analysis)
๐ ๏ธ Advanced Configuration
Environment Variables
# Provider Configuration
DOCSRAY_PYMUPDF4LLM_ENABLED=true # Always true by default
DOCSRAY_LLAMAPARSE_ENABLED=true
LLAMAPARSE_API_KEY=llx-your-key
# Performance Tuning
DOCSRAY_CACHE_ENABLED=true
DOCSRAY_CACHE_TTL=3600
DOCSRAY_MAX_CONCURRENT_REQUESTS=5
DOCSRAY_TIMEOUT_SECONDS=30
# Logging
DOCSRAY_LOG_LEVEL=INFO
Provider Capabilities
PyMuPDF4LLM (Always Available)
- โ Fast text extraction
- โ Markdown formatting
- โ Basic table detection
- โ Multi-page support
- โ No AI analysis
- โ No OCR
LlamaParse (When API Key Configured)
- โ AI-powered analysis
- โ Entity extraction
- โ Custom instructions
- โ Table extraction
- โ Image extraction
- โ Layout preservation
- โ Relationship mapping
- โ Result caching
๐งช Testing
# Run all tests
pytest tests/
# Run only unit tests (no API calls)
pytest tests/unit/
# Run integration tests
pytest tests/integration/
# Run with coverage
pytest tests/ --cov=src/docsray --cov-report=html
Current test coverage: 52 tests passing with comprehensive coverage across all components
๐ API Reference
Tool: docsray_peek
Get quick document overview and metadata.
{
"document_url": "path/to/document.pdf",
"depth": "structure", # metadata | structure | preview
"provider": "auto" # auto | pymupdf4llm | llama-parse
}
Tool: docsray_map
Generate comprehensive document structure map.
{
"document_url": "path/to/document.pdf",
"include_content": false,
"analysis_depth": "deep", # basic | deep | comprehensive
"provider": "auto"
}
Tool: docsray_xray
Deep AI-powered document analysis.
{
"document_url": "path/to/document.pdf",
"analysis_type": ["entities", "key-points"],
"custom_instructions": "Extract all dates and amounts",
"provider": "llama-parse"
}
Tool: docsray_extract
Extract content in various formats.
{
"document_url": "path/to/document.pdf",
"extraction_targets": ["text", "tables"],
"output_format": "markdown", # markdown | text | json
"pages": [1, 2, 3], # Optional: specific pages
"provider": "auto"
}
Tool: docsray_seek
Navigate to specific document locations.
{
"document_url": "path/to/document.pdf",
"target": {"page": 5}, # or {"section": "Introduction"} or {"query": "search text"}
"extract_content": true,
"provider": "auto"
}
๐๏ธ Architecture
docsray-mcp/
โโโ src/docsray/
โ โโโ server.py # FastMCP server with discovery resources
โ โโโ providers/ # Provider implementations
โ โ โโโ base.py # Provider interface
โ โ โโโ pymupdf4llm.py # Fast PDF extraction
โ โ โโโ llamaparse.py # AI-powered analysis
โ โโโ tools/ # MCP tool implementations
โ โ โโโ peek.py # Document overview
โ โ โโโ map.py # Structure mapping
โ โ โโโ xray.py # Deep analysis
โ โ โโโ extract.py # Content extraction
โ โ โโโ seek.py # Navigation
โ โโโ utils/ # Utilities
โ โโโ cache.py # Document caching
โ โโโ llamaparse_cache.py # LlamaParse .docsray cache
โโโ tests/
โ โโโ unit/ # Fast isolated tests
โ โโโ integration/ # Component interaction tests
โ โโโ manual/ # Debugging scripts
โโโ PROMPTS.md # Example prompts for all use cases
๐ค Contributing
We welcome contributions! See CONTRIBUTING.md for guidelines.
Development Setup
# Clone the repository
git clone https://github.com/docsray/docsray-mcp.git
cd docsray-mcp
# Install in development mode
pip install -e ".[dev]"
# Run tests
pytest tests/
# Run linting
ruff check src/
๐ License
This project is licensed under the Apache License 2.0 - see the LICENSE file for details.
๐ Acknowledgments
- Built on FastMCP framework
- Document processing powered by PyMuPDF4LLM
- AI analysis powered by LlamaParse
- Inspired by the Model Context Protocol specification
๐ฌ Support
- ๐ Documentation
- ๐ Issue Tracker
- ๐ฌ Discussions
Made with โค๏ธ for the MCP ecosystem
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file docsray_mcp-0.3.3.tar.gz.
File metadata
- Download URL: docsray_mcp-0.3.3.tar.gz
- Upload date:
- Size: 46.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
f813428b2f23c1833249752933197afde7ba3145dc9438371091a42657913155
|
|
| MD5 |
44b944ced4ad673e82d68d0c3b39d301
|
|
| BLAKE2b-256 |
f49ba927f03cb84d539434d067cd335cd77fed64f6c188c20defee6a64758050
|
File details
Details for the file docsray_mcp-0.3.3-py3-none-any.whl.
File metadata
- Download URL: docsray_mcp-0.3.3-py3-none-any.whl
- Upload date:
- Size: 50.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.11
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8ecd57a743a1c473c298884e2ef95be4bcaa3d538226fa4f8c8ca6253ef0f7a6
|
|
| MD5 |
2c62971759cdf000a698726843f78b76
|
|
| BLAKE2b-256 |
ae6bc28592742595d74b0a4cb09737456a73b2f655a271f34f06b1fc669bf8e7
|