MCP server for reading and visually analyzing local documents (PDF, Excel, CSV, Word, PowerPoint, images). No API keys required — works with GitHub Copilot and any MCP-compatible AI host.
Project description
Docalyze MCP Server
An MCP (Model Context Protocol) server that lets AI assistants read and visually analyze local documents — PDFs, Excel spreadsheets, CSV files, Word documents, PowerPoint presentations, and images.
No API keys required. The host AI (GitHub Copilot, Claude, etc.) does all the reasoning directly.
Supported Formats
| Format | Extensions | Read | Visual |
|---|---|---|---|
.pdf |
✅ | ✅ | |
| Excel | .xlsx, .xls |
✅ | ✅ |
| CSV / TSV | .csv, .tsv |
✅ | — |
| JSON | .json |
✅ | — |
| Word | .docx |
✅ | ✅ |
| PowerPoint | .pptx |
✅ | ✅ |
| Plain text | .txt, .md |
✅ | — |
| Images | .png, .jpg, .jpeg, .gif, .bmp, .tiff, .webp |
— | ✅ |
Tools
| Tool | Description |
|---|---|
list_documents |
List files under a directory, filtered by glob pattern |
document_info |
Get metadata (size, modified date, sheets) for a file |
read_document |
Extract text content from a document with pagination |
visual_evaluate_document |
Return page images inline so the AI can analyze charts, tables, and diagrams |
Installation
From VS Code (recommended)
Search for docalyze in the MCP server gallery (Extensions sidebar → MCP tab) and click Install.
From PyPI
pip install docalyze-mcp-server
From npm
npx docalyze-mcp-server
This requires uv or pipx installed — the npm wrapper calls uvx to run the Python package automatically.
Manual setup
Add to your VS Code mcp.json (or settings.json):
{
"servers": {
"docalyze": {
"type": "stdio",
"command": "python",
"args": ["-m", "docalyze_mcp_server"],
"env": {
"PYTHONIOENCODING": "utf-8"
}
}
}
}
Or, if you installed via pip and want to use the entry point:
{
"servers": {
"docalyze": {
"type": "stdio",
"command": "docalyze-mcp-server"
}
}
}
Optional Dependencies
The base install handles PDF, Excel, CSV, JSON, and plain text. For additional formats:
# Word documents
pip install docalyze-mcp-server[docx]
# PowerPoint
pip install docalyze-mcp-server[pptx]
# OCR (requires Tesseract installed on your system)
pip install docalyze-mcp-server[ocr]
# Everything
pip install docalyze-mcp-server[all]
Configuration
The server reads documents from a configurable root directory. Set the DOCUMENTS_ROOT environment variable to change it:
{
"servers": {
"docalyze": {
"type": "stdio",
"command": "docalyze-mcp-server",
"env": {
"DOCUMENTS_ROOT": "/path/to/your/documents"
}
}
}
}
If not set, it defaults to the directory containing the server script.
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file docalyze_mcp_server-0.2.1.tar.gz.
File metadata
- Download URL: docalyze_mcp_server-0.2.1.tar.gz
- Upload date:
- Size: 10.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
4d46d458eed45379b311972391ebe56205c04ee660dddb84af988ac5ff599fba
|
|
| MD5 |
6f2970b822e83f868d5868f12ce3faa9
|
|
| BLAKE2b-256 |
84dbee80493109a16299fee09dfc5a46a4ddc4b0142b95e6ed22faec3d2898eb
|
File details
Details for the file docalyze_mcp_server-0.2.1-py3-none-any.whl.
File metadata
- Download URL: docalyze_mcp_server-0.2.1-py3-none-any.whl
- Upload date:
- Size: 10.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a5e4321caf88e4a83db251e364d094fee185b20bc26db3a11c24a80cde3f2f6a
|
|
| MD5 |
38eb3b184ebbdc13a1a0d29a4568a767
|
|
| BLAKE2b-256 |
fa83fe91bb01b18a7a9f5c9a19ea0a491103448756a12dcf5010556084245b6a
|