Skip to main content

MCP server for reading and visually analyzing local documents (PDF, Excel, CSV, Word, PowerPoint, images). No API keys required — works with GitHub Copilot and any MCP-compatible AI host.

Project description

Docalyze Logo

Docalyze MCP Server

An MCP (Model Context Protocol) server that lets AI assistants read and visually analyze local documents — PDFs, Excel spreadsheets, CSV files, Word documents, PowerPoint presentations, and images.

No API keys required. The host AI (GitHub Copilot, Claude, etc.) does all the reasoning directly.

Supported Formats

Format Extensions Read Visual
PDF .pdf
Excel .xlsx, .xls
CSV / TSV .csv, .tsv
JSON .json
Word .docx
PowerPoint .pptx
Plain text .txt, .md
Images .png, .jpg, .jpeg, .gif, .bmp, .tiff, .webp

Tools

Tool Description
list_documents List files under a directory, filtered by glob pattern
document_info Get metadata (size, modified date, sheets) for a file
read_document Extract text content from a document with pagination
visual_evaluate_document Return page images inline so the AI can analyze charts, tables, and diagrams

Installation

From VS Code (recommended)

Search for docalyze in the MCP server gallery (Extensions sidebar → MCP tab) and click Install.

From PyPI

pip install docalyze-mcp-server

From npm

npx docalyze-mcp-server

This requires uv or pipx installed — the npm wrapper calls uvx to run the Python package automatically.

Manual setup

Add to your VS Code mcp.json (or settings.json):

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "python",
      "args": ["-m", "docalyze_mcp_server"],
      "env": {
        "PYTHONIOENCODING": "utf-8"
      }
    }
  }
}

Or, if you installed via pip and want to use the entry point:

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "docalyze-mcp-server"
    }
  }
}

Optional Dependencies

The base install handles PDF, Excel, CSV, JSON, and plain text. For additional formats:

# Word documents
pip install docalyze-mcp-server[docx]

# PowerPoint
pip install docalyze-mcp-server[pptx]

# OCR (requires Tesseract installed on your system)
pip install docalyze-mcp-server[ocr]

# Everything
pip install docalyze-mcp-server[all]

Configuration

The server reads documents from a configurable root directory. Set the DOCUMENTS_ROOT environment variable to change it:

{
  "servers": {
    "docalyze": {
      "type": "stdio",
      "command": "docalyze-mcp-server",
      "env": {
        "DOCUMENTS_ROOT": "/path/to/your/documents"
      }
    }
  }
}

If not set, it defaults to the directory containing the server script.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docalyze_mcp_server-0.2.1.tar.gz (10.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

docalyze_mcp_server-0.2.1-py3-none-any.whl (10.4 kB view details)

Uploaded Python 3

File details

Details for the file docalyze_mcp_server-0.2.1.tar.gz.

File metadata

  • Download URL: docalyze_mcp_server-0.2.1.tar.gz
  • Upload date:
  • Size: 10.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for docalyze_mcp_server-0.2.1.tar.gz
Algorithm Hash digest
SHA256 4d46d458eed45379b311972391ebe56205c04ee660dddb84af988ac5ff599fba
MD5 6f2970b822e83f868d5868f12ce3faa9
BLAKE2b-256 84dbee80493109a16299fee09dfc5a46a4ddc4b0142b95e6ed22faec3d2898eb

See more details on using hashes here.

File details

Details for the file docalyze_mcp_server-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for docalyze_mcp_server-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 a5e4321caf88e4a83db251e364d094fee185b20bc26db3a11c24a80cde3f2f6a
MD5 38eb3b184ebbdc13a1a0d29a4568a767
BLAKE2b-256 fa83fe91bb01b18a7a9f5c9a19ea0a491103448756a12dcf5010556084245b6a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page