Skip to main content

MCP server for MinerU - Parse PDFs and images (OCR) with MLX acceleration on Apple Silicon

Project description

MCP-MinerU

PyPI version Python 3.10+ License

MCP server for document and image parsing via MinerU. Extract text, tables, and formulas from PDFs, screenshots, and scanned documents with MLX acceleration on Apple Silicon.

Installation

claude mcp add --transport stdio --scope user mineru -- \
  uvx --from mcp-mineru python -m mcp_mineru.server

This command installs and configures the server for all your Claude Code projects using uvx (no manual installation required).

Alternative methods: See Installation Guide for PyPI, source installation, and Claude Desktop configuration.

Features

  • Multiple format support: PDF, JPEG, PNG, and other image formats
  • OCR capabilities: Built-in text extraction from screenshots and photos
  • Table recognition: Preserves structure when extracting tables
  • Formula extraction: Converts mathematical equations to LaTeX
  • MLX acceleration: Optimized for Apple Silicon (M1/M2/M3/M4)
  • Multiple backends: Choose speed vs quality tradeoffs

Quick Start

Parse a PDF document

User: "Analyze the tables in research_paper.pdf"
Claude: [Calls parse_pdf tool] "The paper contains 3 tables..."

Extract text from a screenshot

User: "What does this screenshot say? image.png"
Claude: [Calls parse_pdf tool] "The screenshot contains..."

Check system capabilities

User: "Which backend should I use?"
Claude: [Calls list_backends tool] "Your system has Apple Silicon M4..."

For more examples, see Usage Examples.

Tools

parse_pdf

Parse PDF and image files to extract structured content as Markdown.

Parameters:

  • file_path (required): Absolute path to file (PDF, JPEG, PNG, etc.)
  • backend (optional): pipeline | vlm-mlx-engine | vlm-transformers
  • formula_enable (optional): Enable formula recognition (default: true)
  • table_enable (optional): Enable table recognition (default: true)
  • start_page (optional): Starting page for PDFs (default: 0)
  • end_page (optional): Ending page for PDFs (default: -1)

list_backends

Check system capabilities and get backend recommendations.

Returns: System information, available backends, and performance recommendations.

Supported Formats

  • PDF documents (.pdf)
  • JPEG images (.jpg, .jpeg)
  • PNG images (.png)
  • Other image formats (WebP, GIF, etc.)

Performance

Benchmarked on Apple Silicon M4 (16GB RAM):

  • pipeline: ~32s/page, CPU-only, good quality
  • vlm-mlx-engine: ~38s/page, Apple Silicon optimized, excellent quality
  • vlm-transformers: ~148s/page, highest quality, slowest

Documentation

Development

git clone https://github.com/TINKPA/mcp-mineru.git
cd mcp-mineru
uv pip install -e ".[dev]"

# Run tests
pytest

# Format code
black src/
ruff check src/

License

Apache License 2.0 - see LICENSE file for details.

Acknowledgments

Built on top of MinerU by OpenDataLab.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcp_mineru-0.1.3.tar.gz (10.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mcp_mineru-0.1.3-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file mcp_mineru-0.1.3.tar.gz.

File metadata

  • Download URL: mcp_mineru-0.1.3.tar.gz
  • Upload date:
  • Size: 10.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.8.22

File hashes

Hashes for mcp_mineru-0.1.3.tar.gz
Algorithm Hash digest
SHA256 bfe4eae7dc6b61ddfb64c71e1678efe3af1b85b6d8b447541cb5710ef06f30e1
MD5 2c8d3568480ac7a8a8d191b9c08ac265
BLAKE2b-256 b92880cce0de08c2120903426fa899b40f5ba72b3509de1b203792ae6c7d2428

See more details on using hashes here.

File details

Details for the file mcp_mineru-0.1.3-py3-none-any.whl.

File metadata

File hashes

Hashes for mcp_mineru-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 96ef14b2b487049ed35107907489d888bf46f4093e8942e305a3e195737ca340
MD5 11f7bc1a2d19ba012a9ba97787225e57
BLAKE2b-256 c49901580b0b89c92fa9a77121bfa9e144b4c5d8c444e4b5c1c10e3878794798

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page