Skip to main content

Local MCP server for RAG over PDFs, DOCX, images, and plaintext files.

Project description

mcp-local-rag

Local MCP server for RAG over PDFs, DOCX, images, and plaintext files.

Requirements

For more complex PDFs and image files, the following environment variables can be provided:

  • AZURE_DOCUMENT_INTELLIGENCE_ENDPOINT; requires mcp-local-rag[azure].
  • AZURE_DOCUMENT_INTELLIGENCE_KEY; when omitted, DefaultAzureCredential is used. Requires mcp-local-rag[azure].
  • GEMINI_API_KEY
  • MCP_LOCAL_RAG_GEMINI_MODEL (default: gemini-3-pro-preview)

Image files require either Gemini or Azure DI for extraction. See docs/configuration.md for details.

Data Storage

By default, the server stores data in:

  • Windows: %LOCALAPPDATA%\mcp-local-rag\
  • macOS: ~/Library/Application Support/mcp-local-rag/
  • Linux: $XDG_DATA_HOME/mcp-local-rag/

The data directory contains:

  • markdown/ - Extracted Markdown content of indexed documents
  • metadata.db - SQLite database for document/collection metadata
  • qdrant/ - Vector database for embeddings

AI Models are cached in the default HuggingFace cache directory (~/.cache/huggingface/).

To customize the data directory, set the MCP_LOCAL_RAG_DATA_DIR environment variable (a mcp-local-rag/ subfolder is created automatically inside it).

Usage

VS Code

Add to .vscode/mcp.json:

{
  "servers": {
    "mcp-local-rag": {
      "command": "uvx",
      "args": [
        "--python",
        "3.13",  // Does not support Python 3.14 yet: https://github.com/microsoft/markitdown/issues/1470
        "mcp-local-rag@latest"
      ]
    }
  }
}

If you run into SSL errors (Zscaler), you can try:

{
  "servers": {
    "mcp-local-rag": {
      "command": "uvx",
      "args": [
        "--native-tls",
        "--python",
        "3.13",  // Does not support Python 3.14 yet: https://github.com/microsoft/markitdown/issues/1470
        "--with",
        "pip-system-certs",
        "mcp-local-rag@latest"
      ]
    }
  }
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mcp_local_rag-0.3.5.tar.gz (31.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mcp_local_rag-0.3.5-py3-none-any.whl (38.3 kB view details)

Uploaded Python 3

File details

Details for the file mcp_local_rag-0.3.5.tar.gz.

File metadata

  • Download URL: mcp_local_rag-0.3.5.tar.gz
  • Upload date:
  • Size: 31.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mcp_local_rag-0.3.5.tar.gz
Algorithm Hash digest
SHA256 70b6f006d174057914d75b3904f115d0a23df6eccd0d9d78496648dcc7d63c63
MD5 aca9f524559a78cbc8a18dc4ba168715
BLAKE2b-256 19808698dc0c22d8fc1da436f261fa112c2fcd4f4df7840adf4e46846fb6d3a8

See more details on using hashes here.

Provenance

The following attestation bundles were made for mcp_local_rag-0.3.5.tar.gz:

Publisher: publish.yml on Milliman-CMHH/mcp-local-rag

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file mcp_local_rag-0.3.5-py3-none-any.whl.

File metadata

  • Download URL: mcp_local_rag-0.3.5-py3-none-any.whl
  • Upload date:
  • Size: 38.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for mcp_local_rag-0.3.5-py3-none-any.whl
Algorithm Hash digest
SHA256 7ba11ec586213b179b515a416de3958a1966d83c3ff6e18513fc32dd982a03ee
MD5 acac09af5d02ff4d041d79ff3b22c9d0
BLAKE2b-256 af894799328182c7bb053c9799e64be1dc7b405892e88f1d2fc0d6144d748e1f

See more details on using hashes here.

Provenance

The following attestation bundles were made for mcp_local_rag-0.3.5-py3-none-any.whl:

Publisher: publish.yml on Milliman-CMHH/mcp-local-rag

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page