mcp-local-rag

Local MCP server for RAG over PDFs, DOCX, images, and plaintext files.

Requirements

To handle more complex PDFs and image files, set the following optional environment variables:

  • AZURE_DOCUMENT_INTELLIGENCE_ENDPOINT; requires the mcp-local-rag[azure] extra.
  • AZURE_DOCUMENT_INTELLIGENCE_KEY; when omitted, DefaultAzureCredential is used. Requires the mcp-local-rag[azure] extra.
  • GEMINI_API_KEY
  • MCP_LOCAL_RAG_GEMINI_MODEL (default: gemini-3-pro-preview)

Image files require either Gemini or Azure DI for extraction. See docs/configuration.md for details.
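
As a rough illustration of how these variables combine, the sketch below summarizes which optional extractors appear to be configured. The function name summarize_extraction_config and the returned keys are hypothetical and not part of mcp-local-rag. Treating "endpoint is set" as "Azure DI is enabled" is an assumption, and the server's actual selection logic may differ.

```python
import os
from typing import Mapping, Optional

# Default model name as stated in this README.
DEFAULT_GEMINI_MODEL = "gemini-3-pro-preview"


def summarize_extraction_config(env: Mapping[str, str] = os.environ) -> dict:
    """Hypothetical helper: report which optional extractors look configured."""
    azure_endpoint = env.get("AZURE_DOCUMENT_INTELLIGENCE_ENDPOINT")
    azure_key = env.get("AZURE_DOCUMENT_INTELLIGENCE_KEY")
    gemini_key = env.get("GEMINI_API_KEY")

    # Assumption: Azure DI is usable only when an endpoint is provided.
    azure_auth: Optional[str]
    if not azure_endpoint:
        azure_auth = None
    elif azure_key:
        azure_auth = "api-key"
    else:
        # Per the README, DefaultAzureCredential is used when the key is omitted.
        azure_auth = "DefaultAzureCredential"

    return {
        "azure_enabled": bool(azure_endpoint),
        "azure_auth": azure_auth,
        "gemini_enabled": bool(gemini_key),
        "gemini_model": env.get("MCP_LOCAL_RAG_GEMINI_MODEL", DEFAULT_GEMINI_MODEL),
        # Image files need either Gemini or Azure DI.
        "images_supported": bool(azure_endpoint or gemini_key),
    }
```

Calling summarize_extraction_config() with no arguments reads the current process environment, which can help confirm that the variables are visible to the process that launches the server.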

Data Storage

By default, the server stores data in:

  • Windows: %LOCALAPPDATA%\mcp-local-rag\
  • macOS: ~/Library/Application Support/mcp-local-rag/
  • Linux: $XDG_DATA_HOME/mcp-local-rag/

The data directory contains:

  • markdown/ - Extracted Markdown content of indexed documents
  • metadata.db - SQLite database for document/collection metadata
  • qdrant/ - Vector database for embeddings
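
The layout above can be checked with a short read-only script. This is a minimal sketch: inspect_data_dir is a hypothetical helper, not part of mcp-local-rag, and it assumes nothing about the schema of metadata.db beyond it being a SQLite file, so it only lists table names.

```python
import sqlite3
from pathlib import Path


def inspect_data_dir(data_dir: Path) -> dict:
    """Hypothetical helper: summarize the contents of an mcp-local-rag data directory."""
    markdown_dir = data_dir / "markdown"
    db_path = data_dir / "metadata.db"

    # Collect the names of extracted files, if the folder exists.
    markdown_files = []
    if markdown_dir.is_dir():
        markdown_files = sorted(p.name for p in markdown_dir.rglob("*") if p.is_file())

    tables = []
    if db_path.is_file():
        # Open read-only so that inspection cannot modify the metadata.
        uri = db_path.resolve().as_uri() + "?mode=ro"
        conn = sqlite3.connect(uri, uri=True)
        try:
            rows = conn.execute(
                "SELECT name FROM sqlite_master WHERE type = 'table' ORDER BY name"
            ).fetchall()
            tables = [row[0] for row in rows]
        finally:
            conn.close()

    return {
        "markdown_files": markdown_files,
        "tables": tables,
        "has_qdrant_dir": (data_dir / "qdrant").is_dir(),
    }
```

It is probably safest to run this while the server is stopped, since the vector store and database may be in use otherwise.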

AI models are cached in the default Hugging Face cache directory (~/.cache/huggingface/).

To customize the data directory, set the MCP_LOCAL_RAG_DATA_DIR environment variable (an mcp-local-rag/ subfolder is created automatically inside it).
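
The location rules described in this section can be sketched as a small function. This is an illustration only: resolve_data_dir is a hypothetical name, and the fallbacks used when LOCALAPPDATA or XDG_DATA_HOME is unset (~/AppData/Local and ~/.local/share) are assumptions based on common platform conventions, not behavior confirmed by this README.

```python
import os
import sys
from pathlib import Path
from typing import Mapping, Optional

APP_DIR = "mcp-local-rag"


def resolve_data_dir(
    env: Mapping[str, str] = os.environ,
    platform: str = sys.platform,
    home: Optional[Path] = None,
) -> Path:
    """Hypothetical helper: compute where the server would store its data."""
    home = home or Path.home()

    # An explicit override wins; the mcp-local-rag/ subfolder is still appended.
    override = env.get("MCP_LOCAL_RAG_DATA_DIR")
    if override:
        return Path(override) / APP_DIR

    if platform.startswith("win"):
        # Assumed fallback when LOCALAPPDATA is unset.
        base = Path(env.get("LOCALAPPDATA") or home / "AppData" / "Local")
    elif platform == "darwin":
        base = home / "Library" / "Application Support"
    else:
        # Assumed fallback when XDG_DATA_HOME is unset.
        base = Path(env.get("XDG_DATA_HOME") or home / ".local" / "share")

    return base / APP_DIR
```

Passing env, platform, and home explicitly makes the function easy to check for each operating system without changing the real environment.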

Usage

VS Code

Add to .vscode/mcp.json:

{
  "servers": {
    "mcp-local-rag": {
      "command": "uvx",
      "args": [
        "--python",
        "3.13",  // Does not support Python 3.14 yet: https://github.com/microsoft/markitdown/issues/1470
        "mcp-local-rag@latest"
      ]
    }
  }
}

If you run into SSL errors (for example, behind a Zscaler proxy), try:

{
  "servers": {
    "mcp-local-rag": {
      "command": "uvx",
      "args": [
        "--native-tls",
        "--python",
        "3.13",  // Does not support Python 3.14 yet: https://github.com/microsoft/markitdown/issues/1470
        "--with",
        "pip-system-certs",
        "mcp-local-rag@latest"
      ]
    }
  }
}
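
The two configurations above differ only in the TLS-related arguments, so they can be generated from one function. The sketch below is a convenience illustration: build_server_config is a hypothetical helper, and it only reuses the command and arguments already shown in this section. The generated JSON carries no comments, so the Python 3.14 note is kept as a Python comment instead.

```python
import json


def build_server_config(python_version: str = "3.13", native_tls: bool = False) -> dict:
    """Hypothetical helper: build the .vscode/mcp.json content as a dict."""
    args = []
    if native_tls:
        # Use the platform's certificate store (helps behind TLS-inspecting proxies).
        args.append("--native-tls")
    # Python 3.14 is not supported yet:
    # https://github.com/microsoft/markitdown/issues/1470
    args += ["--python", python_version]
    if native_tls:
        args += ["--with", "pip-system-certs"]
    args.append("mcp-local-rag@latest")

    return {"servers": {"mcp-local-rag": {"command": "uvx", "args": args}}}


def render_server_config(native_tls: bool = False) -> str:
    """Serialize the config as indented JSON, ready to save as .vscode/mcp.json."""
    return json.dumps(build_server_config(native_tls=native_tls), indent=2)
```

For example, print(render_server_config(native_tls=True)) emits the SSL-workaround variant, which can then be pasted into .vscode/mcp.json.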
