Skip to main content

Lightweight Databricks Documentation MCP Server

Project description

Databricks Documentation MCP Server

A lightweight, stateless Model Context Protocol (MCP) server that lets AI assistants read and search docs.databricks.com in real time — no local cache, no database, no crawling required.

Features

  • Live fetch — always returns current documentation content, never stale
  • Full-text search — real-time, site-scoped search via DuckDuckGo site: operator
  • Docusaurus-aware extraction — strips navigation, sidebars, and page chrome; returns clean markdown
  • Section extraction — pull specific h2 sections from long reference pages
  • Paginationstart_index and max_length parameters for large pages

Prerequisites

  • Python 3.10 or later
  • uv (recommended) or pip

Installation

# run directly without installing (recommended)
uvx databricks-docs-mcp

# or install permanently
pip install databricks-docs-mcp

MCP client configuration (release install)

Claude Desktop

Edit ~/Library/Application Support/Claude/claude_desktop_config.json (macOS) or
%APPDATA%\Claude\claude_desktop_config.json (Windows):

{
  "mcpServers": {
    "databricks-docs": {
      "command": "uvx",
      "args": ["databricks-docs-mcp"]
    }
  }
}

VS Code (GitHub Copilot)

Add to .vscode/mcp.json in your workspace:

{
  "servers": {
    "databricks-docs": {
      "type": "stdio",
      "command": "uvx",
      "args": ["databricks-docs-mcp"]
    }
  }
}

To pin a specific version, use "args": ["databricks-docs-mcp==1.2.0"].

From source (development)

git clone https://gitlab.com/rokorolev/databricks-docs-mcp.git
cd databricks-docs-mcp
uv sync --extra dev

MCP client config for a local clone:

{
  "servers": {
    "databricks-docs": {
      "type": "stdio",
      "command": "uv",
      "args": ["--directory", "/absolute/path/to/databricks-mcp", "run", "databricks-docs-mcp"]
    }
  }
}

Environment Variables

Variable Default Description
MCP_USER_AGENT Mozilla/5.0 (compatible; DatabricksDocsMCP/1.0) HTTP User-Agent sent with every request
FASTMCP_LOG_LEVEL WARNING Log verbosity: DEBUG, INFO, WARNING, ERROR

Tools

search_documentation

Search docs.databricks.com using a site-scoped real-time web search.

Parameter Type Default Description
query string Keywords or topic to search for
limit integer 10 Maximum results to return (max 30)

Returns a JSON array of results with URL, title, and snippet.

read_documentation

Fetch a docs.databricks.com page as clean markdown.

Parameter Type Default Description
url string Full docs.databricks.com URL
max_length integer 5000 Maximum characters to return per call
start_index integer 0 Character offset for pagination

Returns markdown-formatted page content with a continuation hint when the page is truncated.

read_sections

Extract specific h2 sections from a docs page by heading title.

Parameter Type Default Description
url string Full docs.databricks.com URL
section_titles string[] h2 heading titles to extract (case-insensitive)

Returns markdown of the matched sections only.

Basic Usage

Recommended workflow

1. Search for a topic:

search_documentation("Delta Live Tables pipeline settings")

2. Read the most relevant result:

read_documentation("https://docs.databricks.com/aws/en/dlt/settings.html")

3. Extract specific sections from large pages:

read_sections(
  "https://docs.databricks.com/aws/en/dlt/settings.html",
  ["Pipeline mode", "Compute settings"]
)

Tips

  • Databricks docs URLs follow the pattern https://docs.databricks.com/<cloud>/en/<topic>/...
    Use aws for AWS, gcp for GCP, azure for Azure.
  • Use start_index in read_documentation to page through long articles.
  • Section titles for read_sections are matched case-insensitively against <h2> headings on the page.

Development

uv sync --extra dev

Lint

uv run ruff check src/ tests/

Run tests

uv run --frozen pytest --cov --cov-branch --cov-report=term-missing

Project structure

src/
  databricks_docs_mcp/
    server.py   # MCP server and tool definitions
    utils.py    # HTML extraction and formatting utilities
    models.py   # Pydantic models for search results
tests/
  test_server.py
  test_utils.py

License

MIT — see LICENSE.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

databricks_docs_mcp-1.0.5.tar.gz (125.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

databricks_docs_mcp-1.0.5-py3-none-any.whl (10.9 kB view details)

Uploaded Python 3

File details

Details for the file databricks_docs_mcp-1.0.5.tar.gz.

File metadata

  • Download URL: databricks_docs_mcp-1.0.5.tar.gz
  • Upload date:
  • Size: 125.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Debian GNU/Linux","version":"13","id":"trixie","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for databricks_docs_mcp-1.0.5.tar.gz
Algorithm Hash digest
SHA256 a9683180345b8a1027bc0ce14ed334a1ad60ac3c5065d4fa53cbc129069d0a86
MD5 f22931b06cb3fca51516840d2c658204
BLAKE2b-256 56e7465c87522612378299d1ba47f7d3219b890cabb70fe811e5942faffc8b40

See more details on using hashes here.

File details

Details for the file databricks_docs_mcp-1.0.5-py3-none-any.whl.

File metadata

  • Download URL: databricks_docs_mcp-1.0.5-py3-none-any.whl
  • Upload date:
  • Size: 10.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.14 {"installer":{"name":"uv","version":"0.11.14","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Debian GNU/Linux","version":"13","id":"trixie","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for databricks_docs_mcp-1.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 176f4df480caa29cf3c8eaffbf6dfc7e2539ed2db4f9d9604d3eaa566614a4f4
MD5 d36ce937ec2a889e4349a113325c5b21
BLAKE2b-256 99a301a53aef5c76d7534caca68bd799785598ef85a1deaa9b3f3b437c4c1f5c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page