Skip to main content

MCP server exposing Apache Spark Connect (and Databricks Connect) via DataFrame and SQL tools for AI agents

Project description

spark-connect-mcp

MCP server exposing Apache Spark Connect (and Databricks Connect) via DataFrame and SQL tools for AI agents.

Install

Choose one backend — do not install both.

# OSS Spark Connect
pip install "spark-connect-mcp[spark]"

# Databricks Connect
pip install "spark-connect-mcp[databricks]"

Quick Start

Add to your Claude Code MCP config:

{
  "mcpServers": {
    "spark": {
      "command": "uvx",
      "args": ["--from", "spark-connect-mcp[databricks]", "spark-connect-mcp"]
    }
  }
}

For OSS Spark Connect, replace [databricks] with [spark].

Configuration

All connection config is set via environment variables — the MCP tools require no parameters to start a session.

OSS Spark Connect

Set SPARK_REMOTE to your Spark Connect server URL (PySpark's native env var):

export SPARK_REMOTE=sc://localhost:15002

Databricks Connect

Optionally set DATABRICKS_CONFIG_PROFILE to select a profile from ~/.databrickscfg (defaults to DEFAULT):

export DATABRICKS_CONFIG_PROFILE=my-workspace

Serverless compute is used by default inside Databricks Apps, Jobs, and notebooks — no env var needed.

Status

Under active development. See issues for the roadmap.

License

Apache-2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spark_connect_mcp-0.1.0.tar.gz (102.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

spark_connect_mcp-0.1.0-py3-none-any.whl (15.9 kB view details)

Uploaded Python 3

File details

Details for the file spark_connect_mcp-0.1.0.tar.gz.

File metadata

  • Download URL: spark_connect_mcp-0.1.0.tar.gz
  • Upload date:
  • Size: 102.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for spark_connect_mcp-0.1.0.tar.gz
Algorithm Hash digest
SHA256 8da78acef1bc20a8443660bc9d1e351ba9d86d561a4235319fe0c1877f19b1ed
MD5 1d3bc8e6ce8a48d7366849e853baa9f0
BLAKE2b-256 be23802faa677dda9c3eb8178abf90476c4ffa7d8312536777df5f9edcaaf7f5

See more details on using hashes here.

Provenance

The following attestation bundles were made for spark_connect_mcp-0.1.0.tar.gz:

Publisher: release.yml on IceRhymers/spark-connect-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file spark_connect_mcp-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for spark_connect_mcp-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 4f9f7b129897ef9c2a6b93dfa9136bf732c717b211535c8c11c71d4b620b6e8d
MD5 04e54371949a19bdd79822b30a246d2c
BLAKE2b-256 a0933389cd19d0026f90166fc38bbab6d7ce8c7c6cb6dc1a70c8fde1551f6845

See more details on using hashes here.

Provenance

The following attestation bundles were made for spark_connect_mcp-0.1.0-py3-none-any.whl:

Publisher: release.yml on IceRhymers/spark-connect-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page