Skip to main content

A Model Context Protocol (MCP) server for MLflow - enables LLMs to interact with MLflow experiments, runs, metrics, and models

Project description

MLflow MCP Server

A Model Context Protocol (MCP) server that enables LLMs to interact with MLflow tracking servers. Query experiments, analyze runs, compare metrics, and explore the model registry - all through natural language.

Features

  • Experiment Management: List and search experiments, discover available metrics and parameters
  • Run Analysis: Retrieve run details, query runs with filters, find best performing models
  • Metrics & Parameters: Get metric histories, compare parameters across runs
  • Artifacts: Browse and download run artifacts
  • Model Registry: Access registered models, versions, and deployment stages
  • Comparison Tools: Side-by-side run comparisons, best run selection
  • Tag-based Search: Filter runs by custom tags

Installation

Using uvx (Recommended)

# Run directly without installation
uvx mlflow-mcp

# Or install globally
pip install mlflow-mcp

From Source

git clone https://github.com/kirillkruglikov/mlflow-mcp.git
cd mlflow-mcp
uv sync
uv run mlflow-mcp

Configuration

Claude Desktop

Add to your Claude Desktop config file:

macOS: ~/Library/Application Support/Claude/claude_desktop_config.json Windows: %APPDATA%\Claude\claude_desktop_config.json Linux: ~/.config/claude/claude_desktop_config.json

{
  "mcpServers": {
    "mlflow": {
      "command": "uvx",
      "args": ["mlflow-mcp"],
      "env": {
        "MLFLOW_TRACKING_URI": "http://localhost:5000"
      }
    }
  }
}

Environment Variables

  • MLFLOW_TRACKING_URI (required): Your MLflow tracking server URL
    • Examples: http://localhost:5000, https://mlflow.company.com

Available Tools

Experiments

  • get_experiments() - List all experiments
  • get_experiment_by_name(name) - Get experiment by name
  • get_experiment_metrics(experiment_id) - Discover all unique metrics
  • get_experiment_params(experiment_id) - Discover all unique parameters

Runs

  • get_runs(experiment_id, limit=10) - Get runs for an experiment
  • get_run(run_id) - Get detailed run information
  • query_runs(experiment_id, query, limit=10) - Filter runs (e.g., "metrics.accuracy > 0.9")
  • search_runs_by_tags(experiment_id, tags) - Find runs by tags

Metrics & Parameters

  • get_run_metrics(run_id) - Get all metrics for a run
  • get_run_metric(run_id, metric_name) - Get full metric history with steps

Artifacts

  • get_run_artifacts(run_id, path="") - List artifacts (supports browsing directories)
  • get_run_artifact(run_id, artifact_path) - Download artifact
  • get_artifact_content(run_id, artifact_path) - Read artifact content (text/json)

Analysis & Comparison

  • get_best_run(experiment_id, metric, ascending=False) - Find best run by metric
  • compare_runs(experiment_id, run_ids) - Side-by-side comparison

Model Registry

  • get_registered_models() - List all registered models
  • get_model_versions(model_name) - Get all versions of a model
  • get_model_version(model_name, version) - Get version details with metrics

Health

  • health() - Check server connectivity

Usage Examples

Ask Claude

"Show me all experiments in MLflow"

"What are the top 5 runs by accuracy in experiment 'my-experiment'?"

"Compare runs abc123 and def456"

"Which model has the highest F1 score?"

"Show me the training loss curve for run xyz789"

"List all production models in the registry"

Development

# Install dependencies
uv sync

# Run server
uv run mlflow-mcp

# Run tests
uv run pytest

# Format code
uv run black src/
uv run ruff check src/

Requirements

  • Python >=3.10
  • MLflow >=3.4.0
  • Access to an MLflow tracking server

License

MIT License - see LICENSE file for details.

Contributing

Contributions welcome! Please open an issue or submit a pull request.

Links

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlflow_mcp-0.1.1.tar.gz (177.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

mlflow_mcp-0.1.1-py3-none-any.whl (9.2 kB view details)

Uploaded Python 3

File details

Details for the file mlflow_mcp-0.1.1.tar.gz.

File metadata

  • Download URL: mlflow_mcp-0.1.1.tar.gz
  • Upload date:
  • Size: 177.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0rc1

File hashes

Hashes for mlflow_mcp-0.1.1.tar.gz
Algorithm Hash digest
SHA256 cc8c829d6c67b341aeab3fba76cff1534c1945d02ccb5ed7ecfd87845aa187e6
MD5 38a51421953d88804e70b0d4e72af567
BLAKE2b-256 593125e8301ada819557dcf7931ef9e3a0992d2be6ae008833ac454016fd29eb

See more details on using hashes here.

File details

Details for the file mlflow_mcp-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: mlflow_mcp-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 9.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.0rc1

File hashes

Hashes for mlflow_mcp-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 8b1e67691c1cee369a9813d2dc5090ad53f2d2b3030e46758aab58f6d0411486
MD5 a3205c613ef5ddead2e6b737a178ee26
BLAKE2b-256 bf2be21f9a8d82c600d4e142c5fd1bb7c40e09837a0b8567b2b9a52ddbf1bd4f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page