A Model Context Protocol (MCP) server for MLflow - enables LLMs to interact with MLflow experiments, runs, metrics, and models
Project description
MLflow MCP Server
A Model Context Protocol (MCP) server that enables LLMs to interact with MLflow tracking servers. Query experiments, analyze runs, compare metrics, manage the model registry, and promote models to production — all through natural language.
Features
- Experiment Management: List, search, and filter experiments
- Run Analysis: Query runs, compare metrics, find best performing models
- Metrics & Parameters: Get metric histories, compare parameters across runs
- Artifacts: Browse and download run artifacts
- LoggedModel Support: Search and retrieve MLflow 3 LoggedModel entities
- Model Registry: Full registry management — register, tag, alias, stage, and promote models
- Write & Delete Actions: Tag, alias, register, promote, and delete runs/experiments/models
- MCP Prompts: Built-in guided workflows for common tasks
- Pagination: Offset-based pagination for browsing large result sets
Installation
Using uvx (Recommended)
# Run directly without installation
uvx mlflow-mcp
# Or install globally
pip install mlflow-mcp
From Source
git clone https://github.com/kkruglik/mlflow-mcp.git
cd mlflow-mcp
uv sync
uv run mlflow-mcp
Configuration
Claude Desktop
Add to your Claude Desktop config file:
- macOS:
~/Library/Application Support/Claude/claude_desktop_config.json - Windows:
%APPDATA%\Claude\claude_desktop_config.json - Linux:
~/.config/claude/claude_desktop_config.json
{
"mcpServers": {
"mlflow": {
"command": "uvx",
"args": ["mlflow-mcp"],
"env": {
"MLFLOW_TRACKING_URI": "http://localhost:5000"
}
}
}
}
Claude Code (project-scoped)
Add .mcp.json to your project root:
{
"mcpServers": {
"mlflow": {
"command": "uvx",
"args": ["mlflow-mcp"],
"env": {
"MLFLOW_TRACKING_URI": "http://localhost:5000"
}
}
}
}
Authenticated Server
For MLflow servers with authentication, add credentials to the env block:
{
"mcpServers": {
"mlflow": {
"command": "uvx",
"args": ["mlflow-mcp"],
"env": {
"MLFLOW_TRACKING_URI": "https://mlflow.company.com",
"MLFLOW_TRACKING_USERNAME": "your-username",
"MLFLOW_TRACKING_PASSWORD": "your-password"
}
}
}
}
For Databricks or token-based auth, use MLFLOW_TRACKING_TOKEN instead:
{
"mcpServers": {
"mlflow": {
"command": "uvx",
"args": ["mlflow-mcp"],
"env": {
"MLFLOW_TRACKING_URI": "https://mlflow.company.com",
"MLFLOW_TRACKING_TOKEN": "your-token"
}
}
}
}
Environment Variables
| Variable | Required | Description |
|---|---|---|
MLFLOW_TRACKING_URI |
Yes | MLflow tracking server URL, e.g. http://127.0.0.1:5000 |
MLFLOW_TRACKING_USERNAME |
No | HTTP Basic Auth username (MLflow built-in auth) |
MLFLOW_TRACKING_PASSWORD |
No | HTTP Basic Auth password (MLflow built-in auth) |
MLFLOW_TRACKING_TOKEN |
No | Bearer token (Databricks or token-based setups) |
Tools
Experiments
| Tool | Description |
|---|---|
get_experiments() |
List all experiments |
search_experiments(filter_string, order_by, max_results) |
Filter and sort experiments |
get_experiment_by_name(name) |
Get experiment by name |
get_experiment_metrics(experiment_id) |
Discover all unique metric keys |
get_experiment_params(experiment_id) |
Discover all unique parameter keys |
set_experiment_tag(experiment_id, key, value) |
Tag an experiment |
delete_experiment(experiment_id) |
Delete an experiment (moves to deleted stage) |
Runs
| Tool | Description |
|---|---|
get_runs(experiment_id, limit, offset, order_by) |
List runs with full details, sorting and pagination |
get_run(run_id) |
Get detailed run information |
get_parent_run(run_id) |
Get parent run for nested runs |
query_runs(experiment_id, query, limit, offset, order_by) |
Filter runs, e.g. "metrics.accuracy > 0.9" |
search_runs_by_tags(experiment_id, tags, limit, offset) |
Find runs by tag key/value |
set_run_tag(run_id, key, value) |
Tag a run |
delete_run(run_id) |
Delete a run (moves to deleted stage) |
Metrics & Parameters
| Tool | Description |
|---|---|
get_run_metrics(run_id) |
Get all metrics for a run |
get_run_metric(run_id, metric_name) |
Get full metric history with steps |
Artifacts
| Tool | Description |
|---|---|
get_run_artifacts(run_id, path) |
List artifacts, supports browsing subdirectories |
get_run_artifact(run_id, artifact_path) |
Download an artifact file |
get_artifact_content(run_id, artifact_path) |
Read artifact content as text/JSON |
Analysis & Comparison
| Tool | Description |
|---|---|
get_best_run(experiment_id, metric, ascending) |
Find best run by metric |
compare_runs(experiment_id, run_ids) |
Side-by-side run comparison |
Logged Models (MLflow 3)
| Tool | Description |
|---|---|
search_logged_models(experiment_ids, filter_string, order_by, max_results) |
Search logged models by metrics/params/tags |
get_logged_model(model_id) |
Get full details of a logged model |
Model Registry
| Tool | Description |
|---|---|
get_registered_models() |
List all registered models |
get_registered_model(name) |
Full model details including versions and aliases |
get_model_versions(model_name) |
Get all versions of a model |
get_model_version(model_name, version) |
Get version details with metrics |
get_model_version_by_alias(name, alias) |
Get version by alias, e.g. "champion" |
get_latest_versions(name, stages) |
Get latest versions per stage |
register_model(model_name, model_uri, tags) |
Register a model into the registry |
update_model_version(name, version, description) |
Update version description |
set_registered_model_tag(name, key, value) |
Tag a registered model |
set_model_alias(name, alias, version) |
Assign an alias to a model version |
delete_model_alias(name, alias) |
Remove an alias from a model |
copy_model_version(src_model_name, src_version, dst_model_name) |
Promote version to another registered model |
transition_model_version_stage(name, version, stage) |
Transition to Staging/Production/Archived (deprecated since MLflow 2.9, use aliases instead) |
delete_model_version(name, version) |
Delete a model version |
delete_registered_model(name) |
Delete a registered model and all its versions |
Health
| Tool | Description |
|---|---|
health() |
Check server connectivity |
Prompts
Built-in guided workflows available as slash commands in Claude:
| Prompt | Description |
|---|---|
compare_runs_by_ids |
Compare specific runs side-by-side |
find_best_run |
Find and analyze the best run in an experiment by metric |
promote_best_model |
End-to-end: find best model → register → tag → alias → promote |
Usage Examples
Explore experiments and runs
"Show me all experiments. Which ones were updated recently?"
"What metrics and parameters are tracked in experiment 'fraud-detection'?"
"Get the top 10 runs in 'fraud-detection' sorted by test/f1. Show me the params that differ most between the top 3."
"Find all runs tagged with model_type=lightgbm and compare their recall scores."
Analyze a training run
"Show me the full details of run abc123 — metrics, params, and artifacts."
"Plot the training loss curve for run abc123." (Claude fetches metric history and renders a chart)
"This run has a parent — show me the parent run and compare their metrics."
Find and register the best model
"Find the best logged model in experiment 'fraud-detection' by test/recall. Register it as 'fraud-classifier' with a selection_metric tag."
"Which logged model in experiments 1 and 2 has the highest F1 score on the validation set?"
"Register the model from run abc123 artifact path 'model/' as 'my-classifier'."
Manage the model registry
"Show me all versions of 'fraud-classifier' with their aliases and stages."
"Set the champion alias on version 3 of fraud-classifier."
"Update the description of fraud-classifier v3 to explain what dataset it was trained on."
"Copy fraud-classifier v3 to a separate 'fraud-classifier-prod' model as the production entry."
End-to-end promotion workflow
"Find the best model in 'fraud-detection' by test/recall, register it as 'fraud-classifier', tag it with the framework and problem type, and set it as champion. Ask me before copying to prod."
(This maps directly to the promote_best_model built-in prompt)
Debugging
Use MCP Inspector to browse tools, call them with custom inputs, and inspect raw responses — without involving an LLM.
Published package:
npx @modelcontextprotocol/inspector uvx mlflow-mcp
Local source:
npx @modelcontextprotocol/inspector uv run --project /path/to/mlflow-mcp mlflow-mcp
Set MLFLOW_TRACKING_URI in the Inspector's environment panel, or pass it inline:
MLFLOW_TRACKING_URI=http://127.0.0.1:5000 npx @modelcontextprotocol/inspector uvx mlflow-mcp
Requirements
- Python >=3.10
- MLflow >=3.4.0
- Access to an MLflow tracking server
License
MIT License - see LICENSE file for details.
Contributing
Contributions welcome! Please open an issue or submit a pull request.
Links
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mlflow_mcp-0.3.0.tar.gz.
File metadata
- Download URL: mlflow_mcp-0.3.0.tar.gz
- Upload date:
- Size: 179.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
10e50558dad2d853f7ca9d20a96ca85a2603b7d64e539cca1fa194f8d2d8fff9
|
|
| MD5 |
3936358d894f3e5f82fe9c8e1771dc71
|
|
| BLAKE2b-256 |
1cd6b4826ebe42f6bd42987297387791cdba9c65abf059c15ed8d4758a1698be
|
Provenance
The following attestation bundles were made for mlflow_mcp-0.3.0.tar.gz:
Publisher:
python-publish.yml on kkruglik/mlflow-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
mlflow_mcp-0.3.0.tar.gz -
Subject digest:
10e50558dad2d853f7ca9d20a96ca85a2603b7d64e539cca1fa194f8d2d8fff9 - Sigstore transparency entry: 1339106255
- Sigstore integration time:
-
Permalink:
kkruglik/mlflow-mcp@6948a5f85722cbbfc23bf6dcab27966fd8b68944 -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/kkruglik
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@6948a5f85722cbbfc23bf6dcab27966fd8b68944 -
Trigger Event:
release
-
Statement type:
File details
Details for the file mlflow_mcp-0.3.0-py3-none-any.whl.
File metadata
- Download URL: mlflow_mcp-0.3.0-py3-none-any.whl
- Upload date:
- Size: 14.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d8136e82d64fe3a5c9b44d41fae2966f8e414f164789d612c29794e062c90c5b
|
|
| MD5 |
da08c644e976fddc1f5fbd7665b67963
|
|
| BLAKE2b-256 |
150fba5c42fb3ccffee337e4234c233b91810992e43b45606406cb4ca0afa4c1
|
Provenance
The following attestation bundles were made for mlflow_mcp-0.3.0-py3-none-any.whl:
Publisher:
python-publish.yml on kkruglik/mlflow-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
mlflow_mcp-0.3.0-py3-none-any.whl -
Subject digest:
d8136e82d64fe3a5c9b44d41fae2966f8e414f164789d612c29794e062c90c5b - Sigstore transparency entry: 1339106342
- Sigstore integration time:
-
Permalink:
kkruglik/mlflow-mcp@6948a5f85722cbbfc23bf6dcab27966fd8b68944 -
Branch / Tag:
refs/tags/v0.3.0 - Owner: https://github.com/kkruglik
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
python-publish.yml@6948a5f85722cbbfc23bf6dcab27966fd8b68944 -
Trigger Event:
release
-
Statement type: