MCP server that lets AI agents run real ML experiments end-to-end.
Project description
mcp-ml-lab
Let AI agents run real ML experiments end-to-end.
An MCP server that gives Claude (or any MCP-aware AI agent) the ability to profile a CSV, define an ML task, tune XGBoost and LightGBM with Optuna, and produce a markdown report with feature importance — all from natural language.
Why this exists
The existing ML-related MCP servers wrap MLflow, ZenML, or Weights & Biases
and expose them as read-only — agents can browse experiment history but
can't actually run anything. mcp-ml-lab fills the gap: it lets agents
execute the full experimentation loop from a user's natural-language request.
A user typing "train a model on titanic.csv to predict survival" should not
need to know what XGBoost is, what cross-validation is, or how to write a
hyperparameter search. The agent handles all of that — mcp-ml-lab is the
tools layer that makes it possible.
Quick start
pip install mcp-ml-lab
Add to your Claude Desktop config (~/Library/Application Support/Claude/claude_desktop_config.json on macOS):
{
"mcpServers": {
"ml-lab": {
"command": "mcp-ml-lab"
}
}
}
Restart Claude Desktop. The five tools below are now available.
Example queries
Try these in Claude Desktop with mcp-ml-lab connected:
- "Profile this CSV and tell me if there's class imbalance"
- "Compare XGBoost and LightGBM on titanic.csv with 60 seconds of tuning"
- "Show me the top 10 features the winning model used"
- "How did my last three experiments on the wine dataset compare?"
Tools
| Tool | What it does |
|---|---|
inspect_data |
Profile a CSV — shape, dtypes, nulls, summary stats, class balance |
define_task |
Register an ML task (CSV + target + classification/regression) |
run_experiment |
Train one or more models, optionally tuning with Optuna |
get_results |
Markdown report with metrics, hyperparameters, feature importance |
compare_runs |
Side-by-side comparison of multiple experiments |
Each tool's full signature is in its docstring; they self-document to the LLM.
How it works
Claude Desktop ───MCP/stdio─── mcp-ml-lab server
│
├── data.py CSV loading, schema inference, preprocessor
├── trainers/ Pluggable XGBoost + LightGBM adapters
├── search.py Stratified CV + Optuna TPE tuning
├── metrics.py Accuracy, F1, AUC, log loss
├── storage.py SQLite via SQLAlchemy 2.0
└── reporting.py Markdown report generation
All experiments and trials are persisted to ~/.mcp-ml-lab/store.db so an
agent can refer back to runs across sessions.
Full design notes in ARCHITECTURE.md.
Roadmap
v0.1.0 ships classification with XGBoost and LightGBM. Planned for v0.2.0+:
- Regression tasks
- Time series forecasting (sktime / darts integration)
- Deep learning baselines (pytorch-tabular)
- Optuna multi-objective search (accuracy × latency × model size)
- Persisted model artifacts with Docker reproducibility
- Permutation feature importance (bias-free alternative to gain importance)
- Notebook export — emit a Jupyter notebook that reproduces the winning run
Issues and PRs welcome.
Development
git clone https://github.com/rohithraju-ops/mcp-ml-lab.git
cd mcp-ml-lab
python -m venv .venv && source .venv/bin/activate
pip install -e ".[dev]"
pytest -v
Local debugging is easiest with the MCP Inspector:
npx @modelcontextprotocol/inspector mcp-ml-lab
License
MIT.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file mcp_ml_lab-0.1.0.tar.gz.
File metadata
- Download URL: mcp_ml_lab-0.1.0.tar.gz
- Upload date:
- Size: 23.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
707bf6efafe17e5c2027ddebf1c3e45731abaa9668242a3916434b8622576f98
|
|
| MD5 |
930c0e8d84577d21b787c63792b15a74
|
|
| BLAKE2b-256 |
72a57bed700bd28b11c854362285c45f02f21bd68b956eeb41fc54de4e09ea32
|
File details
Details for the file mcp_ml_lab-0.1.0-py3-none-any.whl.
File metadata
- Download URL: mcp_ml_lab-0.1.0-py3-none-any.whl
- Upload date:
- Size: 24.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
0e80a6f4b54f1c23d3faa8f5ccb255843ca1775dba1fc43ff416db302ce13382
|
|
| MD5 |
8572ddeaf801525962429d19b66ca336
|
|
| BLAKE2b-256 |
85ddf00720f6b0552a8f464afe71cf573c4b3068c3b31804d47f19cc4cc54ee7
|