A safety-first MCP server connecting Claude to Kaggle (competitions, datasets, kernels, models).
Project description
kaggle-mcp
A safety-first Model Context Protocol server connecting Claude (Desktop / Code) to Kaggle — competitions, datasets, kernels (notebooks), and models.
safe-kaggle-mcp is the only Kaggle MCP built safety-first. Every irreversible action — submit, delete, public-publish — is gated behind a two-call preview→commit confirm token, a per-competition submission budget surfaced in every response, and off-by-default destructive flags, while all Kaggle-returned text is fenced in
<untrusted-content>and your credentials never enter the model context. Other servers chase tool count and the official remote server wins on zero-install convenience — but none of them, official included, gate a single destructive call, cap submissions, or harden against prompt injection. It trades raw breadth for the one thing an autonomous agent with your Kaggle account actually needs: it cannot quietly burn your daily submissions, leak your key, publish your private work, or be hijacked by a poisoned discussion thread.
claude mcp add --transport stdio kaggle -- uvx --from safe-kaggle-mcp kaggle-mcp
How it compares
| Feature | safe-kaggle-mcp | Typical community server | Official remote / Composio |
|---|---|---|---|
| Two-call preview→commit confirm tokens (submit/delete/publish) | ✅ Yes | ❌ No (single-shot) | ❌ No (single-shot) |
| Per-competition submission budget surfaced to the agent | ✅ Yes (default 5/day) | ❌ No | ❌ No (opaque server quota) |
Prompt-injection hardening (<untrusted-content> fencing) |
✅ Yes | ❌ No | ❌ No |
| Private-by-default + off-by-default destructive/publish flags | ✅ Yes | ⚠️ Optional is_private param |
❌ No |
No in-chat authenticate() + 40-hex credential redaction |
✅ Yes | ❌ Some leak the key into context | ⚠️ OAuth keeps key out, no redaction |
| Zip-slip-safe / sandboxed downloads | ✅ Yes | ❌ No | ❌ Can't sandbox local writes |
| Callable local EDA tool (compact pandas digest, never raw rows) | ✅ Yes (dataset + competition) | ❌ EDA prompt at best | ❌ No |
| Competition train/test schema diff + target auto-detect | ✅ Yes | ❌ No | ❌ No |
| Competition landscape triage (days-left, budget, metric) | ✅ Yes | ❌ Thin raw list | ❌ Raw fields, no digest |
| Submission best-score digest (metric-aware best/trend/failures) | ✅ Yes | ❌ Raw list | ❌ Raw list |
| Leaderboard delta tracking (local snapshots + rank moves) | ✅ Yes | ❌ No | ❌ No API endpoint either |
| Session audit ledger of mutating actions (redacted) | ✅ Yes | ❌ No | ❌ No |
| Read-only discussion + write-up digest, injection-fenced | ✅ Yes (untrusted-fenced) | ⚠️ Yes but unfenced (Galaxy-Dawn) | ⚠️ Unfenced |
| One-call competition kickoff (data + EDA + baseline plan) | ✅ Yes | ❌ No | ❌ No |
| Output discipline (ranked/capped tables, pagination, top-N) | ✅ Yes | ❌ Undocumented | ❌ Client's job |
Modern KGAT_ token + legacy username/key auth |
✅ Both | ⚠️ Mostly legacy-only | ✅ OAuth 2.0 |
| Install | Local stdio (PyPI + git, uvx) |
Local (pip/uv) | ✅ Zero-install remote |
| Raw tool breadth | 41 tools (no benchmark listing — API exposes none) | up to ~51 | ~35–57 |
The two honest places we don't lead: the official remote server is zero-install, and the broadest community server (Galaxy-Dawn, ~51 tools) has more raw endpoints (e.g. benchmarks). Where they expose ~10 forum tools unfenced, we expose read-only discussion search + read with every body fenced as <untrusted-content> — and no posting tool, because the API has none.
What you can do with it
Drive your whole Kaggle workflow from a Claude chat, in plain language:
| Ask Claude… | What happens under the hood |
|---|---|
| "Triage active competitions — prize, metric, days left." | kaggle_competition_landscape (one digested, deadline-sorted report) |
| "Download titanic data, infer the target, and do an EDA." | kaggle_eda_competition → train/test schema diff (auto-target) + compact pandas digest |
| "Find a good Titanic dataset and show me what it looks like." | kaggle_search_datasets → kaggle_dataset_preview (capped, untrusted-fenced rows) |
| "Summarize the iris dataset." | kaggle_eda_dataset → shape, dtypes, missingness, target dist, top correlations |
| "Submit predictions.csv to titanic and tell me the score." | kaggle_preview_submission → kaggle_submit_to_competition (token + budget gate) → kaggle_get_submission_score |
| "Is it worth iterating — what's my best score so far?" | kaggle_submission_best_score (metric-aware best/trend/failures + today's budget) |
| "Did anyone pass me on the leaderboard since last check?" | kaggle_leaderboard_track (snapshot deltas, who passed you) |
| "What techniques are people discussing for this competition?" | kaggle_search_discussions → kaggle_get_discussion (read-only, every post untrusted-fenced) |
| "Find the winning solution write-ups for this competition." | kaggle_search_writeups → kaggle_get_discussion |
| "Kick off this competition for me." | kaggle_competition_kickoff — one call: metric + rules + data download + train/test EDA (auto-target) + baseline plan + budget |
| "What mutating actions have I taken this session?" | kaggle_audit_log (redacted ledger of every submit/create/delete) |
| "Run this notebook on Kaggle's free GPU and get the output." | kaggle_push_kernel → kaggle_kernel_status → kaggle_kernel_output |
| "Save my engineered features as a private dataset version." | kaggle_version_dataset — private by default |
Higher-order workflows the agent chains itself: an autonomous competition loop (download → train → submit → read score → iterate, within the daily budget), Kaggle as a remote GPU/TPU backend, and cross-run memory via private dataset versions. Every irreversible step is gated, so the agent can run autonomously without risking your account.
What it won't do (honest limits)
Some competitors advertise capabilities the public Kaggle API can't actually deliver. We refuse to ship dishonest stubs:
- No posting to forums/discussions. The API can read topics but has no create/reply/vote endpoint — so we don't pretend to.
- No private-leaderboard or final-rank prediction. Private scores are withheld until the deadline; our tools label scores as public-only.
- No historical leaderboard from Kaggle. There's no such endpoint —
kaggle_leaderboard_trackdiffs against snapshots you captured locally, and says so. - No hidden test labels / private splits. We only ever serve files the competition exposes.
Install
# fastest: install + register with Claude Code
claude mcp add --transport stdio kaggle -- uvx --from safe-kaggle-mcp kaggle-mcp
# or straight from this repo (no PyPI):
claude mcp add --transport stdio kaggle -- uvx --from git+https://github.com/parkseokjune/kaggle-mcp kaggle-mcp
# or for development
uv sync --extra dev && uv run kaggle-mcp
The PyPI distribution is
safe-kaggle-mcp(the namekaggle-mcpwas taken); the command it installs is stillkaggle-mcp, so launch it withuvx --from safe-kaggle-mcp kaggle-mcp.
Get a token at https://www.kaggle.com/settings → API → Create New Token. Both schemes work (resolved in order):
# 1) legacy username/key (env or ~/.kaggle/kaggle.json)
export KAGGLE_USERNAME=your_user
export KAGGLE_KEY=your_40_char_key
# 2) current API token (KGAT_...): the client reads it automatically
mkdir -p ~/.kaggle && echo "KGAT_..." > ~/.kaggle/access_token && chmod 600 ~/.kaggle/access_token
Claude Desktop — edit claude_desktop_config.json, then fully restart:
{ "mcpServers": { "kaggle": {
"command": "uvx", "args": ["--from", "safe-kaggle-mcp", "kaggle-mcp"],
"env": { "KAGGLE_USERNAME": "your_user", "KAGGLE_KEY": "your_key" } } } }
Safety switches (default OFF → read-only / private posture)
| Env var | Default | Effect when 1 |
|---|---|---|
KAGGLE_MCP_ENABLE_DESTRUCTIVE |
0 |
Exposes kaggle_delete_dataset / kaggle_delete_model |
KAGGLE_MCP_ENABLE_PUBLISH |
0 |
Allows creating public datasets |
KAGGLE_MCP_SUBMISSION_CAP |
5 |
Per-competition daily submission budget |
Even when enabled, each destructive call still needs a one-time confirm_token from its *_preview_* tool, and is recorded in kaggle_audit_log.
Tools
41 tools across account, competitions, datasets, discussions, kernels, models — plus kaggle:// resources (metadata, leaderboard, rules) and /kaggle-eda, /kaggle-submit-checklist, /kaggle-landscape, /kaggle-solution-research prompts. Run kaggle_status to see your auth + submission budget.
Verified live against the Kaggle API (kaggle 2.2.2) with both auth schemes; 53 offline + 6 live read-only tests pass.
Develop
uv run pytest # offline unit + integration tests
KAGGLE_LIVE=1 uv run pytest tests/test_live_readonly.py # live read-only smoke (needs creds)
uv run mcp dev src/kaggle_mcp/server.py # MCP Inspector
License
MIT
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file safe_kaggle_mcp-0.4.2.tar.gz.
File metadata
- Download URL: safe_kaggle_mcp-0.4.2.tar.gz
- Upload date:
- Size: 46.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.15 {"installer":{"name":"uv","version":"0.11.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cde5c5bee20a09e13d11d1c5fc7e7f64d6d54399fcc4cf1e4e4b0f754b520172
|
|
| MD5 |
af1d1da861a49751473b0b69dd28fc27
|
|
| BLAKE2b-256 |
4dbc7ed6458179a978c4d7f86c283e6649a83e81cb44407892e7cb87f680cc71
|
File details
Details for the file safe_kaggle_mcp-0.4.2-py3-none-any.whl.
File metadata
- Download URL: safe_kaggle_mcp-0.4.2-py3-none-any.whl
- Upload date:
- Size: 39.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.11.15 {"installer":{"name":"uv","version":"0.11.15","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
553e897d20392a318a233bed28f3c303a3de93452b50d1cc4c7d9f9b2f7bbdba
|
|
| MD5 |
1a4c3edcdd5b5bd1edc88b86e50c1942
|
|
| BLAKE2b-256 |
fc7ffa13c325c345604b06617660d78d1a96bfc90ddd9a42c105561c2bce4adc
|