Skip to main content

A CLI tool to sync W&B offline runs from Kaggle Notebooks via GitHub Actions

Project description

kaggle-wandb-sync

PyPI version Test License: MIT

A CLI tool to sync Weights & Biases offline runs from Kaggle Notebooks to W&B cloud — fully automated via GitHub Actions.

Why?

Kaggle Notebooks run in an isolated environment with internet access disabled for competition submissions. This means you can't push W&B metrics in real time. kaggle-wandb-sync solves this by:

  1. Running your notebook with WANDB_MODE=offline (logs saved locally on Kaggle)
  2. Downloading the output via kaggle kernels output
  3. Syncing the offline runs to W&B cloud with wandb sync

Installation

pip install kaggle-wandb-sync

Prerequisites: Kaggle API credentials (~/.kaggle/kaggle.json) and a W&B API key (WANDB_API_KEY env var, or run wandb login once to save credentials to ~/.netrc).

Quick Start

All-in-one command

# Set your W&B API key
export WANDB_API_KEY=your_api_key

# Push notebook, wait for completion, download output, sync to W&B
kaggle-wandb-sync run my-notebook/

Step by step

kaggle-wandb-sync push   my-notebook/                      # push (with 409 protection)
kaggle-wandb-sync poll   username/my-notebook              # wait for COMPLETE
kaggle-wandb-sync output username/my-notebook              # download output
kaggle-wandb-sync sync   ./kaggle_output                   # wandb sync

Notebook Setup

Add these lines before importing wandb in your Kaggle Notebook:

import os
os.environ['WANDB_MODE'] = 'offline'   # must be set before import
os.environ['WANDB_PROJECT'] = 'my-project'

import wandb
wandb.init()
# ... your training code ...
wandb.log({"loss": 0.1, "accuracy": 0.95})
wandb.finish()

Important: Set WANDB_MODE=offline before import wandb, not after.

GitHub Actions Integration

Add this workflow to your Kaggle repo (.github/workflows/kaggle-wandb-sync.yml):

name: Kaggle W&B Sync

on:
  workflow_dispatch:
    inputs:
      notebook_dir:
        description: "Notebook directory (e.g. my-competition)"
        required: true
      kernel_id:
        description: "Kernel ID (e.g. username/my-competition-baseline)"
        required: true

jobs:
  sync:
    runs-on: ubuntu-latest
    steps:
      - uses: actions/checkout@v4

      - uses: actions/setup-python@v5
        with:
          python-version: "3.12"

      - name: Install kaggle-wandb-sync
        run: pip install kaggle-wandb-sync

      - name: Set up Kaggle credentials
        run: |
          mkdir -p ~/.kaggle
          echo '${{ secrets.KAGGLE_API_TOKEN }}' > ~/.kaggle/kaggle.json
          chmod 600 ~/.kaggle/kaggle.json

      - name: Run pipeline
        env:
          WANDB_API_KEY: ${{ secrets.WANDB_API_KEY }}
        run: |
          kaggle-wandb-sync run ${{ inputs.notebook_dir }} \
            --kernel-id ${{ inputs.kernel_id }}

Required secrets: KAGGLE_API_TOKEN (JSON content of ~/.kaggle/kaggle.json) and WANDB_API_KEY.

Commands

run — Full pipeline (recommended)

kaggle-wandb-sync run [DIRECTORY] [OPTIONS]
Option Default Description
--kernel-id, -k from metadata Kernel ID (username/slug)
--output-dir, -o ./kaggle_output Directory for downloaded files
--poll-interval 30 Seconds between status checks
--max-attempts 60 Max poll attempts (30min total)
--skip-push off Skip push step (use when notebook has already finished running)
--skip-sync off Download output only, skip W&B sync
--competition-slug Competition slug to auto-record LB score after browser submission (e.g. march-machine-learning-mania-2026)

push — Push notebook

kaggle-wandb-sync push [DIRECTORY] [OPTIONS]

Waits for any currently running kernel to finish before pushing (prevents 409 Conflict errors).

poll — Wait for completion

kaggle-wandb-sync poll KERNEL_ID [--interval 30] [--max-attempts 60]

Exits with code 1 if the kernel finishes with ERROR or CANCEL.

v0.1.5+: On ERROR or CANCEL, automatically downloads the kernel log and prints stdout + last 30 stderr lines, so you can diagnose failures without opening the Kaggle UI.

output — Download output

kaggle-wandb-sync output KERNEL_ID [--output-dir ./kaggle_output]

sync — Sync to W&B

kaggle-wandb-sync sync [OUTPUT_DIR]

Finds all offline-run-* directories and runs wandb sync on each.

score — Record Kaggle LB score to W&B

kaggle-wandb-sync score RUN_ID [OPTIONS]
Option Description
--score Kaggle public LB score (float)
--rank Leaderboard rank (int)
--metric KEY=VALUE Additional metric (repeatable)
--project entity/project W&B project path (for bare run IDs)
kaggle-wandb-sync score https://wandb.ai/me/my-proj/runs/abc123 --score 0.127 --rank 200

Known Issues

  • Windows encoding: Prefix commands with PYTHONUTF8=1 if you see encoding errors on Windows.

  • Windows PATH (Microsoft Store Python): If kaggle-wandb-sync: command not found in Git Bash, add the Scripts directory to your PATH:

    # Add to ~/.bashrc
    export PATH="$PATH:/c/Users/<your-username>/AppData/Local/Packages/PythonSoftwareFoundation.Python.3.12_qbz5n2kfra8p0/LocalCache/local-packages/Python312/Scripts"
    
  • Git Bash path format (fixed in v0.1.2): Git Bash converts paths like C:/Users/... to /c/Users/..., which Python cannot resolve. As of v0.1.2, all path arguments are automatically converted to Windows format.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

kaggle_wandb_sync-0.1.9.tar.gz (13.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

kaggle_wandb_sync-0.1.9-py3-none-any.whl (15.7 kB view details)

Uploaded Python 3

File details

Details for the file kaggle_wandb_sync-0.1.9.tar.gz.

File metadata

  • Download URL: kaggle_wandb_sync-0.1.9.tar.gz
  • Upload date:
  • Size: 13.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for kaggle_wandb_sync-0.1.9.tar.gz
Algorithm Hash digest
SHA256 2966fe92041b23937dc98da262eb3ac16d185ed6139df3934cc847b9e1099a56
MD5 c7f9b8bc19c39b18fbf7b0b1969d35c9
BLAKE2b-256 4b5557d0a4ee7591a798bf4dbb647994bc7c3b7b2fba637f13e3d13c094aa999

See more details on using hashes here.

Provenance

The following attestation bundles were made for kaggle_wandb_sync-0.1.9.tar.gz:

Publisher: publish.yml on yasumorishima/kaggle-wandb-sync

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file kaggle_wandb_sync-0.1.9-py3-none-any.whl.

File metadata

File hashes

Hashes for kaggle_wandb_sync-0.1.9-py3-none-any.whl
Algorithm Hash digest
SHA256 70fb6e71bef68b3f534811b094388c30be3bbb8d22dcc4b505de6dbde673d395
MD5 5135aa6c36aa71e0bfb1810afe1c3c03
BLAKE2b-256 a2e9240cc2af5e4107ee87a2cf1d6b1b4a3036ed68d4c9ac6fbd1258fc13c74d

See more details on using hashes here.

Provenance

The following attestation bundles were made for kaggle_wandb_sync-0.1.9-py3-none-any.whl:

Publisher: publish.yml on yasumorishima/kaggle-wandb-sync

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page