Skip to main content

MLflow Plugin for Nebari Platform

Project description

Nebari MLflow Plugin

Table of Contents

Introduction

This MLflow extension supports AWS, Azure, GCP, and local Nebari deployments. It provides a robust, collaborative environment for AI/ML professionals to manage experiments, track metrics, and deploy models.

Features

Centralized Artifact Repository: Store and manage all your metrics, parameters, and artifacts in a single location, accessible across the multi-tenant platform.

Experiment Tracking: Log, query, and visualize metrics to understand and compare different runs and models.

Automated Configuration: Simply type import mlflow in your Python script, and you're already configured to communicate with the remote multi-tenant MLflow server—no additional setup required.

Installation

Prerequisites:

  • Nebari must be deployed using the AWS, Azure, GCP, or local provider
  • Nebari version 2024.10.1 or later (or a version with IRSA support for AWS)

Installing the MLflow extension is as straightforward as installing a Python package. Run the following commands:

git clone nebari-mlflow-plugin
cd nebari-mlflow-plugin
pip install nebari-mlflow-plugin

This command installs the Python package and also creates the necessary infrastructure to run MLflow on the AI Platform.

Configuration

After installation, the MLflow extension is automatically configured to work with the AI Platform. To access the MLflow interface, navigate to https://[your-nebari-domain]/mlflow.

For Azure, your app registration will need RBAC permissions in addition to the typical Contributor permissons. We recommend you create a custom role scoped at the resource_group (usually named "<project_name>-<namespace>" where the values are what you set in nebari-config.yaml), and add the following permissions:

  • Microsoft.Authorization/roleAssignments/read
  • Microsoft.Authorization/roleAssignments/write
  • Microsoft.Authorization/roleAssignments/delete

Then create a role assignment of that role to the nebari app registration service principal.

For GCP, your service account will need additional IAM permissions beyond the standard roles. The service account used for Nebari deployment requires the Service Account Admin role to manage workload identity bindings.

Add this role using the gcloud CLI:

gcloud projects add-iam-policy-binding YOUR_PROJECT_ID \
  --member="serviceAccount:YOUR_SERVICE_ACCOUNT@YOUR_PROJECT_ID.iam.gserviceaccount.com" \
  --role="roles/iam.serviceAccountAdmin"

Or via the GCP Console:

  1. Navigate to the IAM & Admin page in the Google Cloud Console
  2. Find your Nebari service account in the list
  3. Click the pencil icon to edit permissions
  4. Click "ADD ANOTHER ROLE"
  5. Search for and select "Service Account Admin"
  6. Click "SAVE"

This role includes the iam.serviceAccounts.setIamPolicy permission required for the MLflow plugin to create workload identity bindings.

For AWS, your IAM user or role used for Nebari deployment needs permissions to create and manage:

  • S3 buckets and bucket policies
  • IAM roles and policies for IRSA (IAM Roles for Service Accounts)
  • KMS keys (if S3 encryption is enabled)

The deployment requires an EKS cluster with OIDC provider configured for IRSA. The MLflow plugin will automatically:

  • Create an S3 bucket for artifact storage with versioning enabled
  • Create IAM roles and policies for the MLflow service account
  • Configure the Kubernetes service account with the appropriate IAM role annotations

You can optionally disable S3 encryption by adding this to your Nebari configuration:

mlflow:
  enabled: true
  aws:
    enable_s3_encryption: false

Configuring MLflow Tracking URL

You may set the MLFLOW_TRACKING_URL to configure mlflow in individual users' Nebari instances by adding or updating an additional block in your Nebari configuration file. Be sure to replace {project_name} and {namespace} with the values from your own nebari config file e.g. http://mynebari-mlflow-tracking.dev.svc:5000.

jupyterhub:
  overrides:
    singleuser:
      extraEnv:
        MLFLOW_TRACKING_URI: "http://{project_name}-mlflow-tracking.{namespace}.svc:5000" 

Helm Chart Overrides

You can pass custom Helm chart values to override the default MLflow configuration. This is useful for enabling specific features like the MLflow run server or customizing resource limits. The overrides are specified in your Nebari configuration file under the mlflow section.

mlflow:
  enabled: true
  overrides:
    tracking:
      resources:
        limits:
          memory: "4Gi"
          cpu: "2"
    # ... additional overrides

Usage

Getting started with the MLflow extension is incredibly simple. To track an experiment:

Navigate to the MLFLow extension URL and create a new experiment. In your Python script, import MLflow and start logging metrics.

import mlflow

# Start an experiment
with mlflow.start_run() as run:
    mlflow.log_metric("accuracy", 0.9)
    mlflow.log_artifact("path/to/your/artifact")

With the above code, your metrics and artifacts are automatically stored and accessible via the MLFlow extension URL.

License

nebari-mlflow-plugin is distributed under the terms of the Apache license.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nebari_mlflow_plugin-2025.10.1.tar.gz (205.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

nebari_mlflow_plugin-2025.10.1-py3-none-any.whl (23.9 kB view details)

Uploaded Python 3

File details

Details for the file nebari_mlflow_plugin-2025.10.1.tar.gz.

File metadata

  • Download URL: nebari_mlflow_plugin-2025.10.1.tar.gz
  • Upload date:
  • Size: 205.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for nebari_mlflow_plugin-2025.10.1.tar.gz
Algorithm Hash digest
SHA256 4f7c11c8e79129c80a7f8e312eff99be2661eafd222b626eef05c2082ef83142
MD5 df26caf89ebd25a0dbfc6540651e7c9b
BLAKE2b-256 afa92e0671a79daaef73e73f2a1d728f14d2d2b1d8e666cbeb8b3f43ff790d52

See more details on using hashes here.

Provenance

The following attestation bundles were made for nebari_mlflow_plugin-2025.10.1.tar.gz:

Publisher: release.yaml on nebari-dev/nebari-mlflow-plugin

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file nebari_mlflow_plugin-2025.10.1-py3-none-any.whl.

File metadata

File hashes

Hashes for nebari_mlflow_plugin-2025.10.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7613994ee4626c0a33a2bd07b3cc6f4af26465d822ad3f76377cd773bbf790d4
MD5 69a0ea4b55321f91a59e65e0d710838d
BLAKE2b-256 4b7a5b4ef9001f538d054ad140acd2f0843252764b99ff20d506c0e07ee729e7

See more details on using hashes here.

Provenance

The following attestation bundles were made for nebari_mlflow_plugin-2025.10.1-py3-none-any.whl:

Publisher: release.yaml on nebari-dev/nebari-mlflow-plugin

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page