Skip to main content

Python tools for communication with Hafnia platform.

Project description

Hafnia

The hafnia python package is a collection of tools to create and run model training recipes on the Hafnia Platform.

The package includes the following interfaces:

  • cli: A Command Line Interface (CLI) to 1) configure/connect to Hafnia and 2) create and launch Training-aaS recipe scripts.
  • hafnia: A python package with helper functions to load and interact with sample datasets and an experiment tracker (HafniaLogger).

The Concept: Training as a Service (Training-aaS)

Training-aaS is the concept of training models on the Hafnia platform on large and hidden datasets. Hidden datasets refers to datasets that can be used for training, but are not available for download or direct access.

This is a key feature of the Hafnia platform, as a hidden dataset ensures data privacy, and allow models to be trained compliantly and ethically by third parties (you).

The script2model approach is a Training-aaS concept, where you package your custom training script as a training recipe and use the recipe to train models on the hidden datasets.

To support local development of a training recipe, we have introduced a sample dataset for each dataset available on the Hafnia platform. The sample dataset is a small and anonymized subset of the full dataset and available for download.

With the sample dataset, you can seamlessly switch between local and Hafnia training. Locally, you can create, validate and debug your training recipe. The recipe is then launched with Hafnia Training-aaS, where the recipe runs on the full dataset and can be scaled to run on multiple GPUs and instances if needed.

Getting started: Configuration

To get started with Hafnia:

  1. Install hafnia with your favorite python package manager. With pip do this:

    pip install hafnia

  2. Sign in to the Hafnia Platform.

  3. Create an API KEY for Training aaS. For more instructions, follow this guide. Copy the key and save it for later use.

  4. From terminal, configure your machine to access Hafnia:

    # Start configuration with
    hafnia configure
    
    # You are then prompted: 
    Profile Name [default]:   # Press [Enter] or select an optional name
    Hafnia API Key:  # Pass your HAFNIA API key
    Hafnia Platform URL [https://api.mdi.milestonesys.com]:  # Press [Enter]
    
  5. Download mnist from terminal to verify configuration is working.

    hafnia data download mnist --force
    

Getting started: Loading datasets samples

With Hafnia configured on your local machine, it is now possible to download and explore the dataset sample with a python script:

from hafnia.data import load_dataset

dataset_splits = load_dataset("midwest-vehicle-detection")
print(dataset_splits)
print(dataset_splits["train"])

Datasets with corresponding sample datasets can be found in data library. It is early days for the data library, but we are actively working on adding more datasets.

The returned sample dataset is a hugging face dataset and contains train, validation and test splits.

An important feature of load_dataset is that it will return the full dataset when loaded on the Hafnia platform.

This enables seamlessly switching between running/validating a training script locally (on the sample dataset) and running full model trainings in the cloud without changing code or configurations for the training script.

Getting started: Experiment Tracking with HafniaLogger

The HafniaLogger is an important part of the recipe script and enables you to track, log and reproduce your experiments.

When integrated into your training script, the HafniaLogger is responsible for collecting:

  • Trained Model: The model trained during the experiment
  • Model Checkpoints: Intermediate model states saved during training
  • Experiment Configurations: Hyperparameters and other settings used in your experiment
  • Training/Evaluation Metrics: Performance data such as loss values, accuracy, and custom metrics

Basic Implementation Example

Here's how to integrate the HafniaLogger into your training script:

from hafnia.experiment import HafniaLogger

batch_size = 128
learning_rate = 0.001

# Initialize Hafnia logger
logger = HafniaLogger()

# Log experiment parameters
logger.log_configuration({"batch_size": 128, "learning_rate": 0.001})

# Store checkpoints in this path
ckpt_dir = logger.path_model_checkpoints()

# Store the trained model in this path
model_dir = logger.path_model()

# Log scalar and metric values during training and validation
logger.log_scalar("train/loss", value=0.1, step=100)
logger.log_metric("train/accuracy", value=0.98, step=100)

logger.log_scalar("validation/loss", value=0.1, step=100)
logger.log_metric("validation/accuracy", value=0.95, step=100)

Similar to load_dataset, the tracker behaves differently when running locally or in the cloud. Locally, experiment data is stored in a local folder .data/experiments/{DATE_TIME}.

In the cloud, the experiment data will be available in the Hafnia platform under experiments.

Example: Torch Dataloader

Commonly for torch-based training scripts, a dataset is used in combination with a dataloader that performs data augmentations and batching of the dataset as torch tensors.

To support this, we have provided a torch dataloader example script example_torchvision_dataloader.py.

The script demonstrates how to make a dataloader with data augmentation (torchvision.transforms.v2) and a helper function for visualizing image and labels.

The dataloader and visualization function supports computer vision tasks and datasets available in the data library.

Example: Training-aaS

By combining logging and dataset loading, we can now construct our model training recipe.

To demonstrate this, we have provided a recipe project that serves as a template for creating and structuring training recipes recipe-classification

The project also contains additional information on how to structure your training recipe, use the HafniaLogger, the load_dataset function and different approach for launching the training recipe on the Hafnia platform.

Detailed Documentation

For more information, go to our documentation page or in below markdown pages.

Development

For development, we are using an uv based virtual python environment

Install uv

curl -LsSf https://astral.sh/uv/install.sh | sh

Install python dependencies including developer (--dev) and optional dependencies (--all-extras).

uv sync --all-extras --dev

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

hafnia-0.1.12.tar.gz (158.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

hafnia-0.1.12-py3-none-any.whl (28.1 kB view details)

Uploaded Python 3

File details

Details for the file hafnia-0.1.12.tar.gz.

File metadata

  • Download URL: hafnia-0.1.12.tar.gz
  • Upload date:
  • Size: 158.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for hafnia-0.1.12.tar.gz
Algorithm Hash digest
SHA256 505ded0dee6c563a7fdc7438b3f98cc0c1414cb47513687afc089b38272a1988
MD5 dfcbc222d72c03f2adc92997115887ce
BLAKE2b-256 a148883854a576d93629e3f31b03ba117e920999542e8a0460597d48be71ff2a

See more details on using hashes here.

File details

Details for the file hafnia-0.1.12-py3-none-any.whl.

File metadata

  • Download URL: hafnia-0.1.12-py3-none-any.whl
  • Upload date:
  • Size: 28.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for hafnia-0.1.12-py3-none-any.whl
Algorithm Hash digest
SHA256 11a8747d6612a26fd1140a23d04374f94094e1b87001a8153bf0f7d8110e104c
MD5 598544b74481c0b88053991369908774
BLAKE2b-256 2ab38e3517715da3c33c14ae7ee259d7de688c774591f2d777af81e348c02576

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page