Python SDK for communicating with the Hafnia platform.

Hafnia

The hafnia Python package is a collection of tools for creating and running model training recipes on the Hafnia Platform.

The package includes the following interfaces:

  • cli: A Command Line Interface (CLI) to 1) configure/connect to Hafnia's Training-aaS and 2) create and launch recipe scripts.
  • hafnia: A Python package with helper functions for loading and interacting with sample datasets, and an experiment tracker (HafniaLogger).

The Concept: Training as a Service (Training-aaS)

Training-aaS is the concept of training models on the Hafnia platform on large, hidden datasets. Hidden datasets are datasets that can be used for training but are not available for download or direct access.

This is a key feature of the Hafnia platform, as a hidden dataset ensures data privacy and allows models to be trained compliantly and ethically by third parties (you).

The script2model approach is a Training-aaS concept, where you package your custom training script as a training recipe and use the recipe to train models on the hidden datasets.

To support local development of a training recipe, we have introduced a sample dataset for each dataset available in the Hafnia data library. The sample dataset is a small, anonymized subset of the full dataset and is available for download.

With the sample dataset, you can seamlessly switch between local development and Training-aaS. Locally, you can create, validate and debug your training recipe. The recipe is then launched with Training-aaS, where the recipe runs on the full dataset and can be scaled to run on multiple GPUs and instances if needed.

Getting started: Configuration

To get started with Hafnia:

  1. Install hafnia with your favorite Python package manager. With pip:

    pip install hafnia

  2. Sign in to the Hafnia Platform.

  3. Create an API key for Training-aaS. For instructions, follow this guide. Copy the key and save it for later use.

  4. From the terminal, configure your machine to access Hafnia:

    # Start configuration with
    hafnia configure
    
    # You are then prompted:
    Profile Name [default]:   # Press [Enter] or type a custom profile name
    Hafnia API Key:   # Paste your Hafnia API key
    Hafnia Platform URL [https://api.mdi.milestonesys.com]:  # Press [Enter]
    
  5. Download the mnist dataset from the terminal to verify that your configuration works.

    hafnia data download mnist --force
    

Getting started: Loading dataset samples

With Hafnia configured on your local machine, you can now download and explore a dataset sample with a Python script. Here we use the midwest-vehicle-detection sample dataset, which the example output below is taken from:

from hafnia.data import load_dataset

dataset_splits = load_dataset("midwest-vehicle-detection")

Dataset Format

The returned sample dataset is a Hugging Face DatasetDict and contains train, validation, and test splits.

print(dataset_splits)

# Output:
DatasetDict({
    train: Dataset({
        features: ['image_id', 'image', 'height', 'width', 'objects', 'Weather', 'Surface Conditions'],
        num_rows: 172
    })
    validation: Dataset({
        features: ['image_id', 'image', 'height', 'width', 'objects', 'Weather', 'Surface Conditions'],
        num_rows: 21
    })
    test: Dataset({
        features: ['image_id', 'image', 'height', 'width', 'objects', 'Weather', 'Surface Conditions'],
        num_rows: 21
    })
})

A Hugging Face dataset is a dictionary with splits, where each split is a Dataset object. Each Dataset is structured as a table with a set of columns (also called features) and a row for each sample.
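
As a quick orientation, splits and rows can be accessed with the standard Hugging Face datasets API. A minimal sketch, reusing the dataset loaded above:

train_split = dataset_splits["train"]  # pick a split by name
print(len(train_split))                # number of rows in the split
print(train_split.column_names)        # feature/column names

sample = train_split[0]                # a row is returned as a plain dict
print(sample["image_id"], sample["height"], sample["width"])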

The features of the dataset can be viewed with the features attribute.

import pprint

# View features of the train split
pprint.pprint(dataset_splits["train"].features)
{'Surface Conditions': ClassLabel(names=['Dry', 'Wet'], id=None),
 'Weather': ClassLabel(names=['Clear', 'Foggy'], id=None),
 'height': Value(dtype='int64', id=None),
 'image': Image(mode=None, decode=True, id=None),
 'image_id': Value(dtype='int64', id=None),
 'objects': Sequence(feature={'bbox': Sequence(feature=Value(dtype='int64',
                                                             id=None),
                                               length=-1,
                                               id=None),
                              'class_idx': ClassLabel(names=['Vehicle.Bicycle',
                                                             'Vehicle.Motorcycle',
                                                             'Vehicle.Car',
                                                             'Vehicle.Van',
                                                             'Vehicle.RV',
                                                             'Vehicle.Single_Truck',
                                                             'Vehicle.Combo_Truck',
                                                             'Vehicle.Pickup_Truck',
                                                             'Vehicle.Trailer',
                                                             'Vehicle.Emergency_Vehicle',
                                                             'Vehicle.Bus',
                                                             'Vehicle.Heavy_Duty_Vehicle'],
                                                      id=None),
                              'class_name': Value(dtype='string', id=None),
                              'id': Value(dtype='string', id=None)},
                     length=-1,
                     id=None),
 'width': Value(dtype='int64', id=None)}

View the first sample in the training set:

# Print the first sample from the training set
pprint.pprint(dataset_splits["train"][0])

{'image': <PIL.PngImagePlugin.PngImageFile image mode=RGB size=1920x1080 at 0x79D6292C5ED0>,
 'image_id': 4920,
 'height': 1080,
 'Weather': 0,
 'Surface Conditions': 0,
 'objects': {'bbox': [[441, 180, 121, 126],
                      [549, 151, 131, 103],
                      [1845, 722, 68, 130],
                      [1810, 571, 110, 149]],
             'class_idx': [7, 7, 2, 2],
             'class_name': ['Vehicle.Pickup_Truck',
                            'Vehicle.Pickup_Truck',
                            'Vehicle.Car',
                            'Vehicle.Car'],
             'id': ['HW6WiLAJ', 'T/ccFpRi', 'CS0O8B6W', 'DKrJGzjp']},
 'width': 1920}

For Hafnia-based datasets, we want to standardize how datasets and dataset tasks are represented. We have defined a set of features that are common across all datasets in the Hafnia data library.

  • image: The image itself, stored as a PIL image
  • height: The height of the image in pixels
  • width: The width of the image in pixels
  • [IMAGE_CLASSIFICATION_TASK]: [Optional] Image classification tasks are top-level ClassLabel features. ClassLabel is a Hugging Face feature that maps class indices to class names (see the decoding sketch after this list). In the above example, we have two classification tasks:
    • Weather: Classifies the weather conditions in the image, with possible values Clear and Foggy
    • Surface Conditions: Classifies the surface conditions in the image, with possible values Dry and Wet
  • objects: A dictionary containing information about objects in the image, including:
    • bbox: Bounding boxes for each object, represented as a list of coordinates [xmin, ymin, bbox_width, bbox_height]. Each bounding box is defined by its top-left corner (xmin, ymin) and its width and height (bbox_width, bbox_height) in pixels.
    • class_idx: Class indices for each detected object. This is a ClassLabel feature that maps to the class_name feature.
    • class_name: Class names for each detected object
    • id: Unique identifiers for each detected object
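
For illustration, here is a minimal sketch of decoding these features with the standard Hugging Face datasets API, reusing the dataset loaded above:

train_split = dataset_splits["train"]
sample = train_split[0]

# Top-level classification tasks store an index; ClassLabel.int2str maps it back to a name.
weather = train_split.features["Weather"].int2str(sample["Weather"])  # e.g. 'Clear'
surface = train_split.features["Surface Conditions"].int2str(sample["Surface Conditions"])

# Object class indices decode the same way ('objects' is a Sequence feature).
class_label = train_split.features["objects"].feature["class_idx"]
class_names = class_label.int2str(sample["objects"]["class_idx"])  # matches 'class_name'

# Bounding boxes are [xmin, ymin, bbox_width, bbox_height]; convert to corners if needed.
x_min, y_min, bbox_width, bbox_height = sample["objects"]["bbox"][0]
x_max, y_max = x_min + bbox_width, y_min + bbox_height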

Dataset Loading: Locally vs. Training-aaS

An important feature of load_dataset is that it returns the full dataset when running with Training-aaS on the Hafnia platform.

This enables seamless switching between running/validating a training script locally (on the sample dataset) and running full model trainings with Training-aaS (on the full dataset), without changing code or configuration in the training script.

Available datasets and their corresponding sample datasets can be found in the data library, including metadata and a description for each dataset.

Getting started: Experiment Tracking with HafniaLogger

The HafniaLogger is an important part of the recipe script and enables you to track, log and reproduce your experiments.

When integrated into your training script, the HafniaLogger is responsible for collecting:

  • Trained Model: The model trained during the experiment
  • Model Checkpoints: Intermediate model states saved during training
  • Experiment Configurations: Hyperparameters and other settings used in your experiment
  • Training/Evaluation Metrics: Performance data such as loss values, accuracy, and custom metrics

Basic Implementation Example

Here's how to integrate the HafniaLogger into your training script:

from hafnia.experiment import HafniaLogger

batch_size = 128
learning_rate = 0.001

# Initialize Hafnia logger
logger = HafniaLogger()

# Log experiment parameters
logger.log_configuration({"batch_size": batch_size, "learning_rate": learning_rate})

# Store checkpoints in this path
ckpt_dir = logger.path_model_checkpoints()

# Store the trained model in this path
model_dir = logger.path_model()

# Log scalar and metric values during training and validation
logger.log_scalar("train/loss", value=0.1, step=100)
logger.log_metric("train/accuracy", value=0.98, step=100)

logger.log_scalar("validation/loss", value=0.1, step=100)
logger.log_metric("validation/accuracy", value=0.95, step=100)

Similar to load_dataset, the tracker behaves differently when running locally versus in the cloud. Locally, experiment data is stored in the folder .data/experiments/{DATE_TIME}.

In the cloud, the experiment data will be available in the Hafnia platform under experiments.
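
To make this concrete, here is a minimal sketch of how the logger might sit inside a training loop. Only the HafniaLogger calls shown above are real; the loop and loss values are stand-ins for an actual training step:

from hafnia.experiment import HafniaLogger

logger = HafniaLogger()
logger.log_configuration({"batch_size": 128, "learning_rate": 0.001})

ckpt_dir = logger.path_model_checkpoints()  # save intermediate checkpoints here
model_dir = logger.path_model()             # save the final trained model here

for step in range(1, 101):
    loss = 1.0 / step  # stand-in for a real training step
    if step % 10 == 0:
        logger.log_scalar("train/loss", value=loss, step=step)

logger.log_metric("validation/accuracy", value=0.95, step=100)
# For a torch model, you might then save e.g. torch.save(model.state_dict(), f"{model_dir}/model.pt")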

Example: Torch Dataloader

For torch-based training scripts, a dataset is commonly used in combination with a dataloader that performs data augmentation and batches the dataset into torch tensors.

To support this, we have provided a torch dataloader example script example_torchvision_dataloader.py.

The script demonstrates how to load a dataset sample, apply data augmentations using torchvision.transforms.v2, and visualize the dataset with torch_helpers.draw_image_and_targets.

Note also how torch_helpers.TorchVisionCollateFn is used in combination with the DataLoader from torch.utils.data to handle the dataset's collate function.

The dataloader and the visualization function support the computer vision tasks and datasets available in the data library.

# Imports for the example below. torch_helpers ships with the hafnia package;
# see example_torchvision_dataloader.py for the exact import path.
import torch
import torchvision
from torch.utils.data import DataLoader
from torchvision.transforms import v2

from hafnia.data import load_dataset

# Load Hugging Face dataset
dataset_splits = load_dataset("midwest-vehicle-detection")

# Define transforms
train_transforms = v2.Compose(
    [
        v2.RandomResizedCrop(size=(224, 224), antialias=True),
        v2.RandomHorizontalFlip(p=0.5),
        v2.ToDtype(torch.float32, scale=True),
        v2.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
    ]
)
test_transforms = v2.Compose(
    [
        v2.Resize(size=(224, 224), antialias=True),
        v2.ToDtype(torch.float32, scale=True),
        v2.Normalize(mean=[0.485, 0.456, 0.406], std=[0.229, 0.224, 0.225]),
    ]
)

keep_metadata = True
train_dataset = torch_helpers.TorchvisionDataset(
    dataset_splits["train"], transforms=train_transforms, keep_metadata=keep_metadata
)
test_dataset = torch_helpers.TorchvisionDataset(
    dataset_splits["test"], transforms=test_transforms, keep_metadata=keep_metadata
)

# Visualize sample
image, targets = train_dataset[0]
visualize_image = torch_helpers.draw_image_and_targets(image=image, targets=targets)
pil_image = torchvision.transforms.functional.to_pil_image(visualize_image)
pil_image.save("visualized_labels.png")

# Create DataLoaders - using TorchVisionCollateFn
collate_fn = torch_helpers.TorchVisionCollateFn(
    skip_stacking=["objects.bbox", "objects.class_idx"]
)
train_loader = DataLoader(train_dataset, batch_size=8, shuffle=True, collate_fn=collate_fn)
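
For intuition on skip_stacking: after the transforms, all images in a batch share a shape and can be stacked into a single tensor, whereas the number of boxes varies per image, so fields like objects.bbox must remain per-sample lists. A hand-rolled equivalent of such a collate function (our sketch, not the library's implementation) might look like:

import torch

def detection_collate(batch):
    # Each dataset item is an (image, targets) pair, as seen above.
    images = torch.stack([image for image, _ in batch])  # fixed shape -> one tensor
    targets = [targets for _, targets in batch]          # ragged fields stay unstacked
    return images, targets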

Example: Training-aaS

By combining logging and dataset loading, we can now construct our model training recipe.

To demonstrate this, we have provided a recipe project, recipe-classification, that serves as a template for creating and structuring training recipes.

The project also contains additional information on how to structure your training recipe, how to use the HafniaLogger and the load_dataset function, and the different approaches for launching the training recipe on the Hafnia platform.

Create, Build and Run recipe.zip locally

To test recipe compatibility with the Hafnia cloud, use the following commands to build and start the job locally.

    # Create 'recipe.zip' from source folder '.'
    hafnia recipe create .
    
    # Build the docker image locally from a 'recipe.zip' file
    hafnia runc build-local recipe.zip

    # Execute the docker image locally with a desired dataset
    hafnia runc launch-local --dataset mnist "python scripts/train.py"

Detailed Documentation

For more information, see our documentation page or the markdown pages below.

  • CLI - Detailed guide for the Hafnia command-line interface
  • Release lifecycle - Details about package release lifecycle.

Development

For development, we use a uv-based virtual Python environment.

Install uv:

curl -LsSf https://astral.sh/uv/install.sh | sh

Create a virtual environment and install the Python dependencies:

uv sync

Run tests:

uv run pytest tests
