Skip to main content

A light-weight system for training AI networks using PyTorch

Project description

https://raw.githubusercontent.com/marovira/helios-ml/master/data/logo/logo-transparent.png

Test CodeFactor Ruff PythonVersion PyPi License

What is Helios?

Named after Greek god of the sun, Helios is a light-weight package for training ML networks built on top of PyTorch. It is designed to abstract all of the “boiler-plate” code involved with training. Specifically, it wraps the following common patterns:

  • Creation of the dataloaders.

  • Initialization of CUDA, PyTorch, and random number states.

  • Initialization for distributed training.

  • Training, validation, and testing loops.

  • Saving and loading checkpoints.

  • Exporting to ONNX.

It is important to note that Helios is not a fully fledged training environment similar to Pytorch Lightning. Instead, Helios is focused on providing a simple and straight-forward interface that abstracts most of the common code patterns while retaining the ability to be easily overridden to suit the individual needs of each training scheme.

Main Features

Helios offers the following functionality out of the box:

  1. Resume training: Helios has been built with the ability to resume training if it is paused. Specifically, Helios will ensure that the behaviour of the trained model is identical to the one it would’ve had if it had been trained without pauses.

  2. Automatic detection of multi-GPU environments for distributed training. In addition, Helios also supports training using torchrun and will automatically handle the initialisation and clean up of the distributed state. It will also correctly set the devices and maps to ensure weights are mapped tot he correct location.

  3. Registries for creation of arbitrary types. These include: networks, loss functions, optimizers, schedulers, etc.

  4. Correct handling of logging when doing distributed training (even over multiple nodes).

Installation

Dependencies

Helios requires:

  • Python (>= 3.11)

  • TQDM (>= 4.66.2)

  • OpenCV (>= 4.10.0.84)

  • Tensorboard (>= 2.17.1)

  • PyTorch (>= 2.4.0)

  • Torchvision (>= 0.19.0)

  • ONNX (>= 1.16.1)

  • ONNXRuntime (>= 1.19.0)

  • Matplotlib (>= 3.8.4)

  • Numpy (>= 2.0.0)

User Installation

You can install Helios using pip:

pip install -U helios-ml

If you require a specific version of CUDA, you can install with:

pip install -U helios-ml --extra-index-url https://download.pytorch.org/whl/cu<version>

Documentation

Documentation available here.

Contributing

There are three ways in which you can contribute to Helios:

  • If you find a bug, please open an issue. Similarly, if you have a question about how to use it, or if something is unclear, please post an issue so it can be addressed.

  • If you have a fix for a bug, or a code enhancement, please open a pull request. Before you submit it though, make sure to abide by the rules written below.

  • If you have a feature proposal, you can either open an issue or create a pull request. If you are submitting a pull request, it must abide by the rules written below. Note that any new features need to be approved by me.

If you are submitting a pull request, the guidelines are the following:

  1. Ensure that your code follows the standards and formatting of Helios. The coding standards and formatting are enforced through the Ruff Linter and Formatter. Any changes that do not abide by these rules will be rejected. It is your responsibility to ensure that both Ruff and Mypy linters pass.

  2. Ensure that all unit tests are working prior to submitting the pull request. If you are adding a new feature that has been approved, it is your responsibility to provide the corresponding unit tests (if applicable).

License

Helios is published under the BSD-3 license and can be viewed here.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

helios_ml-1.1.0.tar.gz (135.4 kB view details)

Uploaded Source

Built Distribution

helios_ml-1.1.0-py3-none-any.whl (68.2 kB view details)

Uploaded Python 3

File details

Details for the file helios_ml-1.1.0.tar.gz.

File metadata

  • Download URL: helios_ml-1.1.0.tar.gz
  • Upload date:
  • Size: 135.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for helios_ml-1.1.0.tar.gz
Algorithm Hash digest
SHA256 850fa78e41bc133456aa8d9cc45fff0cb792cef46507ec1c161808e89dec5df6
MD5 802bae9a493c159085a66bd4b6cf6556
BLAKE2b-256 72feb7168b012782e76168dda61dac1dbb0649ad7b3a597e365441d3727fd81c

See more details on using hashes here.

File details

Details for the file helios_ml-1.1.0-py3-none-any.whl.

File metadata

  • Download URL: helios_ml-1.1.0-py3-none-any.whl
  • Upload date:
  • Size: 68.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for helios_ml-1.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8faf0eede8ed9452aefb1f20fb37b5613047d227df22addc084b201f7be7f747
MD5 2cc07f9fbff7641b2b98708c41132a55
BLAKE2b-256 f25c4e0ed74e8865d2afa1e7e5241f08cd4bd55724f5123338466c3da9854def

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page