
Machine Learning Experiment Toolbox


Lightweight Infrastructure for Distributed ML Experiments 🚜


Coming up with the right research hypotheses is hard - testing them should be easy.

ML researchers need to coordinate different types of experiments on separate remote resources. The Machine Learning Experiment (MLE)-Toolbox is designed to facilitate this workflow by providing a simple interface, standardized logging, and many common ML experiment types (multi-seed/multi-config runs, grid searches, and hyperparameter optimization pipelines). You can run experiments on your local machine, on high-performance compute clusters (Slurm and Sun Grid Engine), or on cloud VMs (GCP). The results are archived (locally or in a GCS bucket) and can easily be retrieved or automatically summarized/reported as .md/.html files.

What Does The mle-toolbox Provide?

  1. API for launching jobs on cluster/cloud computing platforms (Slurm, GridEngine, GCP).
  2. Common machine learning research experiment setups:
    • Launching and collecting multiple random seeds in parallel/batches or async.
    • Hyperparameter searches: Random, Grid, SMBO, PBT and Nevergrad.
    • Pre- and post-processing pipelines for data preparation/result visualization.
  3. Automated report generation for hyperparameter search experiments.
  4. Storage of results and the experiment database in a Google Cloud Storage bucket.
  5. Resource monitoring with dashboard visualization.
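
To make point 2 concrete, here is a plain-Python sketch of what a multi-seed grid search conceptually boils down to; `train_fn` and the parameter names are made up for illustration, and this is not the toolbox API:

```python
import itertools
import random

def train_fn(lrate, batch_size, seed):
    # Stand-in for a real training run: returns a dummy "score".
    random.seed(seed)
    return random.random() - abs(lrate - 0.01)

# Grid of configurations, each crossed with multiple random seeds.
grid = {"lrate": [0.001, 0.01, 0.1], "batch_size": [32, 64]}
seeds = [0, 1, 2]

results = []
for lrate, batch_size in itertools.product(*grid.values()):
    scores = [train_fn(lrate, batch_size, seed) for seed in seeds]
    results.append({"lrate": lrate, "batch_size": batch_size,
                    "mean_score": sum(scores) / len(scores)})

# Select the configuration with the best seed-averaged score.
best = max(results, key=lambda r: r["mean_score"])
```

The toolbox automates exactly this kind of loop, but dispatches each configuration/seed pair as a separate (possibly remote) job and aggregates the logs for you.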

The 4 Step mle-toolbox Cooking Recipe 🍲

  1. Follow the instructions below to install the mle-toolbox and set up your credentials/configurations.
  2. Read the docs explaining the pillars of the toolbox & the experiment meta-configuration job .yaml files.
  3. Check out the examples 📄 to get started: Toy ODE integration, training PyTorch MNIST-CNNs or VAEs in JAX.
  4. Run your own experiments using the template files, mle project, and mle run.
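
For a flavor of what such a meta-configuration might look like, here is a hypothetical .yaml sketch; every key name below is an assumption for illustration only, so consult the docs for the actual schema:

```yaml
# Hypothetical experiment meta-configuration (all field names illustrative).
meta_job_args:
    experiment_dir: experiments/ode_demo/
    job_type: hyperparameter-search
single_job_args:
    num_gpus: 0
    time_per_job: "00:30:00"
param_search_args:
    search_type: grid
    params_to_search:
        lrate: [0.001, 0.01, 0.1]
```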

Installation ⏳

If you want to use the toolbox on your local machine, follow the installation instructions locally. Otherwise, do so on your respective cluster resource (Slurm/SGE). A PyPI installation is available via:

pip install mle-toolbox

Alternatively, you can clone this repository and install it manually:

git clone https://github.com/RobertTLange/mle-toolbox.git
cd mle-toolbox
pip install -e .

By default, this will only install the minimal dependencies (excluding specialized packages such as scikit-optimize, statsmodels, and nevergrad). To run the tests or examples, you will need to install the additional requirements.

Setting Up Your Remote Credentials 🙈

By default the toolbox will only run locally and without any GCS storage of your experiments. If you want to integrate the mle-toolbox with your SGE/Slurm clusters, you have to provide additional configuration. There are two ways to do so:

  1. After installation type mle init. This will walk you through all configuration steps in your CLI and save your configuration in ~/mle_config.toml.
  2. Manually edit the config_template.toml template. Move/rename the template to your home directory via mv config_template.toml ~/mle_config.toml.

The configuration procedure consists of 3 optional steps, which depend on your needs:

  1. Set whether to store all results & your database locally or remotely in a GCS bucket.
  2. Add SGE and/or Slurm credentials & cluster-specific details (headnode, partitions, proxy server, etc.).
  3. Add the GCP project, GCS bucket name and database filename to store your results.
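
The resulting ~/mle_config.toml roughly mirrors these three steps. The sketch below is hypothetical and all section/key names are assumptions; config_template.toml documents the real ones:

```toml
# Hypothetical ~/mle_config.toml sketch (all keys illustrative).
[general]
use_gcloud_results_storage = false   # Step 1: local vs. GCS result storage

[slurm]                              # Step 2: cluster credentials/details
user_name = "your-user"
partition = "gpu"

[gcp]                                # Step 3: GCP project, bucket & database
project_name = "my-project"
bucket_name = "my-mle-results"
database_name = "mle_protocol.db"
```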

The Core Commands of the MLE-Toolbox 🌱

You are now ready to dive deeper into the specifics of job configuration and can start running your first experiments from the cluster (or locally on your machine) with the following commands:

  • mle init: Setup of credentials & toolbox settings.
  • 🚀 mle run: Start up an experiment.
  • 🖥️ mle monitor: Monitor resource utilisation.
  • 📥 mle retrieve: Retrieve an experiment result.
  • 💌 mle report: Create an experiment report with figures.
  • 🔄 mle sync-gcs: Extract all GCS-stored results to your local drive.

Examples 🎒

  • 📄 Euler PDE (multi-configs, hyperparameter-search): Integrate a PDE using forward Euler.
  • 📄 MNIST CNN (multi-configs, hyperparameter-search): Train PyTorch MNIST-CNNs.
  • 📄 JAX VAE (hyperparameter-search): Train a JAX-based MNIST VAE.
  • 📄 Multi-Objective (hyperparameter-search): Multi-objective tuning with nevergrad.
  • 📄 Sklearn SVM (single-config): Train a Sklearn SVM classifier.
  • 📄 Multi Bash (multi-configs): Bash-based jobs.
  • 📄 Quadratic PBT (population-based-training): PBT on a toy quadratic surrogate.
  • 📄 MNIST PBT (population-based-training): PBT for an MNIST MLP network.
  • 📓 Evaluation: Evaluation of grid-search results.
  • 📓 Testing: Perform hypothesis tests on logs.
  • 📓 GIF Animations: Walk through a set of animation helpers.
  • 📓 PBT Evaluation: Inspect the results from PBT.

Acknowledgements & Citing mle-toolbox ✏️

To cite this repository:

@software{mle_toolbox2021github,
  author = {Robert Tjarko Lange},
  title = {{MLE-Toolbox}: A Reproducible Workflow for Distributed Machine Learning Experiments},
  url = {http://github.com/RobertTLange/mle-toolbox},
  version = {0.3.0},
  year = {2021},
}

Much of the mle-toolbox design has been inspired by discussions with Jonathan Frankle and Nandan Rao about the quest for empirically sound and supported claims in Machine Learning. Parts of the mle <subcommands> were inspired by Tudor Berariu's Liftoff package, and parts of the philosophy by wanting to provide a lightweight version of IDSIA's sacred package. Further credit goes to Facebook's submitit and Ray.

Notes, Development & Questions ❓

  • If you find a bug or want a new feature, feel free to contact me @RobertTLange or create an issue 🤗
  • You can check out the history of release modifications in CHANGELOG.md (added, changed, fixed).
  • You can find a set of open milestones in CONTRIBUTING.md.
