Skip to main content

A powerful API to Automate Machine Learning workflows from multiple vendors.

Project description

a2ml - Automation of AutoML

CircleCI Join the chat License Python PyPI - A2ML Versions

The A2ML ("Automate AutoML") project is a Python API and set of command line tools to automate Automated Machine Learning tools from multiple vendors. The intention is to provide a common API for all Cloud-oriented AutoML vendors. Data scientists can then train their datasets against multiple AutoML models to get the best possible predictive model. May the best "algorithm/hyperparameter search" win. Full documentation for A2ML is available at a2ml.org

The PREDIT Pipeline

Every AutoML vendor has their own API to manage the datasets and create and manage predictive models. They are similar but not identical APIs. But they share a common set of stages:

  • Importing data for training
  • Train models with multiple algorithms and hyperparameters
  • Evaluate model performance and choose one or more for deployment
  • Deploy selected models
  • Predict results with new data against deployed models
  • Review performance of deployed models

Since ITEDPR is hard to remember we refer to this pipeline by its conveniently mnemonic anagram: "PREDIT" (French for "predict"). The A2ML project provides classes which implement this pipeline for various Cloud AutoML providers and a command line interface that invokes stages of the pipeline.

Setup

A2ML is distributed as a python package, so to install it:

$ pip install -U a2ml

It will install Auger provider.

To use Azure AutoML:

Mac:

$ brew install libomp

For Mac OS High Sierra and below:

$ SKLEARN_NO_OPENMP=1 pip install "scikit-learn==0.21.3"
$ pip install "a2ml[azure]" --ignore-installed onnxruntime onnx nimbusml

Linux:

$ apt-get update && apt-get -y install gcc g++ libgomp1
$ pip install "a2ml[azure]"

To use Google Cloud:

$ pip install "a2ml[google]"

To install everything including testing and server code:

$ pip install "a2ml[all]"

Development

To release a new version the flow should be:

  1. Change the __version__ variable in a2ml/__init__.py to match what you want to release, minus the "v". By default it would be "<current-milestone>.dev0", for example "0.3.0.dev0". This ensures we don’t accidentally release a dev version to pypi.org. So for when we’re ready to release 0.3.0, the __version__ variable should simply be "0.3.0".

  2. Commit and push the changes above.

git tag v<the-version> (for example: git tag v0.3.0)
git push --tags
  1. verify circleci build passed and docker image tag exists:
pip install -U a2ml==0.3.0
docker pull augerai/a2ml:v0.3.0
  1. Increment the __version__ variable in a2ml/__init__.py to the next version in the current milestone. For example, "0.3.1.dev0"

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for a2ml, version 1.0.10
Filename, size File type Python version Upload date Hashes
Filename, size a2ml-1.0.10-py3-none-any.whl (173.3 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size a2ml-1.0.10.tar.gz (117.4 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring DigiCert DigiCert EV certificate Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page