A powerful API to Automate Machine Learning workflows from multiple vendors.
Project description
a2ml - Automation of AutoML
The A2ML ("Automate AutoML") project is a Python API and set of command line tools to automate Automated Machine Learning tools from multiple vendors. The intention is to provide a common API for all Cloud-oriented AutoML vendors. Data scientists can then train their datasets against multiple AutoML models to get the best possible predictive model. May the best "algorithm/hyperparameter search" win. Full documentation for A2ML is available at a2ml.org
The PREDIT Pipeline
Every AutoML vendor has their own API to manage the datasets and create and manage predictive models. They are similar but not identical APIs. But they share a common set of stages:
- Importing data for training
- Train models with multiple algorithms and hyperparameters
- Evaluate model performance and choose one or more for deployment
- Deploy selected models
- Predict results with new data against deployed models
- Review performance of deployed models
Since ITEDPR is hard to remember we refer to this pipeline by its conveniently mnemonic anagram: "PREDIT" (French for "predict"). The A2ML project provides classes which implement this pipeline for various Cloud AutoML providers and a command line interface that invokes stages of the pipeline.
Setup
A2ML is distributed as a python package, so to install it:
$ pip install -U a2ml
It will install Auger provider.
To use Azure AutoML:
Mac:
$ brew install libomp
For Mac OS High Sierra and below:
$ SKLEARN_NO_OPENMP=1 pip install "scikit-learn==0.21.3"
$ pip install "a2ml[azure]" --ignore-installed onnxruntime onnx nimbusml
Linux:
$ apt-get update && apt-get -y install gcc g++ libgomp1
$ pip install "a2ml[azure]"
To use Google Cloud:
$ pip install "a2ml[google]"
To install everything including testing and server code:
$ pip install "a2ml[all]"
Development
To release a new version the flow should be:
-
Change the
__version__
variable ina2ml/__init__.py
to match what you want to release, minus the "v". By default it would be ".dev0", for example "0.3.0.dev0". This ensures we don’t accidentally release a dev version to pypi.org. So for when we’re ready to release 0.3.0, the__version__
variable should simply be "0.3.0". -
Commit and push the changes above.
git tag v<the-version> (for example: git tag v0.3.0)
git push --tags
- verify circleci build passed and docker image tag exists:
pip install -U a2ml==0.3.0
docker pull augerai/a2ml:v0.3.0
- Increment the
__version__
variable ina2ml/__init__.py
to the next version in the current milestone. For example, "0.3.1.dev0"
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file a2ml-1.0.97.tar.gz
.
File metadata
- Download URL: a2ml-1.0.97.tar.gz
- Upload date:
- Size: 141.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.18
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | efd9a9f17bc89e76025a261a7d4f7b5fc7186d2779177ed82cfbed97717cfd72 |
|
MD5 | e97f7c97aaf8e2b551074668a6b845dc |
|
BLAKE2b-256 | 2eec1665838c2a0dedba5d6f8d79c1c5a7d3b991170f91c8a795039a438b6bce |
File details
Details for the file a2ml-1.0.97-py3-none-any.whl
.
File metadata
- Download URL: a2ml-1.0.97-py3-none-any.whl
- Upload date:
- Size: 189.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.18
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | eebe92f7a0ac0b3c24d4a3fd01ec169b7f3f3ff0fd148cc483354a7fa05414b8 |
|
MD5 | da29ee5e54369c4a6ab476aa2196e18a |
|
BLAKE2b-256 | 9f61d29a61c3d7f2b78e502ae00562b54403908da58584a3f32639e572c94d6d |