Fully automated end to end machine learning pipeline
Project description
Amplo - AutoML (for Machine Data)
Welcome to the Automated Machine Learning package Amplo
. Amplo's AutoML is designed specifically for machine data and
works very well with tabular time series data (especially unbalanced classification!).
Though this is a standalone Python package, Amplo's AutoML is also available on Amplo's ML Developer Platform. With a graphical user interface and various data connectors, it is the ideal place for service engineers to get started on Predictive Maintenance development.
Amplo's AutoML Pipeline contains the entire Machine Learning development cycle, including exploratory data analysis, data cleaning, feature extraction, feature selection, model selection, hyper parameter optimization, stacking, version control, production-ready models and documentation.
Downloading Amplo
The easiest way is to install our Python package through PyPi:
pip install Amplo
2. Amplo AutoML Features
Exploratory Data Analysis
from Amplo.AutoML import DataExplorer
Automated Exploratory Data Analysis. Covers binary classification and regression.
It generates:
- Missing Values Plot
- Line Plots of all features
- Box plots of all features
- Co-linearity Plot
- SHAP Values
- Random Forest Feature Importance
- Predictive Power Score
Additionally fFor Regression:
- Seasonality Plots
- Differentiated Variance Plot
- Auto Correlation Function Plot
- Partial Auto Correlation Function Plot
- Cross Correlation Function Plot
- Scatter Plots
Data Processing
from Amplo.AutoML import DataProcesser
Automated Data Cleaning. Handles the following items:
- Cleans Column Names
- Duplicate Columns and Rows
- Data Types
- Missing Values
- Outliers
- Constant Columns
Feature Processing
from Amplo.AutoML import FeatureProcesser
Automatically extracts and selects features. Removes Co-Linear Features.
Included Feature Extraction algorithms:
- Multiplicative Features
- Dividing Features
- Additive Features
- Subtractive Features
- Trigonometric Features
- K-Means Features
- Lagged Features
- Differencing Features
Included Feature Selection algorithms:
- Random Forest Feature Importance (Threshold and Increment)
- Predictive Power Score
- Boruta
Sequencing
from Amplo.AutoML import Sequencer
For timeseries regression problems, it is often useful to include multiple previous samples instead of just the latest.
This class sequences the data, based on which time steps you want included in the in- and output.
This is also very useful when working with tensors, as a tensor can be returned which directly fits into a Recurrent Neural Network.
Modelling
from Amplo.AutoML import Modeller
Runs various regression or classification models.
Includes:
- Scikit's Linear Model
- Scikit's Random Forest
- Scikit's Bagging
- Scikit's GradientBoosting
- Scikit's HistGradientBoosting
- DMLC's XGBoost
- Catboost's Catboost
- Microsoft's LightGBM
Grid Search
from Amplo.GridSearch import *
Contains three hyperparameter optimizers, a basic GridSearch
, an implementation of Scikit's RandomHalvingSearch
and
an implementation of Optuna's Tree-structured Parzen Estimator. Generally we advice to use Optuna.
Automatic Documntation
from Amplo.AutoML import Documenter
Contains a documenter for classification (binary
and multiclass
prolems), as well as for regression.
Creates a pdf report for a Pipeline, including metrics, data processing steps, and everything else to recreate the result.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file Amplo-0.5.9.tar.gz
.
File metadata
- Download URL: Amplo-0.5.9.tar.gz
- Upload date:
- Size: 70.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.9.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e09c1ef8d0d6df2650de0336eaec0aedf92a860f97d409835bf221852f7760a3 |
|
MD5 | 827c635270f9d203f13950fb522a1459 |
|
BLAKE2b-256 | 7c33a0cce8e45119feafbcd6aa21346e15fb95610920e32f2ba39bf7e4630828 |
File details
Details for the file Amplo-0.5.9-py3-none-any.whl
.
File metadata
- Download URL: Amplo-0.5.9-py3-none-any.whl
- Upload date:
- Size: 92.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.9.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 34da4627a70810487c256d0cb3dfd4c612d402fac52f4a3c8dcb8e580be71d79 |
|
MD5 | ec46842175219d4623d17c606e41cd75 |
|
BLAKE2b-256 | d9a8167c8a47d121770f3b1c863c6b43da3cb2c7eac127fbfcbe207d399bbe5d |