State-of-the art Automated Machine Learning python library for Tabular Data
Project description
AutoML Alex
State-of-the art Automated Machine Learning python library for Tabular Data
Works with Tasks:
-
Binary Classification
-
Regression
-
Multiclass Classification (in progress...)
Benchmark Results
The bigger, the better
From AutoML-Benchmark
Scheme
Features
- Automated Data Clean (Auto Clean)
- Automated Feature Engineering (Auto FE)
- Smart Hyperparameter Optimization (HPO)
- Feature Generation
- Feature Selection
- Models Selection
- Cross Validation
- Optimization Timelimit and EarlyStoping
- Save and Load (Predict new data)
Installation
pip install automl-alex
Docs
🚀 Examples
Classifier:
from automl_alex import AutoMLClassifier
model = AutoMLClassifier()
model.fit(X_train, y_train, timeout=600)
predicts = model.predict(X_test)
Regression:
from automl_alex import AutoMLRegressor
model = AutoMLRegressor()
model.fit(X_train, y_train, timeout=600)
predicts = model.predict(X_test)
DataPrepare:
from automl_alex import DataPrepare
de = DataPrepare()
X_train = de.fit_transform(X_train)
X_test = de.transform(X_test)
Simple Models Wrapper:
from automl_alex import LightGBMClassifier
model = LightGBMClassifier()
model.fit(X_train, y_train)
predicts = model.predict_proba(X_test)
model.opt(X_train, y_train,
timeout=600, # optimization time in seconds,
)
predicts = model.predict_proba(X_test)
More examples in the folder ./examples:
- 01_Quick_Start.ipynb
- 02_Data_Cleaning_and_Encoding_(DataPrepare).ipynb
- 03_Models.ipynb
- 04_ModelsReview.ipynb
- 05_BestSingleModel.ipynb
- Production Docker template
What's inside
It integrates many popular frameworks:
- scikit-learn
- XGBoost
- LightGBM
- CatBoost
- Optuna
- ...
Works with Features
-
Categorical Features
-
Numerical Features
-
Binary Features
-
Text
-
Datetime
-
Timeseries
-
Image
Note
- With a large dataset, a lot of memory is required! Library creates many new features. If you have a large dataset with a large number of features (more than 100), you may need a lot of memory.
Realtime Dashboard
Works with optuna-dashboard
Run
$ optuna-dashboard sqlite:///db.sqlite3
Road Map
-
Feature Generation
-
Save/Load and Predict on New Samples
-
Advanced Logging
-
Add opt Pruners
-
DL Encoders
-
Add More libs (NNs)
-
Multiclass Classification
-
Build pipelines
-
Docs Site
Contact
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for automl_alex-1.3.7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 930f01667698a50b6072e5e37fec23441cb52738b1c411ffc16a3c22e246ffb9 |
|
MD5 | 00cbf88c3d187fc844af5321ba1ebfeb |
|
BLAKE2b-256 | 2ef3b189df2dc9ac7b870b243b54d4e3556a9431161957efbafb75ba5d526ee6 |