A Python AutoML tool for fast exploration and experimentation of supervised machine learning pipelines.

These details have not been verified by PyPI

Project links

Project description

ATOM

Automated Tool for Optimized Modelling

Author: tvdboom
Email: m.524687@gmail.com

Description

Automated Tool for Optimized Modelling (ATOM) is a python package designed for fast exploration and experimentation of supervised machine learning tasks. With just a few lines of code, you can perform basic data cleaning steps, feature selection and compare the performance of multiple models on a given dataset. ATOM should be able to provide quick insights on which algorithms perform best for the task at hand and provide an indication of the feasibility of the ML solution. This package supports binary classification, multiclass classification, and regression tasks.

NOTE: A data scientist with domain knowledge can outperform ATOM if he applies usecase-specific feature engineering or data cleaning steps!

Possible steps taken by the ATOM pipeline:

Data Cleaning
- Handle missing values
- Encode categorical features
- Balance the dataset
- Remove outliers
Perform feature selection
- Remove features with too high collinearity
- Remove features with too low variance
- Select best features according to a chosen strategy
Fit all selected models (either direct or via successive halving)
- Select hyperparameters using a Bayesian Optimization approach
- Perform bagging to assess the robustness of the model
Analyze the results using the provided plotting functions!

diagram

Installation

Intall ATOM easily using pip.

NOTE: Since atom was already taken, the name of the package in pypi is `atom-ml`!

	pip install atom-ml

Usage

Call the ATOMClassifier or ATOMRegressor class and provide the data you want to use:

from atom import ATOMClassifier  

atom = ATOMClassifier(X, y, log='auto', n_jobs=2, verbose=2)

ATOM has multiple data cleaning methods to help you prepare the data for modelling:

atom.impute(strat_num='knn', strat_cat='most_frequent',  max_frac_rows=0.1)  
atom.encode(max_onehot=10, frac_to_other=0.05)  
atom.outliers(max_sigma=4)  
atom.balance(oversample=0.8, n_neighbors=15)  
atom.feature_selection(strategy='univariate', solver='chi2', max_features=0.9)

Run the pipeline with different models:

atom.pipeline(models=['LR', 'LDA', 'XGB', 'lSVM'],
              metric='f1',
              max_iter=10,
              max_time=1000,
              init_points=3,
              cv=4,
              bagging=10)

Make plots and analyze results:

atom.plot_bagging(filename='bagging_results.png')  
atom.lSVM.plot_probabilities()  
atom.lda.plot_confusion_matrix()

Documentation

For further information about ATOM, please see the project documentation.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

6.1.0

Jul 5, 2024

6.0.1

Mar 7, 2024

6.0.0

Mar 7, 2024

5.2.0

Jun 14, 2023

5.1.2

May 7, 2023

5.1.1

Mar 16, 2023

5.1.0

Mar 4, 2023

5.0.1

Nov 29, 2022

5.0.0

Nov 28, 2022

4.14.1

Jul 18, 2022

4.14.0

Jul 17, 2022

4.13.1

Apr 5, 2022

4.13.0

Apr 4, 2022

4.12.0

Feb 24, 2022

4.11.0

Jan 30, 2022

4.10.0

Dec 17, 2021

4.9.1

Oct 30, 2021

4.9.0

Oct 27, 2021

4.8.0

Sep 29, 2021

4.7.3

Sep 11, 2021

4.7.2

Sep 11, 2021

4.7.1

Sep 11, 2021

4.7.0

Sep 10, 2021

4.6.0

Jun 28, 2021

4.5.0

May 31, 2021

4.4.0

Mar 29, 2021

4.3.0

Mar 2, 2021

4.2.1

Dec 29, 2020

4.2.0

Dec 28, 2020

4.1.0

Oct 16, 2020

4.0.1

Sep 29, 2020

4.0.0

Sep 28, 2020

3.3.0

Apr 24, 2020

3.2.0

Mar 30, 2020

3.1.0

Mar 8, 2020

3.0.2

Feb 17, 2020

3.0.1

Feb 15, 2020

This version

3.0.0

Feb 13, 2020

2.4.0

Jan 26, 2020

2.3.0

Dec 13, 2019

2.2.0

Dec 3, 2019

2.1.2

Nov 27, 2019

2.1.1

Nov 22, 2019

2.1.0

Nov 8, 2019

2.0.3

Nov 1, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

atom-ml-3.0.0.tar.gz (44.2 kB view hashes)

Uploaded Feb 13, 2020 Source

Hashes for atom-ml-3.0.0.tar.gz

Hashes for atom-ml-3.0.0.tar.gz
Algorithm	Hash digest
SHA256	`e5f169b6a35e9bf09e3c63669637a226905c9d0683d210f26a6e5b2374a9feb6`
MD5	`3c63c1a527f82f249f30030bbd341623`
BLAKE2b-256	`4e5943f471b1c897efc00fc22f25995636d3d67c159ebd6cb45a6bc2ee646db3`