AutoML for Text, Image, and Tabular Data
Project description
AutoML for Text, Image, and Tabular Data
AutoGluon automates machine learning tasks enabling you to easily achieve strong predictive performance in your applications. With just a few lines of code, you can train and deploy high-accuracy machine learning and deep learning models on text, image, and tabular data.
Example
# First install package from terminal:
# python3 -m pip install -U pip
# python3 -m pip install -U setuptools wheel
# python3 -m pip install autogluon # autogluon==0.3.1
from autogluon.tabular import TabularDataset, TabularPredictor
train_data = TabularDataset('https://autogluon.s3.amazonaws.com/datasets/Inc/train.csv')
test_data = TabularDataset('https://autogluon.s3.amazonaws.com/datasets/Inc/test.csv')
predictor = TabularPredictor(label='class').fit(train_data, time_limit=120) # Fit models for 120s
leaderboard = predictor.leaderboard(test_data)
AutoGluon Task | Quickstart | API |
---|---|---|
TabularPredictor | ||
TextPredictor | ||
ImagePredictor | ||
ObjectDetector |
Resources
See the AutoGluon Website for documentation and instructions on:
-
- Tips to maximize accuracy (if benchmarking, make sure to run
fit()
with argumentpresets='best_quality'
).
- Tips to maximize accuracy (if benchmarking, make sure to run
-
More advanced topics such as Neural Architecture Search
Scientific Publications
- AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data (Arxiv, 2020)
- Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation (NeurIPS, 2020)
- Multimodal AutoML on Structured Tables with Text Fields (ICML AutoML Workshop, 2021)
Articles
- AutoGluon for tabular data: 3 lines of code to achieve top 1% in Kaggle competitions (AWS Open Source Blog, Mar 2020)
- Accurate image classification in 3 lines of code with AutoGluon (Medium, Feb 2020)
- AutoGluon overview & example applications (Towards Data Science, Dec 2019)
Hands-on Tutorials
Train/Deploy AutoGluon in the Cloud
- AutoGluon-Tabular on AWS Marketplace
- Running AutoGluon-Tabular on Amazon SageMaker
- Running AutoGluon Image Classification on Amazon SageMaker
Citing AutoGluon
If you use AutoGluon in a scientific publication, please cite the following paper:
Erickson, Nick, et al. "AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data." arXiv preprint arXiv:2003.06505 (2020).
BibTeX entry:
@article{agtabular,
title={AutoGluon-Tabular: Robust and Accurate AutoML for Structured Data},
author={Erickson, Nick and Mueller, Jonas and Shirkov, Alexander and Zhang, Hang and Larroy, Pedro and Li, Mu and Smola, Alexander},
journal={arXiv preprint arXiv:2003.06505},
year={2020}
}
If you are using AutoGluon Tabular's model distillation functionality, please cite the following paper:
Fakoor, Rasool, et al. "Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation." Advances in Neural Information Processing Systems 33 (2020).
BibTeX entry:
@article{agtabulardistill,
title={Fast, Accurate, and Simple Models for Tabular Data via Augmented Distillation},
author={Fakoor, Rasool and Mueller, Jonas W and Erickson, Nick and Chaudhari, Pratik and Smola, Alexander J},
journal={Advances in Neural Information Processing Systems},
volume={33},
year={2020}
}
If you use AutoGluon's multimodal text+tabular functionality in a scientific publication, please cite the following paper:
Shi, Xingjian, et al. "Multimodal AutoML on Structured Tables with Text Fields." 8th ICML Workshop on Automated Machine Learning (AutoML). 2021.
BibTeX entry:
@inproceedings{agmultimodaltext,
title={Multimodal AutoML on Structured Tables with Text Fields},
author={Shi, Xingjian and Mueller, Jonas and Erickson, Nick and Li, Mu and Smola, Alex},
booktitle={8th ICML Workshop on Automated Machine Learning (AutoML)},
year={2021}
}
AutoGluon for Hyperparameter Optimization
AutoGluon also provides state-of-the-art tools for hyperparameter optimization, such as for example ASHA, Hyperband, Bayesian Optimization and BOHB.
To get started, checkout our paper "Model-based Asynchronous Hyperparameter and Neural Architecture Search" arXiv preprint arXiv:2003.10865 (2020).
@article{abohb,
title={Model-based Asynchronous Hyperparameter and Neural Architecture Search},
author={Klein, Aaron and Tiao, Louis and Lienart, Thibaut and Archambeau, Cedric and Seeger, Matthias},
journal={arXiv preprint arXiv:2003.10865},
year={2020}
}
License
This library is licensed under the Apache 2.0 License.
Contributing to AutoGluon
We are actively accepting code contributions to the AutoGluon project. If you are interested in contributing to AutoGluon, please read the Contributing Guide to get started.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file autogluon.features-0.3.2b20220102.tar.gz
.
File metadata
- Download URL: autogluon.features-0.3.2b20220102.tar.gz
- Upload date:
- Size: 42.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.10.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.7.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f33071b156ea6b820657ace1057abed15d9bdbf67f9776a6a71a20a90c90c784 |
|
MD5 | 7adcf21b24709d38cb13236a352d69cc |
|
BLAKE2b-256 | 6521b74883716a4ae5f831013132d9a2c04348877f693086da9cc9ad48781f6c |
File details
Details for the file autogluon.features-0.3.2b20220102-py3-none-any.whl
.
File metadata
- Download URL: autogluon.features-0.3.2b20220102-py3-none-any.whl
- Upload date:
- Size: 59.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.10.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.7.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8ce0ca91fe545c64d38d0ee8aa9889a323c750757315188b9a3049b9c638f8b4 |
|
MD5 | 0b1bfba58e2149efb39ce5d0348c2a88 |
|
BLAKE2b-256 | 36f7a2959d62825036cf713b33be177f97ff7a8b12d7b685b0b7c5024df8a2e8 |