Helpers to speed up and structure machine learning projects
Project description
ML Helper
Helpers to speed up and structure machine learning projects.
The library is available in Pypi
Installing
The easiest way to install ml-helper is through pip
pip install ml-helper
To use it in your project, you must first import the library
from ml_helper.helper import Helper
And then create a Helper object with a dictionary of keys related to your project
KEYS = {
'SEED': 1,
'TARGET': 'y',
'METRIC': 'r2',
'TIMESERIES': True,
'SPLITS': 5
}
hp = Helper(KEYS)
After this, you may use the helper object's many functions
Dependencies
ML-Helper requires:
- Python (>3.5)
- Numpy (>=1.16)
- Pandas (>=0.23.4)
- Seaborn (>=0.9)
- Scikit-learn (>=0.20)
- Natplotlib (>=3)
- Scipy (>=1)
- Imblearn
- Vecstack
Functionality
The functionality is separated into 4 groups:
- Data Exploration
- Missing Data
- Boxplot of numerical variables
- Coefficient of variation
- Correlation (numerical and categorical)
- Under Represented Features
- Target Variable Distribution
- Feature Importance
- PCA Component Variance
- Data Preparation
- Convert features to categories
- Drop multiple columns
- Modeling
- Cross Validation (with stratified kfolds, or time series split depending on use case)
- Randomized Grid Search
- Pipeline: Collection of models and pipeline steps that get performed and scored
- Predict: Predict on unseen data
- Stack Predict: Build a stacked model and perform a prediction
- Cross Validation (with stratified kfolds, or time series split depending on use case)
- Regression
- Plots for predictions
- Classification
- ROC Curve
- Classification Report
- Others
- Select features based on types
- Split X and y
- Plot models/pipelines
Working Examples
If you wish to see the library in use, you may view the notebooks in the examples section.
Also, you can see the implementation in their corresponding Kaggle Kernels:
ML-Helper Coding Style
Ml-Helper complies to PEP8 and uses black
for coding standards
Versioning
SemVer is used for versioning.
License
This project is licensed under the MIT License - see the License file for details
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for ml_helper-0.0.21-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ae0c5914fc2fe060fefe2a6e286c2b142ffb9caa14e2fb8665439b36248f0a40 |
|
MD5 | dd5eaa64a63759025f2762232816ac9c |
|
BLAKE2b-256 | 75e8c89af0adb34f6fdc78dd53da0492ba67f57543c4d8cb9488a2585cb43801 |