Skip to main content

Helpers to speed up and structure machine learning projects

Project description

ML Helper


Helpers to speed up and structure machine learning projects.

The library is available in Pypi

Installing


The easiest way to install ml-helper is through pip

pip install ml-helper

To use it in your project, you must first import the library

from ml_helper.helper import Helper

And then create a Helper object with a dictionary of keys related to your project

KEYS = {
    'SEED': 1,
    'TARGET': 'y',
    'METRIC': 'r2',
    'TIMESERIES': True,
    'SPLITS': 5
}

hp = Helper(KEYS)

After this, you may use the helper object's many functions

Dependencies

ML-Helper requires:

  • Python (>3.5)
  • Numpy (>=1.16)
  • Pandas (>=0.23.4)
  • Seaborn (>=0.9)
  • Scikit-learn (>=0.20)
  • Natplotlib (>=3)
  • Scipy (>=1)
  • Imblearn
  • Vecstack

Functionality


The functionality is separated into 4 groups:

  • Data Exploration
    • Missing Data
    • Boxplot of numerical variables
    • Coefficient of variation
    • Correlation (numerical and categorical)
    • Under Represented Features
    • Target Variable Distribution
    • Feature Importance
    • PCA Component Variance
  • Data Preparation
    • Convert features to categories
    • Drop multiple columns
  • Modeling
    • Cross Validation (with stratified kfolds, or time series split depending on use case)
      • Randomized Grid Search
    • Pipeline: Collection of models and pipeline steps that get performed and scored
    • Predict: Predict on unseen data
    • Stack Predict: Build a stacked model and perform a prediction
  • Regression
    • Plots for predictions
  • Classification
    • ROC Curve
    • Classification Report
  • Others
    • Select features based on types
    • Split X and y
    • Plot models/pipelines

Working Examples


If you wish to see the library in use, you may view the notebooks in the examples section.

Also, you can see the implementation in their corresponding Kaggle Kernels:

ML-Helper Coding Style


Ml-Helper complies to PEP8 and uses black for coding standards

Versioning


SemVer is used for versioning.

License


This project is licensed under the MIT License - see the License file for details

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ml-helper-0.0.23.tar.gz (8.5 kB view details)

Uploaded Source

Built Distribution

ml_helper-0.0.23-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file ml-helper-0.0.23.tar.gz.

File metadata

  • Download URL: ml-helper-0.0.23.tar.gz
  • Upload date:
  • Size: 8.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.0 requests-toolbelt/0.9.1 tqdm/4.29.1 CPython/3.7.2

File hashes

Hashes for ml-helper-0.0.23.tar.gz
Algorithm Hash digest
SHA256 e29deda53f2a36cb18c9b0c9ba65e0b032e5cb9b9e627a31e8001e6bb74dc9bd
MD5 92510d06cbaa464f54a57b737c56aeed
BLAKE2b-256 030ac86dad9ea551af33a4d078808754429514a6051a7b6bf24428ab6817eff0

See more details on using hashes here.

File details

Details for the file ml_helper-0.0.23-py3-none-any.whl.

File metadata

  • Download URL: ml_helper-0.0.23-py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.0 requests-toolbelt/0.9.1 tqdm/4.29.1 CPython/3.7.2

File hashes

Hashes for ml_helper-0.0.23-py3-none-any.whl
Algorithm Hash digest
SHA256 21425d32520be96616ef6acaae323e1907be7ca194eaa179099ed9edd5d489ce
MD5 c4ac8c5aa537091b386a2380ae4bb7da
BLAKE2b-256 ed1246ca2e9ca20eb56074d9abdde323e957d0681f504320e8869a7caabc2bb1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page