scikit-learn tools for machine learning pipelines

Project description

scikit-duplo

Very simple reusable blocks for scikit-learn pipelines (inspired by scikit-lego)

License: MIT

Installation

Installation from the source tree:

python setup.py install

Or via pip from PyPI:

pip install scikit-duplo

Contents

The scikit-duplo package contains multiple classes that you can use in a scikit-learn compatible pipeline. The meta subdirectory holds ensemble learning classes. Each expects you to pass in several other scikit-learn compatible estimators, which it uses to build an ensemble of models that predicts the target variable.
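
The general pattern of handing several estimators to an ensemble wrapper can be sketched with scikit-learn's own VotingRegressor. This is a stand-in, not a scikit-duplo class: the signatures of the classes in the meta subdirectory are not listed here, so the data, estimator choices, and step names below are assumptions made for illustration only.

```python
import numpy as np
from sklearn.ensemble import RandomForestRegressor, VotingRegressor
from sklearn.linear_model import LinearRegression
from sklearn.pipeline import Pipeline
from sklearn.preprocessing import StandardScaler

# Synthetic regression data, for illustration only.
rng = np.random.default_rng(0)
X = rng.normal(size=(200, 3))
y = 2.0 * X[:, 0] - X[:, 1] + rng.normal(scale=0.1, size=200)

# Stand-in for a scikit-duplo meta class: scikit-learn's VotingRegressor
# averages the predictions of the estimators passed to it.
ensemble = VotingRegressor(
    estimators=[
        ("linear", LinearRegression()),
        ("forest", RandomForestRegressor(n_estimators=20, random_state=0)),
    ]
)

# The ensemble is used as the final step of an ordinary pipeline.
pipeline = Pipeline([("scale", StandardScaler()), ("model", ensemble)])
pipeline.fit(X, y)
predictions = pipeline.predict(X[:5])
```

A scikit-duplo meta class would presumably take the place of VotingRegressor as the final pipeline step; check the package documentation for its actual constructor arguments.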

The preprocessing subdirectory holds feature engineering classes. These are ColumnTransformer-compatible classes that expect to receive a dataframe and a set of column names, and they transform those columns for the downstream pipeline steps.

LookupEncoder allows you to provide a custom dictionary of values for encoding categorical variables.
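
To make the idea of dictionary-based encoding inside a ColumnTransformer concrete, here is a minimal sketch. DictLookupSketch is a hypothetical class written for this example and is not scikit-duplo's LookupEncoder; its parameter names (lookup, default) and its handling of unseen categories are assumptions.

```python
import pandas as pd
from sklearn.base import BaseEstimator, TransformerMixin
from sklearn.compose import ColumnTransformer


class DictLookupSketch(BaseEstimator, TransformerMixin):
    """Hypothetical encoder: maps categories to numbers via a user-supplied dict.

    Not scikit-duplo's LookupEncoder; the parameter names are assumptions.
    """

    def __init__(self, lookup=None, default=-1.0):
        self.lookup = lookup
        self.default = default

    def fit(self, X, y=None):
        # Nothing is learned from the data; the mapping is supplied up front.
        return self

    def transform(self, X):
        X = pd.DataFrame(X)
        mapping = self.lookup or {}
        # Apply the mapping column by column; unseen values get the default.
        encoded = X.apply(lambda col: col.map(mapping).fillna(self.default))
        return encoded.to_numpy(dtype=float)


df = pd.DataFrame(
    {
        "size": ["small", "large", "medium", "huge"],
        "weight": [1.0, 3.0, 2.0, 9.0],
    }
)
size_lookup = {"small": 1, "medium": 2, "large": 3}

# Encode only the "size" column and pass the remaining columns through.
transformer = ColumnTransformer(
    [("size", DictLookupSketch(lookup=size_lookup), ["size"])],
    remainder="passthrough",
)
encoded = transformer.fit_transform(df)
```

In this sketch "huge" is absent from the dictionary, so it falls back to the default value. Whether the real LookupEncoder uses a fallback of this kind is not stated here.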

Download files

Source Distribution

scikit-duplo-0.1.10.tar.gz (7.5 kB)
