
Tools to extend sklearn


sktools

sktools provides tools that extend sklearn, including several transformers for feature engineering.


To install sktools, run this command in your terminal:

$ pip install sktools


Can be found in


from sktools import IsEmptyExtractor

from sklearn.linear_model import LogisticRegression
from sklearn.pipeline import Pipeline


# Missing-value indicator step followed by a logistic regression model
mod = Pipeline([
    ("impute-features", IsEmptyExtractor()),
    ("model", LogisticRegression())
])
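
As a rough illustration of what a missing-value indicator step can contribute to a pipeline like this one, here is a minimal pure-Python sketch. It is not the sktools implementation: the function name `is_empty_indicators` and the list-of-rows input format are assumptions made only for this example.

```python
import math


def is_empty_indicators(rows):
    """Return one 0/1 flag per cell: 1 if the value is missing (None or NaN)."""
    def is_missing(value):
        # Hypothetical definition of "missing" for this sketch: None or float NaN.
        return value is None or (isinstance(value, float) and math.isnan(value))

    return [[int(is_missing(value)) for value in row] for row in rows]


rows = [
    [1.0, None],
    [float("nan"), 3.5],
    [2.0, 4.0],
]
flags = is_empty_indicators(rows)
print(flags)  # [[0, 1], [1, 0], [0, 0]]
```

A downstream model can use flags like these to learn from the fact that a value was absent, which plain imputation would otherwise hide.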



Here’s a list of features that sktools currently offers:

  • sktools.encoders.NestedTargetEncoder performs target encoding suited for variables with nesting.

  • sktools.encoders.QuantileEncoder performs target aggregation using a quantile instead of the mean.

  • sktools.preprocessing.CyclicFeaturizer converts numeric features to cyclical features via sine and cosine transformations.

  • sktools.impute.IsEmptyExtractor creates binary variables indicating whether values are missing.

  • sktools.matrix_denser.MatrixDenser is a transformer that converts sparse matrices to dense ones.

  • sktools.quantilegroups.GroupedQuantileTransformer creates quantiles of a feature by group.

  • sktools.quantilegroups.PercentileGroupFeaturizer creates features regarding how an instance compares with a quantile of its group.

  • sktools.quantilegroups.MeanGroupFeaturizer creates features regarding how an instance compares with the mean of its group.

  • sktools.selectors.TypeSelector selects variables matching a given type.

  • sktools.selectors.ItemsSelector lets you choose variables manually.

  • sktools.ensemble.MedianForestRegressor uses the median instead of the mean when aggregating the trees' predictions.

  • sktools.linear_model.QuantileRegression is an sklearn-style wrapper for quantile regression.

  • sktools.model_selection.BootstrapFold is a bootstrap cross-validator.

  • sktools.GradientBoostingFeatureGenerator provides automated feature generation through gradient boosting.
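
To make the sine and cosine idea behind cyclical features concrete, here is a small standalone sketch using only the standard library. It illustrates the general technique and is not CyclicFeaturizer's actual code: the helper `cyclic_features` and its `period` argument are hypothetical names chosen for this example.

```python
import math


def cyclic_features(value, period):
    """Map a periodic value onto the unit circle as a (sine, cosine) pair."""
    angle = 2 * math.pi * (value % period) / period
    return math.sin(angle), math.cos(angle)


# Hours 23 and 0 are adjacent on the clock, so their encodings are close,
# while hours 0 and 12 sit on opposite sides of the circle.
h23 = cyclic_features(23, 24)
h0 = cyclic_features(0, 24)
h12 = cyclic_features(12, 24)
print(math.dist(h23, h0) < math.dist(h0, h12))  # True
```

Using both the sine and the cosine keeps the mapping unambiguous: either one alone would assign the same value to two different positions in the cycle.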


MIT license


This package was created with Cookiecutter and the audreyr/cookiecutter-pypackage project template.


0.1.4 (2021-03-20)

  • Gradient boosting feature regressor

0.1.3 (2020-07-13)

  • Bootstrap cross-validation

  • Cyclic featurizer

0.1.2 (2020-06-24)

  • L1 linear model and random forest

  • Quantile encoder refactor

0.1.1 (2020-06-10)

  • Refactor code, add group featurizers

0.1.0 (2020-04-19)

  • First release on PyPI.

