Utilities for scikit-learn.
Project description
Sklearn Utilities
Utilities for scikit-learn.
Installation
Install this via pip (or your favourite package manager):
pip install sklearn-utilities
API
See Docs for more information.
EstimatorWrapperBase
: base class for wrappers. Redirects all attributes which are not in the wrapper to the wrapped estimator.DataFrameWrapper
: tries to convert every estimator output to a pandas DataFrame or Series.FeatureUnionPandas
: aFeatureUnion
that works with pandas DataFrames.IncludedColumnTransformerPandas
,ExcludedColumnTransformerPandas
: select columns by name.AppendPredictionToX
: appends the prediction of y to X.AppendXPredictionToX
: appends the prediction of X to X.DropByNoisePrediction
: drops columns which has high importance in predicting noise.DropMissingColumns
: drops columns with missing values above a threshold.DropMissingRowsY
: drops rows with missing values in y. Usefeature_engine.DropMissingData
for X.IntersectXY
: drops rows where the index of X and y do not intersect. Use withfeature_engine.DropMissingData
.ReindexMissingColumns
: reindexes columns of X intransform()
to match the columns of X infit()
.ReportNonFinite
: reports non-finite values in X and/or y.IdTransformer
: a transformer that does nothing.RecursiveFitSubtractRegressor
: a regressor that recursively fits a regressor and subtracts the prediction from the target.SmartMultioutputEstimator
: aMultiOutputEstimator
that supports tuple of arrays inpredict()
and supports pandasSeries
andDataFrame
.until_event()
,since_event()
: calculates the time since or until events (Series[bool]
)ComposeVarEstimator
: composes mean and std/var estimators.DummyRegressorVar
:DummyRegressor
that returns 1.0 for std/var.TransformedTargetRegressorVar
:TransformedTargetRegressor
with std/var support.StandardScalerVar
:StandardScaler
with std/var support.EvalSetWrapper
,CatBoostProgressBarWrapper
: wrapper that passeseval_set
tofit()
usingtrain_test_split()
, mainly forCatBoost
. The latter shows progress bar (usingtqdm
) as well. Useful for early stopping. For LightGBM, seelightgbm-callbacks
.
sklearn_utilities.dataset
add_missing_values()
: adds missing values to a dataset.
sklearn_utilities.torch
PCATorch
: faster PCA using PyTorch with GPU support.
sklearn_utilities.torch.skorch
SkorchReshaper
,SkorchCNNReshaper
: reshapes X and y fornn.Linear
andnn.Conv1d/2d
respectively. (Fornn.Conv2d
, usesnp.sliding_window_view()
.)AllowNaN
: wraps a loss module and assign 0 to y and y_hat for indices where y contains NaN inforward()
..
See also
Contributors ✨
Thanks goes to these wonderful people (emoji key):
This project follows the all-contributors specification. Contributions of any kind welcome!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
sklearn_utilities-0.5.0.tar.gz
(30.3 kB
view hashes)
Built Distribution
Close
Hashes for sklearn_utilities-0.5.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8a0a6c0d0f1716a846dbb62e6c6e5d8e04c2171a203566b1a878208eea7e86a4 |
|
MD5 | 4d38fb62b4c12669e4dc5a41f8dd725e |
|
BLAKE2b-256 | 915df55c483b89ffdae6da8729cf31f267958394b85658391e714fa1a245c86b |