Skip to main content

Sklearn transformers that work with Pandas dataframes

Project description

sklearn-pdtransform
-------------------

A little module with a few transformers to work with Pandas dataframes in the
Sklearn pipeline. Example usage:

.. code:: python
pipeline = Pipeline([
('ordinal_to_nums', DFTransform(_ordinal_to_nums, copy=True)),
('union', DFFeatureUnion([
('categorical', Pipeline([
('select', DFTransform(lambda X: X.select_dtypes(include=['object']))),
('fill_na', DFTransform(lambda X: X.fillna('NA'))),
('one_hot', DFTransform(_one_hot_encode)),
])),
('numerical', Pipeline([
('select', DFTransform(lambda X: X.select_dtypes(exclude=['object']))),
('fill_median', DFTransform(lambda X: X.fillna(X.median()))),
('add_features', DFTransform(_add_features, copy=True)),
('remove_skew', DFTransform(_remove_skew, copy=True)),
('find_outliers', DFTransform(_find_outliers, copy=True)),
('normalize', DFTransform(lambda X: X.div(X.max())))
])),
])),
])


For more information read `this blog post <http://signal-to-noise.xyz/why-you-should-use-scikit-learns-pipeline-object.html>`_.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pdtransform-0.1.tar.gz (2.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pdtransform-0.1-py2.py3-none-any.whl (3.6 kB view details)

Uploaded Python 2Python 3

File details

Details for the file pdtransform-0.1.tar.gz.

File metadata

  • Download URL: pdtransform-0.1.tar.gz
  • Upload date:
  • Size: 2.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for pdtransform-0.1.tar.gz
Algorithm Hash digest
SHA256 e7e765fa72a37b3eab22d322de75ecf3e7293401b4a7b944124ad65741350df1
MD5 fc70d25618866deb7430f8627a66c39d
BLAKE2b-256 f5f239cc8de1033b841b7e33c18d7d21f91c02669c006d6c58911f9aaac54426

See more details on using hashes here.

File details

Details for the file pdtransform-0.1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for pdtransform-0.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 05203e7b552fc5782ab7e8ed8abb6b22d39a14292947417e5702ae35f376cfa1
MD5 aee6860fae7cd3b050627ae08b3a9b7a
BLAKE2b-256 34d82acd8d05c456037af125d02d4b6b3fab0c0188aeb9f2b1f90612c32b2f22

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page