DataFrame support and feature traceability for scikit-learn.

Project description

sklearndf is an open-source library that addresses a common problem with scikit-learn: transformers return numpy arrays even when the input is a data frame, so the feature names are lost. Keeping track of feature names, however, is essential for inspecting a model.
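The problem can be reproduced with plain scikit-learn and pandas. The snippet below is a minimal sketch using scikit-learn's default output configuration; the column names and values are made up for illustration, and the manual re-wrapping at the end is a generic workaround, not sklearndf's mechanism.

```python
import numpy as np
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Hypothetical input data; column names are illustrative only.
df = pd.DataFrame(
    {"height_cm": [160.0, 175.0, 190.0], "weight_kg": [55.0, 70.0, 85.0]}
)

# With scikit-learn's default configuration, the transformer returns a
# plain numpy array: the values are scaled, but the column names are gone.
scaled = StandardScaler().fit_transform(df)
print(type(scaled).__name__, scaled.shape)

# Manual workaround: re-attach the column names and row index by hand.
# This only works when the transformer keeps columns one-to-one.
restored = pd.DataFrame(scaled, columns=df.columns, index=df.index)
print(restored.columns.tolist())
```

Re-wrapping by hand stops working as soon as a transformer adds, drops, or reorders columns, which is the case the feature list below is concerned with.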

To this end, sklearndf enhances scikit-learn’s estimators as follows:

  • Preserve data frame structure:

    Return data frames as results of transformations, preserving feature names as the column index.

  • Feature name tracing:

    Add estimator properties that trace each feature name back to its original input feature; this is especially useful for transformers that create new features (e.g., one-hot encoding), and for pipelines that include such transformers.

  • Easy use:

    Simply append DF at the end of your usual scikit-learn class names to get enhanced data frame support!
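To make the idea of feature name tracing concrete, here is a small sketch in plain pandas. It does not use sklearndf and is not how sklearndf implements tracing; the data, the `origin` series, and the "<original>_<category>" naming convention are assumptions made for this illustration.

```python
import pandas as pd

# Hypothetical input data with one numeric and one categorical feature.
df = pd.DataFrame({"age": [25, 32, 47], "city": ["Paris", "Rome", "Paris"]})

# One-hot encode the categorical column; the result stays a data frame,
# with one new column per category.
encoded = pd.get_dummies(df, columns=["city"], prefix_sep="_")

# Map each output column back to the input column it was derived from.
# Assumption for this sketch: derived columns are named
# "<original>_<category>", and no input column name is a prefix of another.
origin = pd.Series(
    {
        col: next(
            src
            for src in df.columns
            if col == src or col.startswith(src + "_")
        )
        for col in encoded.columns
    },
    name="original_feature",
)
print(origin)
```

With a mapping like `origin`, importances computed on the encoded columns can be grouped by original feature, for example with `groupby` on the mapped values.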

License

sklearndf is licensed under Apache 2.0 as described in the LICENSE file.

Download files

Source Distribution

sklearndf-1.2.0rc0.tar.gz (114.9 kB), source

Built Distribution

sklearndf-1.2.0rc0-py2.py3-none-any.whl (49.6 kB), Python 2 and Python 3

File details

Details for the file sklearndf-1.2.0rc0.tar.gz.

File metadata

  • Download URL: sklearndf-1.2.0rc0.tar.gz
  • Size: 114.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.25.1

File hashes

Hashes for sklearndf-1.2.0rc0.tar.gz
Algorithm    Hash digest
SHA256       b8f0b44aa9a0ee81a1a7a7e2ff2c1dce7c2a408a250ca96d8b009c24e20bf844
MD5          9d8846de87405e84ab12532cd4b7cd01
BLAKE2b-256  47700b8f842acb5620f7a44b9b298ef2ca16c1eec2d58d094175654828ca17dc

File details

Details for the file sklearndf-1.2.0rc0-py2.py3-none-any.whl.

File hashes

Hashes for sklearndf-1.2.0rc0-py2.py3-none-any.whl
Algorithm    Hash digest
SHA256       700cdd7114f6092b9b478d791363b6934b64173fb91950c6c5b872e134623c64
MD5          e78790ec5de25a1d10d21b3979f0ee46
BLAKE2b-256  5cfd127603074a1be0fe4cb46ceca27188ceae4c22cd651471e50835bfd861d1
