Skip to main content

DataFrame support and feature traceability for scikit-learn.

Project description

sklearndf is an open source library designed to address a common need with scikit-learn: the outputs of transformers are numpy arrays, even when the input is a data frame. However, to inspect a model it is essential to keep track of the feature names.

To this end, sklearndf enhances scikit-learn’s estimators as follows:

  • Preserve data frame structure:

    Return data frames as results of transformations, preserving feature names as the column index.

  • Feature name tracing:

    Add additional estimator properties to enable tracing a feature name back to its original input feature; this is especially useful for transformers that create new features (e.g., one-hot encode), and for pipelines that include such transformers.

  • Easy use:

    Simply append DF at the end of your usual scikit-learn class names to get enhanced data frame support!

pypi conda python_versions code_style made_with_sphinx_doc license_badge

License

sklearndf is licensed under Apache 2.0 as described in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sklearndf-1.1.0rc0.tar.gz (114.8 kB view details)

Uploaded Source

Built Distribution

sklearndf-1.1.0rc0-py3-none-any.whl (49.4 kB view details)

Uploaded Python 3

File details

Details for the file sklearndf-1.1.0rc0.tar.gz.

File metadata

  • Download URL: sklearndf-1.1.0rc0.tar.gz
  • Upload date:
  • Size: 114.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.25.1

File hashes

Hashes for sklearndf-1.1.0rc0.tar.gz
Algorithm Hash digest
SHA256 3dbcf8786096c9728c3ea172803f20f389311c099b79793cd5d76e1d567ce592
MD5 9eb920ff14edefedeef65fbf8dad5755
BLAKE2b-256 21765dea285b6e741859464b1653ab55ecdb79f9246d2606a8cec5762389b500

See more details on using hashes here.

File details

Details for the file sklearndf-1.1.0rc0-py3-none-any.whl.

File metadata

File hashes

Hashes for sklearndf-1.1.0rc0-py3-none-any.whl
Algorithm Hash digest
SHA256 c32df75b2403f24e8df1d0ea44f949218d38008ffad6035b58b8c12bab976af3
MD5 501ac738f7d5de6655d5167f20b89b22
BLAKE2b-256 6e853b710569020656e808722f6bb5b2d87b25cab738d1e86b8cb605959a970a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page