Skip to main content

Data frame support and feature traceability for `scikit-learn`.

Project description

sklearndf is an open source library designed to address a common need with scikit-learn: the outputs of transformers are numpy arrays, even when the input is a data frame. However, to inspect a model it is essential to keep track of the feature names.

To this end, sklearndf enhances scikit-learn’s estimators as follows:

  • Preserve data frame structure:

    Return data frames as results of transformations, preserving feature names as the column index.

  • Feature name tracing:

    Add additional estimator properties to enable tracing a feature name back to its original input feature; this is especially useful for transformers that create new features (e.g., one-hot encode), and for pipelines that include such transformers.

  • Easy use:

    Simply append DF at the end of your usual scikit-learn class names to get enhanced data frame support!

pypi conda python_versions code_style made_with_sphinx_doc license_badge

License

sklearndf is licensed under Apache 2.0 as described in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sklearndf-2.1rc0.tar.gz (145.0 kB view details)

Uploaded Source

Built Distribution

sklearndf-2.1rc0-py3-none-any.whl (68.3 kB view details)

Uploaded Python 3

File details

Details for the file sklearndf-2.1rc0.tar.gz.

File metadata

  • Download URL: sklearndf-2.1rc0.tar.gz
  • Upload date:
  • Size: 145.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.28.1

File hashes

Hashes for sklearndf-2.1rc0.tar.gz
Algorithm Hash digest
SHA256 a6f9548c6d0984b5dd6e89c39eb0fa49baaaa2d27e945f4494d113e6f2c2d55e
MD5 294bb5abbd39cbbeb66fd78130941e3e
BLAKE2b-256 6b003f50e7c7059878dcbd16ce7bd8da397d5def640f0fbb1c4ccc757ad0f22c

See more details on using hashes here.

File details

Details for the file sklearndf-2.1rc0-py3-none-any.whl.

File metadata

  • Download URL: sklearndf-2.1rc0-py3-none-any.whl
  • Upload date:
  • Size: 68.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.28.1

File hashes

Hashes for sklearndf-2.1rc0-py3-none-any.whl
Algorithm Hash digest
SHA256 c43e5b17d983ca7c2b27730fa249bd43559e5c7c87f20d82b7248c778190ca43
MD5 0b09fd7fdfd3b058bf2cf0091bca1985
BLAKE2b-256 2da21ae3f2abf17bdcd0c1b5e22c3476b6e931b9ae9186bbbcf13b87832f58ee

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page