Skip to main content

Data frame support and feature traceability for `scikit-learn`.

Project description

sklearndf is an open source library designed to address a common need with scikit-learn: the outputs of transformers are numpy arrays, even when the input is a data frame. However, to inspect a model it is essential to keep track of the feature names.

To this end, sklearndf enhances scikit-learn’s estimators as follows:

  • Preserve data frame structure:

    Return data frames as results of transformations, preserving feature names as the column index.

  • Feature name tracing:

    Add additional estimator properties to enable tracing a feature name back to its original input feature; this is especially useful for transformers that create new features (e.g., one-hot encode), and for pipelines that include such transformers.

  • Easy use:

    Simply append DF at the end of your usual scikit-learn class names to get enhanced data frame support!

pypi conda python_versions code_style made_with_sphinx_doc license_badge

License

sklearndf is licensed under Apache 2.0 as described in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sklearndf-2.2rc1.tar.gz (152.1 kB view details)

Uploaded Source

Built Distribution

sklearndf-2.2rc1-py3-none-any.whl (72.5 kB view details)

Uploaded Python 3

File details

Details for the file sklearndf-2.2rc1.tar.gz.

File metadata

  • Download URL: sklearndf-2.2rc1.tar.gz
  • Upload date:
  • Size: 152.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.29.0

File hashes

Hashes for sklearndf-2.2rc1.tar.gz
Algorithm Hash digest
SHA256 2021902ccedae3cfa3af6adf10329301df5b84e1b006b77663666216607ec017
MD5 aa42b13a94e51c64024b4edeee554500
BLAKE2b-256 c9872a61895fe40abc7eebfb5db7558cde9e253144d157bfa4dfac73622f229e

See more details on using hashes here.

File details

Details for the file sklearndf-2.2rc1-py3-none-any.whl.

File metadata

  • Download URL: sklearndf-2.2rc1-py3-none-any.whl
  • Upload date:
  • Size: 72.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.29.0

File hashes

Hashes for sklearndf-2.2rc1-py3-none-any.whl
Algorithm Hash digest
SHA256 cbc9f9dd4a4096969d6d96c86eac7f43731ae347eff7894e3a4b5a685fcede2d
MD5 aa1368dbc14d1a34c715f3ac4c2d8dd7
BLAKE2b-256 fe0b6228136cf802cd1e091c51cb91a8af79fde6128bf927dd3a580ae331b46d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page