Skip to main content

DataFrame support and feature traceability for scikit-learn.

Project description

sklearndf is an open source library designed to address a common need with scikit-learn: the outputs of transformers are numpy arrays, even when the input is a data frame. However, to inspect a model it is essential to keep track of the feature names.

To this end, sklearndf enhances scikit-learn’s estimators as follows:

  • Preserve data frame structure:

    Return data frames as results of transformations, preserving feature names as the column index.

  • Feature name tracing:

    Add additional estimator properties to enable tracing a feature name back to its original input feature; this is especially useful for transformers that create new features (e.g., one-hot encode), and for pipelines that include such transformers.

  • Easy use:

    Simply append DF at the end of your usual scikit-learn class names to get enhanced data frame support!

pypi conda python_versions code_style made_with_sphinx_doc license_badge

License

sklearndf is licensed under Apache 2.0 as described in the LICENSE file.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sklearndf-1.2.0.tar.gz (114.9 kB view details)

Uploaded Source

Built Distribution

sklearndf-1.2.0-py2.py3-none-any.whl (49.6 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file sklearndf-1.2.0.tar.gz.

File metadata

  • Download URL: sklearndf-1.2.0.tar.gz
  • Upload date:
  • Size: 114.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.25.1

File hashes

Hashes for sklearndf-1.2.0.tar.gz
Algorithm Hash digest
SHA256 880a6fb67d8cd98ad94552dac5b155375a4f7b2ebe8f4409cb3784a863cddd9f
MD5 50bb7c0edd15d7d2fde722abbf0cd185
BLAKE2b-256 b8864cc781762a23f9bce277ee320ab0fefbaf2bd7e129a8aa5d8bbb3d0e8a3b

See more details on using hashes here.

File details

Details for the file sklearndf-1.2.0-py2.py3-none-any.whl.

File metadata

  • Download URL: sklearndf-1.2.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 49.6 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.25.1

File hashes

Hashes for sklearndf-1.2.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 0d8d7fc78d5db10b0a5c24460f6ea05e250483c16157782b3c98b616473b1fa0
MD5 e3175fb55be18b124b5397bd249f26e6
BLAKE2b-256 53a5ca6b903b662b744619ff4a66d6d1b5a084810493eec183d27c4c11b0cfc5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page