Data frame support and feature traceability for `scikit-learn`.

Project description

sklearndf is an open-source library that addresses a common need with scikit-learn: transformers return NumPy arrays even when their input is a data frame, so the feature names are lost. Yet keeping track of feature names is essential for inspecting a model.

To this end, sklearndf enhances scikit-learn’s estimators as follows:

  • Preserve data frame structure:

    Return data frames as results of transformations, preserving feature names as the column index.

  • Feature name tracing:

    Add estimator properties that trace each output feature name back to its original input feature; this is especially useful for transformers that create new features (e.g., one-hot encoding), and for pipelines that include such transformers.

  • Easy use:

    Append `DF` to the usual scikit-learn class name to get the data-frame-aware version of that estimator; for example, `RandomForestClassifier` becomes `RandomForestClassifierDF`.
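
To make the idea of feature name tracing concrete, here is a minimal, library-free sketch of the bookkeeping involved. It is not sklearndf's API: the function `one_hot_with_origin` and the dict-of-lists table are hypothetical stand-ins for a transformer and a data frame. They are used only to show how every output column can be mapped back to the input column it came from.

```python
from typing import Dict, List, Tuple

# Hypothetical stand-in for a data frame: column name -> column values.
Table = Dict[str, List[object]]


def one_hot_with_origin(table: Table, column: str) -> Tuple[Table, Dict[str, str]]:
    """One-hot encode `column`; also return output name -> original input name."""
    encoded: Table = {}
    origin: Dict[str, str] = {}
    for name, values in table.items():
        if name != column:
            # Columns that are passed through unchanged map to themselves.
            encoded[name] = list(values)
            origin[name] = name
            continue
        # One new indicator column per distinct category, in sorted order.
        for category in sorted(set(values), key=str):
            new_name = f"{name}_{category}"
            encoded[new_name] = [1 if v == category else 0 for v in values]
            origin[new_name] = name
    return encoded, origin


data = {"age": [31, 45, 27], "color": ["red", "blue", "red"]}
encoded, origin = one_hot_with_origin(data, "color")

print(list(encoded))  # ['age', 'color_blue', 'color_red']
print(origin)  # {'age': 'age', 'color_blue': 'color', 'color_red': 'color'}
```

With such a mapping, an importance score reported for `color_blue` or `color_red` can be attributed back to the original `color` feature, which is the kind of inspection the feature tracing described above is meant to support.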

License

sklearndf is licensed under Apache 2.0 as described in the LICENSE file.
