Data frame support and feature traceability for `scikit-learn`.

Project description

sklearndf is an open-source library that addresses a common need with scikit-learn: transformers return numpy arrays, even when the input is a data frame. Inspecting a model, however, requires keeping track of the feature names.
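
The behaviour described above can be reproduced with plain scikit-learn and pandas. The following is a minimal sketch, assuming scikit-learn's default output configuration (no `set_output` call) and a made-up two-column data frame; the column names are purely illustrative.

```python
import pandas as pd
from sklearn.preprocessing import StandardScaler

# Illustrative data; the column names are made up for this sketch.
df = pd.DataFrame({"age": [25.0, 32.0, 47.0], "income": [40.0, 55.0, 90.0]})

# With default settings, a native scikit-learn transformer returns a numpy array.
scaled = StandardScaler().fit_transform(df)

print(type(scaled).__name__)       # the data frame has become a plain array
print(scaled.shape)                # same shape as the input
print(hasattr(scaled, "columns"))  # the column names are no longer attached
```

The values are scaled as expected, but nothing in the result says which column was `age` and which was `income`.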

To this end, sklearndf enhances scikit-learn’s estimators as follows:

  • Preserve data frame structure:

    Return data frames as results of transformations, preserving feature names as the column index.

  • Feature name tracing:

    Add estimator properties that trace a feature name back to its original input feature; this is especially useful for transformers that create new features (e.g., one-hot encoding), and for pipelines that include such transformers.

  • Easy use:

    Simply append DF to your usual scikit-learn class names to get enhanced data frame support!

License

sklearndf is licensed under Apache 2.0 as described in the LICENSE file.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sklearndf-2.0.dev3.tar.gz (127.1 kB view details)

Uploaded Source

Built Distribution

sklearndf-2.0.dev3-py3-none-any.whl (61.5 kB view details)

Uploaded Python 3

File details

Details for the file sklearndf-2.0.dev3.tar.gz.

File metadata

  • Download URL: sklearndf-2.0.dev3.tar.gz
  • Upload date:
  • Size: 127.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: python-requests/2.27.1

File hashes

Hashes for sklearndf-2.0.dev3.tar.gz
Algorithm Hash digest
SHA256 c08f6d45669a06f729e95263fdc3bfbda1346c5bb3c9d8751ce881bfa6729744
MD5 712b9066c7ca34a0bf835d645f25fe2a
BLAKE2b-256 2040c876d1bf50126fbbcbc0c33109391a43fc9f6ca8fee28480fd2a643e5bc6

File details

Details for the file sklearndf-2.0.dev3-py3-none-any.whl.

File hashes

Hashes for sklearndf-2.0.dev3-py3-none-any.whl
Algorithm Hash digest
SHA256 239a31db7bfa780bf3ddbe461efb0e9cd5366aaee80012f89fc126485dcaadde
MD5 9db17fd937ae554c6724dd5f4bcebec4
BLAKE2b-256 5e06bf7d89a63ee5c6808cbb4ee8a48872f81d7bbefa59107473b3261621868d
