Skip to main content

How do we measure the degradation of a machine learning process? Why does the performance of our predictive models decrease? Maybe it is that a data source has changed (one or more variables) or maybe what changes is the relationship of these variables with the target we want to predict. `pydrift` tries to facilitate this task to the data scientist, performing this kind of checks and somehow measuring that degradation.

Project description

Welcome to pydrift 0.1.6

How do we measure the degradation of a machine learning process? Why does the performance of our predictive models decrease? Maybe it is that a data source has changed (one or more variables) or maybe what changes is the relationship of these variables with the target we want to predict. pydrift tries to facilitate this task to the data scientist, performing this kind of checks and somehow measuring that degradation.

Install pydrift :v:

pip install pydrift

Structure :triangular_ruler:

This is intended to be user-friendly. pydrift is divided into DataDriftChecker and ModelDriftChecker:

  • DataDriftChecker: search for drift in the variables, check if their distributions have changed
  • ModelDriftChecker: search for drift in the relationship of the variables with the target, checks that the model behaves the same way for both data sets

Both can use a discriminative model (defined by parent class DriftChecker), where the target would be binary in belonging to one of the two sets, 1 if it is the left one and 0 on the contrary. If the model is not able to differentiate given the two sets, there is no difference!

Class inheritance

Usage :book:

You can take a look to the notebooks folder where you can find one example for DataDriftChecker and other one for ModelDriftChecker.

For more info check the docs available here

More demos and code improvements will coming, if you want to contribute you can contact me, in the future I will upload a file to explain how this would work.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pydrift-0.1.6.tar.gz (9.8 kB view details)

Uploaded Source

Built Distribution

pydrift-0.1.6-py3-none-any.whl (11.2 kB view details)

Uploaded Python 3

File details

Details for the file pydrift-0.1.6.tar.gz.

File metadata

  • Download URL: pydrift-0.1.6.tar.gz
  • Upload date:
  • Size: 9.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/0.12.17 CPython/3.6.8 Linux/4.15.0-1052-aws

File hashes

Hashes for pydrift-0.1.6.tar.gz
Algorithm Hash digest
SHA256 89228405d2002aba7db47a28925474261dbbad6da25e8aff21d98d1d4dacdbed
MD5 9ecf9ca903ed8c2d00527730b78e9c93
BLAKE2b-256 9dd8da72edf144abbfa3b1ecf38e09aa3aa671931295431ecc94e78712207088

See more details on using hashes here.

File details

Details for the file pydrift-0.1.6-py3-none-any.whl.

File metadata

  • Download URL: pydrift-0.1.6-py3-none-any.whl
  • Upload date:
  • Size: 11.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/0.12.17 CPython/3.6.8 Linux/4.15.0-1052-aws

File hashes

Hashes for pydrift-0.1.6-py3-none-any.whl
Algorithm Hash digest
SHA256 f67e223a911b0eca04dc08e7e5ad385c455cbb06cba4f35a36b43639ee7be0be
MD5 fddd4fd1c535b862aeb3a6aae3d7cdd0
BLAKE2b-256 287878ecb1ca0a0b8c2ef4b8ebeb7788411e7d70e897ff25e94de7d6cd7cd18e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page