Skip to main content

Mercury's monitoring is a library to perform monitoring testing on models and/or datasets..

Project description

mercury-monitoring

mercury-monitoring is a library to monitor data and model drift. It contains a set of utilities which you can use for detecting in advance whether the statistical properties of your data have changed and whether your model's performance is getting worse.

Among others, you can check things like:

  • Changes in your entire data distribution (P(X))
  • Changes in your marginals (i.e. individial features P(Xi))
  • Estimated performance of your model when you don't have access to the true labels

and take the actions you consider pertinent (e.g. retrain your model on more recent data, generate alerts, etc.).

Most of the classes for detecting drift have a calculate_drift() method which will yield the predicted result on whether drift has been detected or not.

Usage example

from mercury.monitoring.drift.domain_classifier_drift import DomainClassifierDrift

# Create example datasets
X_src = np.array([np.random.normal(0, 1, 1000), np.random.normal(0, 1, 1000)]).T
X_target = np.array([np.random.normal(0, 1, 500) + 2, np.random.normal(0, 1, 500)]).T

# Calculate drift
drift_detector = DomainClassifierDrift(
    X_src,                    # Source dataset
    X_target,                 # Target dataset (e.g. newer batch of data)
    features=["f1", "f2"], 
    p_val=0.01, 
    test_size=0.3, 
    n_runs=10
)

# Get results
drift_metrics = drift_detector.calculate_drift()

# Print the results
print("Drift Score: ", drift_metrics["score"])
print("p_val: ", drift_metrics["p_val"])
print("Is drift detected? ", drift_metrics["drift_detected"])

# Plot drift scores per feature
drift_detector.plot_feature_drift_scores(figsize=(8,4))

Feel free to explore more of the drift detectors and use the most appropiate for your use case.

Documentation

We encourage you checking the documentation of this package at: https://bbva.github.io/mercury-monitoring/ and taking a look at the available notebook tutorials.

Mercury project at BBVA

Mercury is a collaborative library that was developed by the Advanced Analytics community at BBVA. Originally, it was created as an InnerSource project but after some time, we decided to release certain parts of the project as Open Source. That's the case with the mercury-monitoring package.

If you're interested in learning more about the Mercury project, we recommend reading this blog post from www.bbvaaifactory.com

User installation

The easiest way to install mercury-monitoring is using pip:

pip install -U mercury-monitoring

Help and support

This library is currently maintained by a dedicated team of data scientists and machine learning engineers from BBVA AI Factory. In case you find any bug, have an idea or request feel free to open a ticket on the Issues section.

Email

mercury.group@bbva.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mercury-monitoring-0.0.2.tar.gz (34.4 kB view details)

Uploaded Source

Built Distribution

mercury_monitoring-0.0.2-py3-none-any.whl (40.6 kB view details)

Uploaded Python 3

File details

Details for the file mercury-monitoring-0.0.2.tar.gz.

File metadata

  • Download URL: mercury-monitoring-0.0.2.tar.gz
  • Upload date:
  • Size: 34.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.16

File hashes

Hashes for mercury-monitoring-0.0.2.tar.gz
Algorithm Hash digest
SHA256 e6b747d699e77eddfc996400ac14df297c0255e74e6d0c1425c1a99e57ca49f9
MD5 238bd261955169588e41f1e81acc2485
BLAKE2b-256 f445c09d9457508ba405278a876258b38df9810483ce90a82e336e8417657556

See more details on using hashes here.

File details

Details for the file mercury_monitoring-0.0.2-py3-none-any.whl.

File metadata

File hashes

Hashes for mercury_monitoring-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 4ff45f05340046a419c9133d5b28b0d35285d63355fa21f2e92ebaefd7de58b6
MD5 16f51a26fc6dd66688c3b387cdbb2feb
BLAKE2b-256 d95dc8791644667e2b3a9aca3115e42199b80515bb46f14ec403c8d80fe818a1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page