
Profile and monitor your ML data pipeline end-to-end

Project description


The open standard for data logging

Documentation | Slack Community | Python Quickstart | WhyLabs Quickstart


What is whylogs

whylogs is an open-source library for logging any kind of data. With whylogs, users can generate summaries of their datasets (called whylogs profiles), which they can use to:

  1. Track changes in their dataset
  2. Create data constraints to know whether their data looks the way it should
  3. Quickly visualize key summary statistics about their datasets

These three capabilities support a variety of use cases for data scientists, machine learning engineers, and data engineers:

  • Detect data drift in model input features
  • Detect training-serving skew, concept drift, and model performance degradation
  • Validate data quality in model inputs or in a data pipeline
  • Perform exploratory data analysis of massive datasets
  • Track data distributions & data quality for ML experiments
  • Enable data auditing and governance across the organization
  • Standardize data documentation practices across the organization
  • And more
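
To make the idea of a profile concrete, here is a conceptual sketch in plain Python. It is not how whylogs computes or stores profiles; the helper names (`summarize`, `violates`, `mean_shift`), the chosen statistics, and the thresholds are all assumptions made for illustration. It reduces a column to a small summary, checks that summary against simple constraints, and compares two summaries to flag a shift.

```python
from statistics import fmean


def summarize(values):
    """Reduce a column to a few statistics (a toy stand-in for a profile)."""
    present = [v for v in values if v is not None]
    return {
        "count": len(values),
        "nulls": len(values) - len(present),
        "min": min(present),
        "max": max(present),
        "mean": fmean(present),
    }


def violates(summary, *, min_allowed, max_null_fraction):
    """Return the names of the hypothetical constraints the summary breaks."""
    failures = []
    if summary["min"] < min_allowed:
        failures.append("min_allowed")
    if summary["nulls"] / summary["count"] > max_null_fraction:
        failures.append("max_null_fraction")
    return failures


def mean_shift(reference, current):
    """Relative change in the mean, a crude drift signal.

    Assumes the reference mean is non-zero.
    """
    return abs(current["mean"] - reference["mean"]) / abs(reference["mean"])


# Made-up example data: a training column and the same column at serving time.
training = summarize([20, 22, 21, 19, 23])
serving = summarize([30, 31, None, 29, 34])

print(violates(serving, min_allowed=0, max_null_fraction=0.1))
print(round(mean_shift(training, serving), 3))
```

Only the summaries are compared here, never the raw rows, which is the property that makes this style of monitoring practical for large datasets.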

Quickstart

Install whylogs using the pip package manager in a terminal by running:

pip install whylogs

Then log data in Python with a single call:

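import whylogs as why
import pandas as pd

# Load the dataset to profile (replace the placeholder path with your own file).
df = pd.read_csv("path/to/file.csv")

# Profile the DataFrame; `results` holds the resulting whylogs profile.
results = why.log(df)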

And voilà, you now have a whylogs profile. To learn more about what a whylogs profile is and what you can do with it, check out our docs and our examples.
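
To see what a profile contains, the result of `why.log` can be converted into a table. The following is a minimal sketch, assuming whylogs 1.x and pandas are installed. `profile_summary` is a hypothetical helper name introduced here, and the exact set of metric columns may differ between whylogs versions.

```python
def profile_summary(columns):
    """Profile a small in-memory dataset and return its per-column summary table.

    `columns` maps column names to lists of values. The imports are local so
    that the function can be defined even where whylogs is not installed.
    """
    import pandas as pd
    import whylogs as why

    df = pd.DataFrame(columns)
    results = why.log(df)      # same call as in the quickstart
    view = results.view()      # view over the logged profile
    return view.to_pandas()    # one row per column, one column per metric
```

For example, `profile_summary({"a": [1, 2, 3], "b": ["x", "y", "z"]})` should return a pandas DataFrame indexed by column name (`a`, `b`), with metric columns such as `counts/n`.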


Download files


Source Distribution

whylogs-1.6.3.dev3.tar.gz (1.8 MB)

Uploaded Source

Built Distribution

whylogs-1.6.3.dev3-py3-none-any.whl (1.9 MB)

Uploaded Python 3

File details

Details for the file whylogs-1.6.3.dev3.tar.gz.

File metadata

  • Download URL: whylogs-1.6.3.dev3.tar.gz
  • Upload date:
  • Size: 1.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.12.3 Linux/6.8.0-48-generic

File hashes

Hashes for whylogs-1.6.3.dev3.tar.gz
Algorithm    Hash digest
SHA256       9f56b7c4527289c4e44382a5deaed5cdb1269eca67617910baf0dfad51207196
MD5          4eafc7767485aba693fb1b45a83a4d23
BLAKE2b-256  555b2893a32d3a5a6df26b7d4e755669616dd34f20e991f4c3f553c09ceaf2f2

See more details on using hashes here.

File details

Details for the file whylogs-1.6.3.dev3-py3-none-any.whl.

File metadata

  • Download URL: whylogs-1.6.3.dev3-py3-none-any.whl
  • Upload date:
  • Size: 1.9 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.12.3 Linux/6.8.0-48-generic

File hashes

Hashes for whylogs-1.6.3.dev3-py3-none-any.whl
Algorithm    Hash digest
SHA256       2a6dba6c8156538a426961ed4e0d2a5d4a154890393ae5bb3555e39cdaa72f0b
MD5          e7b4103068cdac866898f3add4ff2dcd
BLAKE2b-256  0a80656a4da188099b33be4393b5f2bba24ca10e8ca314f770d3ac1f5dd15ccb
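
The SHA256 digests listed for the two files can be checked against a downloaded copy before installing it. The sketch below uses only the Python standard library. The helper names `sha256_of` and `matches_published_hash` are hypothetical, and the local file path in the usage note is an assumption about where the download was saved.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file listings above.
EXPECTED_SHA256 = {
    "whylogs-1.6.3.dev3.tar.gz":
        "9f56b7c4527289c4e44382a5deaed5cdb1269eca67617910baf0dfad51207196",
    "whylogs-1.6.3.dev3-py3-none-any.whl":
        "2a6dba6c8156538a426961ed4e0d2a5d4a154890393ae5bb3555e39cdaa72f0b",
}


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected_hex.lower()
```

For a file saved in the current directory, `matches_published_hash("whylogs-1.6.3.dev3.tar.gz", EXPECTED_SHA256["whylogs-1.6.3.dev3.tar.gz"])` should return `True` for an intact download. pip can also enforce hashes itself through its hash-checking mode (`--require-hashes` with `--hash=sha256:...` entries in a requirements file).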

