Skip to main content

Visualize data quality

Project description

Python Package

vizdataquality

This is a Python package for visualizing data quality, and includes this six-step workflow:

  1. Look at your data (is anything obviously wrong?)
  2. Watch out for special values
  3. Is any data missing?
  4. Check each variable
  5. Check combinations of variables
  6. Profile the cleaned data

Documentation

The vizdataquality documentation is hosted on Read the Docs.

Installation

We recommend installing vizdataquality in a python virtual environment or Conda environment.

To install vizdataquality, most users should run:

pip install 'vizdataquality'

Tutorials

The package includes notebooks that show you how to:

After installing vizdataquality, to follow theses tutorials interactively you will need to clone or download this repository. Then start jupyter from within it:

python -m jupyter notebook notebooks

Development

  • Documentation is built on readthedocs.com from main branch
  • PyPi pulls on creating a release on project repository on GitHub.

Notice

The vizdataquality software is released under the Apache Licence, version 2.0. See LICENCE for details.

Acknowledgements

The development of the vizdataquality software was supported by funding from the Engineering and Physical Sciences Research Council (EP/N013980/1; EP/R511717/1) and the Alan Turing Institute.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vizdataquality-1.0.0.tar.gz (30.9 kB view details)

Uploaded Source

Built Distribution

vizdataquality-1.0.0-py3-none-any.whl (33.6 kB view details)

Uploaded Python 3

File details

Details for the file vizdataquality-1.0.0.tar.gz.

File metadata

  • Download URL: vizdataquality-1.0.0.tar.gz
  • Upload date:
  • Size: 30.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.9.19

File hashes

Hashes for vizdataquality-1.0.0.tar.gz
Algorithm Hash digest
SHA256 be4d2383c5388955e1a6908cff5eeb49163a5ce9ff708369798bb0b2f9ea075b
MD5 0e72c94677097efa4f2b85a949680770
BLAKE2b-256 9d544c8e08199a215acf0271ced18c419be043d4b62be9c7728b1eb4e439599e

See more details on using hashes here.

File details

Details for the file vizdataquality-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for vizdataquality-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 396ff4137579f996049ae7f41fe1725c4052ceacfead8e4f96d269238214956b
MD5 23ccaf557fb6b21038fef3d7b7b87a83
BLAKE2b-256 6e0bb6047b9de7ebd079b3316077e27b4150231169a7d00147b7d6f206bb118e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page