Skip to main content

A project to ensure the data quality using python

Project description

PyQualitas

This project aims towards developing a python library ensure quality of the data. This project is an inspiration from deequ and dataflare which are also aimed towards the quality of the data.

Requirements:

  1. Pyspark - Version 3.3.0
  2. Pandas - Version 1.5.0
  3. Jinja2 - Version 3.1.2
  4. Slack-SDK - Version 3.19.3
  5. PyMSTeams - Version 0.2.2

Installation:

The package can be installed as follows:

"pip install pyQualitas"

The test version of this package can be installed as follows:

"pip install -i https://test.pypi.org/simple/ pyQualitas"

Use Cases:

The main agenda behind creating this library is to help the QA Engineers to ensure quality of the data. Given the volume of the data & the frequency of the releases happening in the industry, there is an enormous responsibility on the Quality Assurance team to ensure & sign-off the quality of the data generated by the application.

It is very hard to achieve this using manual testing and scheduling an automated validation helps achieve the timelines and ensure a high quality of the data with less efforts.

There are various tests in this library that would come in handy during the regression testing process. Since the project is implemented in Python, the learning curve is short when compared to the libraries that are available in Scala.

The documentation can be found in the following link:

https://github.com/IamVenkatesh/pyQualitas/wiki

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyqualitas-1.0.9.tar.gz (12.7 kB view details)

Uploaded Source

Built Distribution

pyQualitas-1.0.9-py3-none-any.whl (13.6 kB view details)

Uploaded Python 3

File details

Details for the file pyqualitas-1.0.9.tar.gz.

File metadata

  • Download URL: pyqualitas-1.0.9.tar.gz
  • Upload date:
  • Size: 12.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for pyqualitas-1.0.9.tar.gz
Algorithm Hash digest
SHA256 16b8ed7a9139e656fb1905341c8bd2b7184a1434a1a54acaa01f85beb7caba47
MD5 e97076b8d35d671780e3d798c36065dc
BLAKE2b-256 1c21ba00767476b0f651920e26465c6cc18d6188b273c61768ffbe4493ccef1d

See more details on using hashes here.

File details

Details for the file pyQualitas-1.0.9-py3-none-any.whl.

File metadata

  • Download URL: pyQualitas-1.0.9-py3-none-any.whl
  • Upload date:
  • Size: 13.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.12.3

File hashes

Hashes for pyQualitas-1.0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 e50b54dfd67e72571ca89f8d2155cc93c9f3a8d877a9ee45e4a535f83dcb80e9
MD5 fa817961fa26f32dc03add4b9ee2c3de
BLAKE2b-256 649a802184429823611325e1be3cdc0b16ad003c3b976bd13c8a14d9dc36694a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page