Skip to main content

A project to ensure the data quality using python

Project description

PyQualitas

This project aims towards developing a python library ensure quality of the data. This project is an inspiration from deequ and dataflare which are also aimed towards the quality of the data.

Requirements:

  1. Pyspark - Version 3.5.5 or greater
  2. Pandas - Version 2.2.3 or greater
  3. Jinja2 - Version 3.1.6 or greater
  4. Slack-SDK - Version 3.35.0 or greater
  5. PyMSTeams - Version 0.2.5 or greater

Installation:

The package can be installed as follows:

"pip install pyQualitas"

The test version of this package can be installed as follows:

"pip install -i https://test.pypi.org/simple/ pyQualitas"

Use Cases:

The main agenda behind creating this library is to help the QA Engineers to ensure quality of the data. Given the volume of the data & the frequency of the releases happening in the industry, there is an enormous responsibility on the Quality Assurance team to ensure & sign-off the quality of the data generated by the application.

It is very hard to achieve this using manual testing and scheduling an automated validation helps achieve the timelines and ensure a high quality of the data with less efforts.

There are various tests in this library that would come in handy during the regression testing process. Since the project is implemented in Python, the learning curve is short when compared to the libraries that are available in Scala.

The documentation can be found in the following link:

https://github.com/IamVenkatesh/pyQualitas/wiki

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyqualitas-2.0.1.tar.gz (12.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyqualitas-2.0.1-py3-none-any.whl (13.7 kB view details)

Uploaded Python 3

File details

Details for the file pyqualitas-2.0.1.tar.gz.

File metadata

  • Download URL: pyqualitas-2.0.1.tar.gz
  • Upload date:
  • Size: 12.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pyqualitas-2.0.1.tar.gz
Algorithm Hash digest
SHA256 a4f361f040ac08c676829563fc460a77f3e4020d05e9d537669cba36e8b468b3
MD5 bf5f8687d246c08934604d8cf81f7218
BLAKE2b-256 0e5d7e470e7853e52a33234317aad483685338dc2442f431a1a67aa97614465b

See more details on using hashes here.

File details

Details for the file pyqualitas-2.0.1-py3-none-any.whl.

File metadata

  • Download URL: pyqualitas-2.0.1-py3-none-any.whl
  • Upload date:
  • Size: 13.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for pyqualitas-2.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 68979acb9a9df572044fa77ca945becc66b465439312141e27304e472beb8123
MD5 4c739ece519ff7fa7cd2877434befd57
BLAKE2b-256 3d8eb06d602a4ea51cee09289778dd559352dc7bafce7c9e14f91ba4a51cdc62

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page