A project to ensure the data quality using python
Project description
PyQualitas
This project aims towards developing a python library ensure quality of the data. This project is an inspiration from deequ and dataflare which are also aimed towards the quality of the data.
Requirements:
- Pyspark - Version 3.3.0
- Pandas - Version 1.5.0
- Jinja2 - Version 3.1.2
- Slack-SDK - Version 3.19.3
- PyMSTeams - Version 0.2.2
Installation:
The package can be installed as follows:
"pip install pyQualitas"
The test version of this package can be installed as follows:
"pip install -i https://test.pypi.org/simple/ pyQualitas"
Use Cases:
The main agenda behind creating this library is to help the QA Engineers to ensure quality of the data. Given the volume of the data & the frequency of the releases happening in the industry, there is an enormous responsibility on the Quality Assurance team to ensure & sign-off the quality of the data generated by the application.
It is very hard to achieve this using manual testing and scheduling an automated validation helps achieve the timelines and ensure a high quality of the data with less efforts.
There are various tests in this library that would come in handy during the regression testing process. Since the project is implemented in Python, the learning curve is short when compared to the libraries that are available in Scala.
The documentation can be found in the following link:
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pyqualitas-1.0.9.tar.gz
.
File metadata
- Download URL: pyqualitas-1.0.9.tar.gz
- Upload date:
- Size: 12.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 16b8ed7a9139e656fb1905341c8bd2b7184a1434a1a54acaa01f85beb7caba47 |
|
MD5 | e97076b8d35d671780e3d798c36065dc |
|
BLAKE2b-256 | 1c21ba00767476b0f651920e26465c6cc18d6188b273c61768ffbe4493ccef1d |
File details
Details for the file pyQualitas-1.0.9-py3-none-any.whl
.
File metadata
- Download URL: pyQualitas-1.0.9-py3-none-any.whl
- Upload date:
- Size: 13.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e50b54dfd67e72571ca89f8d2155cc93c9f3a8d877a9ee45e4a535f83dcb80e9 |
|
MD5 | fa817961fa26f32dc03add4b9ee2c3de |
|
BLAKE2b-256 | 649a802184429823611325e1be3cdc0b16ad003c3b976bd13c8a14d9dc36694a |