Python API for Drunken Data Quality
Python API of Drunken Data Quality.
DDQ is a small library for checking constraints on Spark data structures. It can be used to assure a certain data quality, especially when continuous imports happen.
This project has been set up using PyScaffold 2.5.6. For details and usage information on PyScaffold see http://pyscaffold.readthedocs.org/.
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.