simple data validation
data_check
data_check is a simple data validation tool. Write SQL queries and CSV files with the expected result sets, and data_check will run the queries and compare their results against the CSV files.
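The idea of checking a query against an expected CSV result can be sketched in a few lines of Python. This is only an illustration, not data_check's actual implementation: the helper `check_query`, the `customers` table and the in-memory SQLite database are hypothetical, and comparing all values as strings while ignoring row order is an assumption made for the example.

```python
import csv
import io
import sqlite3


def check_query(connection, sql, expected_csv):
    """Return True if the query result matches the rows in the CSV text.

    Hypothetical helper for illustration; not part of data_check.
    """
    cursor = connection.execute(sql)
    header = [column[0] for column in cursor.description]
    # Compare everything as strings, because CSV carries no type information.
    # Sorting makes the comparison independent of row order (an assumption).
    actual = sorted(tuple(str(value) for value in row) for row in cursor.fetchall())

    reader = csv.reader(io.StringIO(expected_csv))
    expected_header = next(reader)
    expected = sorted(tuple(row) for row in reader)

    return header == expected_header and actual == expected


# A throwaway in-memory database stands in for a real connection.
connection = sqlite3.connect(":memory:")
connection.execute("CREATE TABLE customers (id INTEGER, name TEXT)")
connection.executemany(
    "INSERT INTO customers VALUES (?, ?)", [(1, "Ada"), (2, "Grace")]
)

sql = "SELECT id, name FROM customers"
expected_csv = "id,name\n1,Ada\n2,Grace\n"
print(check_query(connection, sql, expected_csv))
```

With data_check itself, the query and the expected result would live in a pair of files such as some_test.sql and some_test.csv rather than in Python strings.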
data_check should work with any database that works with SQLAlchemy. Currently data_check is tested against PostgreSQL, MySQL, SQLite, Oracle and Microsoft SQL Server.
Quickstart
You need Python 3.6 or above to run data_check. The easiest way to install data_check is via pipx:
pipx install data_check
The data_check Git repository is also a sample data_check project. Clone the repository, switch to the folder and run data_check:
git clone git@github.com:andrjas/data_check.git
cd data_check
data_check
This will run the tests in the checks folder using the default connection as set in data_check.yml.
See the documentation for how to install data_check in different environments with additional database drivers, and for other ways to use data_check.
Project layout
data_check has a simple layout for projects: a single configuration file and a folder with the test files. You can also organize the test files in subfolders.
data_check.yml          # The configuration file
checks/                 # Default folder for data tests
    some_test.sql       # SQL file with the query to run against the database
    some_test.csv       # CSV file with the expected result
    subfolder/          # Tests can be nested in subfolders
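To make the layout concrete, the following sketch shows one way a tool could discover the test files: every .sql file is paired with the .csv file of the same name, and subfolders are searched recursively. The function `find_checks` and the file `nested_test` are hypothetical names for this example; how data_check discovers its tests internally is not described here.

```python
from pathlib import Path
import tempfile


def find_checks(checks_dir):
    """Yield (sql_file, csv_file) pairs, including those in subfolders.

    Hypothetical helper for illustration; not part of data_check.
    """
    for sql_file in sorted(Path(checks_dir).rglob("*.sql")):
        csv_file = sql_file.with_suffix(".csv")
        # Only SQL files with a matching CSV file count as a check here.
        if csv_file.exists():
            yield sql_file, csv_file


# Build a throwaway project layout to demonstrate the pairing.
with tempfile.TemporaryDirectory() as project:
    checks = Path(project) / "checks"
    (checks / "subfolder").mkdir(parents=True)
    for name in ("some_test", "subfolder/nested_test"):
        (checks / f"{name}.sql").write_text("SELECT 1 AS a")
        (checks / f"{name}.csv").write_text("a\n1\n")
    found = [sql.relative_to(checks).as_posix() for sql, _ in find_checks(checks)]

print(found)
```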
Documentation
See the documentation for how to set up data_check, how to create a new project, and further options.
Hashes for data_check-0.1.0-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 8d4c9d9c9bb36ad8a7c10f2f7c5cd42172e699b5d4e8481600846ca19b28a0f8
MD5 | 55561bbb617ce5c6ca0b44b404664149
BLAKE2b-256 | c99950ac95ac972514b5d975733bb726894e71bce6e8a9ae9b8f5a39bac3eea2