re_data - data quality framework
Project description
re_data
re_data is a data quality framework. It lets you do queries similar to those:
select * from anomalies_in_row_counts;
select * from recent_schema_changes;
select * from all_tables_freshness order by last_update_time;
select * from daily_null_percent where table = 'X' and col = 'Y';
in your Snowflake, Redshift, BigQuery, Postgres DB.
Build as dbt-package & optional python lib.
It lets you know what's happening in your data.
And you can visualize it, any way you want in your favorite BI tool.
Getting started
Check out docs :notebook_with_decorative_cover: :notebook_with_decorative_cover:
Source code
As dbt packages, currenlty need to be a seperate github repos - most of source code of re_data is here
Community
Join Slack for questions about using re_data and discussion with people making it :slightly_smiling_face:
Integrations
We support almost all of the main data warehouses supported by dbt. We plan to add support for Spark (now officially supported by dbt).
Integration | Status | |
---|---|---|
BigQuery | Supported | |
PostgreSQL | Supported | |
Redshift | Supported | |
Snowflake | Supported | |
Apache Spark | Planned |
License
re_data is licensed under the MIT license. See the LICENSE file for licensing information.
Contributing
We love all contributions :heart_eyes: bigger and smaller.
Check out the current list of issues here and see if you like anything from there. Also, feel welcome to join our Slack and suggest ideas or set up a live session here.
And if you got this far and like what we are building, support us! Star https://github.com/re-data/re-data on Github :star_struck:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for re_data-0.2.0a0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | a75c67bc79c2465fffd8e2d22d2986869652079e7e6c0f9dc85f3f6b3065bf79 |
|
MD5 | 6d321d9792badd013b8c717813c5f8da |
|
BLAKE2b-256 | 1997cdad11d220edfddea2614894bfae13c7c15da905ff3749500de35736e832 |