re_data - data quality framework
Project description
What is re_data?
re_data - is meant to help data teams fix data issues before users & CEOs would discover them 😊
re_data lets you compute various metrics about your datasets and later on:
- test
- visualize
- find anomalies in those
re_data works strictly inside your data warehouse (it's implemented in large part as a dbt package) - and is doing transformations on your tables in your data warehouse.
Getting started
Check our docs! 📓 📓
Join re_data community on Slack (we are very responsive there)
Source code
As dbt packages currently need to be a seperate github repos, most of source code of re_data can be found here
Integrations
We support most of the main data warehouses supported by dbt. We plan to add support for Spark (now officially supported by dbt).
Integration | Status | |
---|---|---|
BigQuery | Supported | |
PostgreSQL | Supported | |
Redshift | Supported | |
Snowflake | Supported | |
Apache Spark | Planned |
License
re_data is licensed under the MIT license. See the LICENSE file for licensing information.
Contributing
We love all contributions :heart_eyes: bigger and smaller.
Check out the current list of issues here and see if you like anything from there. Also, feel welcome to join our Slack and suggest ideas or set up a live session here.
And if you got this far and like what we are building, support us! Star https://github.com/re-data/re-data on Github :star_struck:
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.