Skip to main content

re_data - data quality framework

Project description

Logo

Slack License Last commit

What is re_data?

re_data is a set of tools (dbt macros & models) that helps you make sure your data pipelines are clean & reliable. 😊

Data Preparation

re_data data preparation macros help you clean your data faster, with less code & a smaller chance of errors. Currently, we support four types of data preparation:

  • data cleaning
  • data filtering
  • data normalization
  • data validation

Data Monitoring

re_data metrics & alerts models contain information about data quality which lets you discover bad data much faster. You can:

  • use built-in metrics & extend them with your code
  • test them as regular dbt models
  • visualize them in your favourite BI tool
  • trigger external (Slack/Pagerduty/etc.) alerts based on them

Getting started

Check our docs! 📓 📓

Join re_data community on Slack (we are very responsive there)

Source code

As dbt packages currently need to be a seperate github repos, most of source code of re_data can be found here

Integrations

We support most of the main data warehouses supported by dbt. We plan to add support for Spark (now officially supported by dbt).

Integration Status
BigQuery Supported
PostgreSQL Supported
Redshift Supported
Snowflake Supported
Apache Spark Planned

License

re_data is licensed under the MIT license. See the LICENSE file for licensing information.

Contributing

We love all contributions :heart_eyes: bigger and smaller.

Check out the current list of issues here and see if you like anything from there. Also, feel welcome to join our Slack and suggest ideas or set up a live session here.

And if you got this far and like what we are building, support us! Star https://github.com/re-data/re-data on Github :star_struck:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

re_data-0.4.0.tar.gz (8.4 kB view hashes)

Uploaded Source

Built Distribution

re_data-0.4.0-py3-none-any.whl (8.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page