Skip to main content

PiperRider CLI

Project description

PipeRider

PipeRider: Data Reliability Automated

What is PipeRider?

PipeRider is an open-source toolkit for detecting data issues across pipelines that work with CI systems for continuous data quality assessment. PipeRider makes it easy for teams to have visibility about how data are being tested, and ensure errors are caught before they cause outages of downstream data applications like business intelligence dashboards or ML systems.

Why PipeRider?

Ensuring consistent quality of data used to be difficult. Missing values, schema changes and data drift (to name just a few) could be introduced to your data at any time. Without effective data quality tools, these errors will affect downstream operations and result in countless lost hours of debugging and missed revenue opportunities from unexpected downtime. PipeRider allows you to define the shape of your data once, and then use the data checking functionality to alert you to changes in your data quality.

ci-tests release pipy python downloads license InfuseAI Discord Invite

Learn More

PipeRider Resources Description
Documentation PipeRider Main Doc Site
Sample_Project Sample Project with with sqlite
dbt_Sample_Project Sample Project with dbt
Roadmap PipeRider Roadmap

Key Features

Instant quality assessment in HTML report

Example of single run report

Report Comparison

Example of comparison report

Extensible custom assertions

Refer to custom assertions

Works with existing dbt projects

Automatic Test Recommendations (Coming Soon)

Getting started

Install PipeRider

pip install piperider

By default, PipeRider supports built-in SQLite connector, extra connectors are available:

connectors install
snowflake pip install 'piperider[snowflake]'
postgres pip install 'piperider[postgres]'

Use comma to install multiple connectors in one line:

pip install 'piperider[postgres,snowflake]'

Attach PipeRider to a dbt project

piperider_init

This command creates /.piperider under a dbt project root and generates necessary configurations.

Scan data quality from models

piperider_run

This command scans the models from data sources and creates assessment results in /.piperider/output

Generate reports

piperider_report

generate a static HTML report under the current path.

Generate comparison view

piperider_compare

The generated report in HTML will be placed in the path shown in the console.

Get involved

Contributions

We welcome contributions. See the Set up dev environment and the Contributing guildline to get started.

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

piperider-nightly-0.3.0.20220703.tar.gz (619.2 kB view details)

Uploaded Source

Built Distribution

piperider_nightly-0.3.0.20220703-py3-none-any.whl (672.6 kB view details)

Uploaded Python 3

File details

Details for the file piperider-nightly-0.3.0.20220703.tar.gz.

File metadata

File hashes

Hashes for piperider-nightly-0.3.0.20220703.tar.gz
Algorithm Hash digest
SHA256 1ee2bba343efb56b183d83b2a35ed4b8d9b3e27b2a348612a4f77adf9e913fe0
MD5 ac98a18d4bfecde3b3ee8fea6db940e1
BLAKE2b-256 e97650a4d317994bd1a2818b4dba2027caa17ad1a282819d7c88ddbd0392a546

See more details on using hashes here.

File details

Details for the file piperider_nightly-0.3.0.20220703-py3-none-any.whl.

File metadata

File hashes

Hashes for piperider_nightly-0.3.0.20220703-py3-none-any.whl
Algorithm Hash digest
SHA256 95f50ecfd705e7d8a07b4edde52e04872a35b86b18e7a878107166c9df4da69b
MD5 5fb808de0a8f2d46cbd9caf9c57e75b6
BLAKE2b-256 f92ef4ac9daa543ea4df08955b1a2229640c8095862e41ed9b396e1d0d385146

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page