Environment diff tool for dbt
Project description
recce
recce is an environment diff tool for dbt.
Features
- Uses the same adapter framework as dbt.
- Supports both Web UI and CLI.
- Lineage diff.
Use cases
- When developing, you can check new results by comparing them against production.
- When reviewing a PR, you can understand the impact of the change.
- When troubleshooting, you can run an ad hoc diff query to find the root cause.
Usage
Prerequisites
You need at least two environments in your dbt project, for example one for development and another for production. You can prepare two targets with separate schemas in your dbt profile. Here is an example profiles.yml:
jaffle_shop:
  target: dev
  outputs:
    dev:
      type: duckdb
      path: jaffle_shop.duckdb
      schema: dev
    prod:
      type: duckdb
      path: jaffle_shop.duckdb
      schema: main
Getting Started
- Installation
  git clone git@github.com:InfuseAI/recce.git
  cd recce
  pip install -e .
- Put the manifest.json of production (or any environment you would like to diff against) in the target-base/ folder. manifest.json is one of the artifacts generated by each dbt command execution. You can find it in the target/ folder by default.
- Develop your awesome features
  dbt run
- Run the recce command
  recce server
- Review the lineage diff.
- Switch to the query tab, then write and run a query
  select * from {{ ref('mymodel') }}
  where ref is a Jinja macro to reference a model name.
Under the hood, recce uses the manifest.json under target/ and target-base/ to generate and execute the queries.
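To make the role of the two manifests concrete, here is a minimal Python sketch of how one query template could resolve to a different relation per environment. It is an illustration under stated assumptions, not recce's actual implementation: the helper names (load_model_relations, render_query, render_for_both) are hypothetical, the regular expression stands in for real Jinja rendering, and the relation is built only from the schema, alias and name fields of each model node in manifest.json.

```python
import json
import re
from pathlib import Path

# Matches {{ ref('name') }} or {{ ref("name") }}.
# Assumption: a regex is a simplification of real Jinja rendering.
REF_PATTERN = re.compile(r"\{\{\s*ref\(\s*['\"]([^'\"]+)['\"]\s*\)\s*\}\}")


def load_model_relations(manifest_path):
    """Map each model name to 'schema.identifier' using one manifest.json."""
    manifest = json.loads(Path(manifest_path).read_text())
    relations = {}
    for node in manifest.get("nodes", {}).values():
        if node.get("resource_type") != "model":
            continue
        identifier = node.get("alias") or node["name"]
        relations[node["name"]] = f'{node["schema"]}.{identifier}'
    return relations


def render_query(sql_template, relations):
    """Replace every ref('model') in the template with its resolved relation."""
    def substitute(match):
        return relations[match.group(1)]
    return REF_PATTERN.sub(substitute, sql_template)


def render_for_both(sql_template,
                    current="target/manifest.json",
                    base="target-base/manifest.json"):
    """Render the same template once per environment (hypothetical helper)."""
    return {
        "base": render_query(sql_template, load_model_relations(base)),
        "current": render_query(sql_template, load_model_relations(current)),
    }
```

With the example profile above, the same template would be expected to point at the main schema for the base manifest and at the dev schema for the current one, which is what makes a side-by-side comparison possible.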
Run Query Diff
You can run a query diff either in the Web UI
recce server
or in the CLI
recce diff --sql "select * from {{ ref('mymodel') }}"
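Because the query contains single quotes around the model name, shell quoting matters when passing it to the CLI. The Python sketch below is one way to sidestep that: it builds the argument list for recce diff --sql directly, so no shell is involved. The build_diff_command helper is hypothetical, and only the diff subcommand and --sql option shown above are assumed to exist.

```python
import shlex


def build_diff_command(sql):
    """Return the argv list for `recce diff --sql <sql>` (hypothetical helper)."""
    return ["recce", "diff", "--sql", sql]


sql = "select * from {{ ref('mymodel') }}"
argv = build_diff_command(sql)

# Passing argv to subprocess.run(argv, check=True) would skip the shell,
# so the inner single quotes reach recce unchanged.

# For copy-pasting into a terminal, shlex.join produces a safely quoted line.
shell_line = shlex.join(argv)

# Wrapping the whole query in single quotes loses the inner quotes under
# POSIX shell rules, which shlex.split reproduces here.
naive = shlex.split("recce diff --sql 'select * from {{ ref('mymodel') }}'")
lost_quotes_sql = naive[-1]  # "select * from {{ ref(mymodel) }}"
```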
Specify the primary key columns
In the query diff, we use primary key columns as the basis for identifying the same record on both sides.
There are two ways to specify the primary key:
- Define in the SQL: Add the config macro to your SQL.
  {{ config(primary_key=['DATE_WEEK', 'COUNTRY']) }}
  select ...
- Select in the query result: In the Web UI, you can click the key icons in the column headers to toggle whether a column is a primary key.
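The idea of matching records by primary key can be sketched in a few lines of Python. This is an illustrative sketch only, not recce's diff algorithm: the diff_by_primary_key function and its result layout are assumptions, and rows are modeled as plain dictionaries.

```python
def diff_by_primary_key(base_rows, current_rows, primary_key):
    """Pair rows from both sides by their primary key values and classify them."""
    def index(rows):
        # Key each row by the tuple of its primary key column values.
        return {tuple(row[col] for col in primary_key): row for row in rows}

    base, current = index(base_rows), index(current_rows)
    return {
        "added": [current[k] for k in current if k not in base],
        "removed": [base[k] for k in base if k not in current],
        "modified": [
            (base[k], current[k])
            for k in base
            if k in current and base[k] != current[k]
        ],
    }


# Made-up sample rows using the key columns from the example above.
base_rows = [
    {"DATE_WEEK": "2023-01-02", "COUNTRY": "TW", "REVENUE": 100},
    {"DATE_WEEK": "2023-01-02", "COUNTRY": "US", "REVENUE": 250},
]
current_rows = [
    {"DATE_WEEK": "2023-01-02", "COUNTRY": "TW", "REVENUE": 120},
    {"DATE_WEEK": "2023-01-09", "COUNTRY": "TW", "REVENUE": 90},
]
result = diff_by_primary_key(base_rows, current_rows, ["DATE_WEEK", "COUNTRY"])
```

Without a primary key there is no reliable way to tell a changed row from one removed row plus one added row, which is why the key columns have to be specified.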
Q&A
Q: How does recce connect to my data warehouse? Does recce support my data warehouse?
recce uses the dbt adapter to connect to your warehouse, so it should work with your data warehouse.
Q: What credentials does recce use to connect to the two environments?
recce uses the same target in the profile to connect to your warehouse. If you use the default target dev, it uses that target's credentials to connect to both environments, so please make sure those credentials can access both environments.
Source Distribution
File details
Details for the file recce-0.1.0.dev0.tar.gz.
File metadata
- Download URL: recce-0.1.0.dev0.tar.gz
- Upload date:
- Size: 556.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.11
File hashes
Algorithm | Hash digest
---|---
SHA256 | d2064e64c5c9cfd4e1e0beab8e8f1dd0f1370fa0bdbda64984cc91c1b6efabd5
MD5 | 730c996a5eef3fe3e3f149804c313ee5
BLAKE2b-256 | 3f4034baac0d55171496d7405cd4e0027741d5b557d09f69e3949c270253e3ef