Configure and enforce conventions for your dbt project.

dbt-bouncer

How to use

Generate dbt artifacts by running a dbt command.
Create a dbt-bouncer.yml config file (see the Config file section below).
Run dbt-bouncer to validate that your conventions are being maintained. You can run dbt-bouncer via GitHub Actions, Docker, a .pex file or Python.

GitHub Actions

steps:
    ...

    - uses: godatadriven/dbt-bouncer@v0
      with:
        config-file: ./<PATH_TO_CONFIG_FILE>
        output-file: results.json # optional, default does not save a results file
        send-pr-comment: true # optional, defaults to true

    ...

Docker

Don't use GitHub Actions? You can still use dbt-bouncer via Docker:

docker run --rm \
    --volume "$PWD":/app \
    ghcr.io/godatadriven/dbt-bouncer:vX.X.X \
    --config-file /app/<PATH_TO_CONFIG_FILE>

Pex

You can also run the .pex (Python EXecutable) artifact directly once you have a Python interpreter (3.8 to 3.12) installed:

wget https://github.com/godatadriven/dbt-bouncer/releases/download/vX.X.X/dbt-bouncer.pex -O dbt-bouncer.pex

python dbt-bouncer.pex --config-file $PWD/<PATH_TO_CONFIG_FILE>

Python

Install from pypi.org:

pip install dbt-bouncer

Run:

dbt-bouncer --config-file <PATH_TO_CONFIG_FILE>

Config file

dbt-bouncer requires a config file, which determines which checks are run. Here is an example config file:

dbt_artifacts_dir: target # [Optional] Directory where the dbt artifacts exist, generally the `target` directory inside a dbt project. Defaults to `./target`.

manifest_checks:
  - name: check_macro_name_matches_file_name
  - name: check_model_names
    include: ^staging
    model_name_pattern: ^stg_

For more example config files, see here.
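
To illustrate how the include and model_name_pattern settings in the example above interact, here is a minimal Python sketch. It is not dbt-bouncer's implementation: the Model class, the failing_model_names function and the choice of re.match are assumptions made for illustration only.

```python
import re
from dataclasses import dataclass


@dataclass
class Model:
    """Hypothetical stand-in for a dbt model (not a dbt-bouncer class)."""

    name: str
    path: str  # path relative to the models directory


def failing_model_names(models, include, model_name_pattern):
    """Return names of models whose path matches `include` but whose
    name does not match `model_name_pattern`.

    Assumption: both patterns are applied with re.match, i.e. anchored
    at the start of the string.
    """
    include_re = re.compile(include)
    name_re = re.compile(model_name_pattern)
    return [
        m.name
        for m in models
        if include_re.match(m.path) and not name_re.match(m.name)
    ]


models = [
    Model("stg_orders", "staging/stg_orders.sql"),
    Model("orders_raw", "staging/orders_raw.sql"),
    Model("fct_orders", "marts/fct_orders.sql"),
]

# Only models under staging/ are inspected; fct_orders is ignored.
print(failing_model_names(models, "^staging", "^stg_"))  # ['orders_raw']
```

The point of the sketch is that include narrows which models a check applies to, while model_name_pattern is the convention being enforced on that subset.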

Note that the config can also be passed via a pyproject.toml file:

[tool.dbt-bouncer]
dbt_artifacts_dir = "target"

[[tool.dbt-bouncer.manifest_checks]]
name = "check_macro_name_matches_file_name"

[[tool.dbt-bouncer.manifest_checks]]
name = "check_model_names"
include = "^staging"
model_name_pattern = "^stg_"

Checks

Catalog checks

These checks require the following artifacts to be present:

catalog.json
manifest.json

Columns

check_column_has_specified_test: Columns that match the specified regexp pattern must have a specified test.
check_column_name_complies_to_column_type: Columns with a specified data type must comply with the specified regexp naming pattern.
check_columns_are_all_documented: All columns in a model should be included in the model's properties file, i.e. .yml file.
check_columns_are_documented_in_public_models: Columns should have a populated description in public models.

Sources

check_source_columns_are_all_documented: All columns in a source should be included in the source's properties file, i.e. .yml file.

Manifest checks

These checks require the following artifact to be present:

manifest.json

Exposures

check_exposure_based_on_non_public_models: Exposures should be based on public models only.
check_exposure_based_on_view: Exposures should not be based on views.

Lineage

check_lineage_permitted_upstream_models: Upstream models must have a path that matches the provided upstream_path_pattern.
check_lineage_seed_cannot_be_used: Seeds cannot be referenced in models with a path that matches the specified include config.
check_lineage_source_cannot_be_used: Sources cannot be referenced in models with a path that matches the specified include config.

Macros

check_macro_arguments_description_populated: Macro arguments must have a populated description.
check_macro_code_does_not_contain_regexp_pattern: The raw code for a macro must not match the specified regexp pattern.
check_macro_description_populated: Macros must have a populated description.
check_macro_name_matches_file_name: Macro names must be the same as the file they are contained in.
check_macro_property_file_location: Macro properties files must follow the guidance provided by dbt here.

Metadata

check_project_name: Enforce that the name of the dbt project matches a supplied regex.

Models

check_model_access: Models must have the specified access attribute.
check_model_contract_enforced_for_public_model: Public models must have contracts enforced.
check_model_code_does_not_contain_regexp_pattern: The raw code for a model must not match the specified regexp pattern.
check_model_depends_on_multiple_sources: Models cannot reference more than one source.
check_model_description_populated: Models must have a populated description.
check_model_directories: Only specified sub-directories are permitted.
check_model_documentation_coverage: Set the minimum percentage of models that have a populated description.
check_model_documented_in_same_directory: Models must be documented in the same directory where they are defined (i.e. .yml and .sql files are in the same directory).
check_model_has_contracts_enforced: Models must have contracts enforced.
check_model_has_meta_keys: The meta config for models must have the specified keys.
check_model_has_no_upstream_dependencies: Identify models that have no upstream dependencies, as this likely indicates hard-coded table references.
check_model_has_tags: Models must have the specified tags.
check_model_max_chained_views: Models cannot have more than the specified number of upstream dependencies that are not tables (default: 3).
check_model_max_fanout: Models cannot have more than the specified number of downstream models (default: 3).
check_model_max_upstream_dependencies: Limit the number of upstream dependencies a model has. Default values are 5 for models, 5 for macros, and 1 for sources.
check_model_names: Models must have a name that matches the supplied regex.
check_model_property_file_location: Model properties files must follow the guidance provided by dbt here.
check_model_test_coverage: Set the minimum percentage of models that have at least one test.
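
As an illustration of what a coverage-style check such as check_model_documentation_coverage measures, the sketch below computes the percentage of models with a populated description from a manifest.json-shaped dict. It relies only on the nodes, resource_type and description fields of the dbt manifest; the function name and the decision to report 100% for a project with no models are assumptions, not dbt-bouncer's behaviour.

```python
def model_documentation_coverage(manifest):
    """Return the percentage of models with a non-empty description.

    `manifest` is a dict shaped like dbt's manifest.json.
    Assumption: a project with no models counts as fully covered.
    """
    models = [
        node
        for node in manifest.get("nodes", {}).values()
        if node.get("resource_type") == "model"
    ]
    if not models:
        return 100.0
    documented = sum(
        1 for node in models if (node.get("description") or "").strip()
    )
    return 100.0 * documented / len(models)


# Tiny hand-written manifest: 4 models (3 documented) and 1 test node.
manifest = {
    "nodes": {
        "model.proj.stg_orders": {"resource_type": "model", "description": "Staged orders."},
        "model.proj.stg_customers": {"resource_type": "model", "description": ""},
        "model.proj.fct_orders": {"resource_type": "model", "description": "Order facts."},
        "model.proj.dim_customers": {"resource_type": "model", "description": "Customers."},
        "test.proj.not_null_orders_id": {"resource_type": "test", "description": ""},
    }
}

print(model_documentation_coverage(manifest))  # 75.0
```

A check would then compare this figure with the configured minimum percentage.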

Sources

check_source_description_populated: Sources must have a populated description.
check_source_freshness_populated: Sources must have a populated freshness.
check_source_loader_populated: Sources must have a populated loader.
check_source_has_meta_keys: The meta config for sources must have the specified keys.
check_source_has_tags: Sources must have the specified tags.
check_source_names: Sources must have a name that matches the supplied regex.
check_source_not_orphaned: Sources must be referenced in at least one model.
check_source_property_file_location: Source properties files must follow the guidance provided by dbt here.
check_source_used_by_models_in_same_directory: Sources can only be referenced by models that are located in the same directory where the source is defined.
check_source_used_by_only_one_model: Each source can be referenced by a maximum of one model.

Tests

check_model_has_unique_test: Models must have a test for uniqueness of a column.

Run results checks

These checks require the following artifacts to be present:

manifest.json
run_results.json
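
Because each category of checks needs different artifacts, it can be handy to confirm they exist before running dbt-bouncer. The sketch below encodes the requirements listed in this section. The category labels catalog_checks and run_results_checks and the missing_artifacts helper are assumptions chosen for illustration; only manifest_checks appears in the example config above.

```python
import tempfile
from pathlib import Path

# Artifact requirements as listed for each category of checks.
# Category labels are illustrative, not confirmed config keys.
REQUIRED_ARTIFACTS = {
    "catalog_checks": ["catalog.json", "manifest.json"],
    "manifest_checks": ["manifest.json"],
    "run_results_checks": ["manifest.json", "run_results.json"],
}


def missing_artifacts(artifacts_dir, categories):
    """Return the sorted artifact file names that `categories` need but
    that are not present in `artifacts_dir`."""
    needed = sorted(
        {name for category in categories for name in REQUIRED_ARTIFACTS[category]}
    )
    return [name for name in needed if not (Path(artifacts_dir) / name).is_file()]


# Demonstration against a temporary directory holding only manifest.json.
with tempfile.TemporaryDirectory() as tmp:
    (Path(tmp) / "manifest.json").write_text("{}")
    print(missing_artifacts(tmp, ["manifest_checks", "run_results_checks"]))
    # ['run_results.json']
```

In a real project you would pass the directory configured as dbt_artifacts_dir, typically target.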

Results

check_run_results_max_gigabytes_billed: Each result can have a maximum number of gigabytes billed. Note that this only works for the dbt-bigquery adapter.
check_run_results_max_execution_time: Each result can take a maximum duration (seconds).
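
To make the idea behind check_run_results_max_execution_time concrete, here is a small sketch that flags results exceeding a duration limit. It uses the results, unique_id and execution_time fields of dbt's run_results.json; the function and its parameter name are hypothetical and do not reflect dbt-bouncer's own code or config keys.

```python
def slow_results(run_results, max_execution_time_seconds):
    """Return (unique_id, execution_time) pairs for results that took
    longer than the given limit in seconds.

    `run_results` is a dict shaped like dbt's run_results.json.
    """
    return [
        (result["unique_id"], result["execution_time"])
        for result in run_results.get("results", [])
        if result["execution_time"] > max_execution_time_seconds
    ]


# Hand-written sample data for illustration.
run_results = {
    "results": [
        {"unique_id": "model.proj.stg_orders", "execution_time": 1.2},
        {"unique_id": "model.proj.fct_orders", "execution_time": 95.0},
    ]
}

print(slow_results(run_results, 60))  # [('model.proj.fct_orders', 95.0)]
```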

Saving results to a file

It is possible to save the outcome of a run, and associated metadata, to a .json file. This file will contain all the checks that were run, both failed and successful. This can be achieved by using the --output-file flag:

dbt-bouncer --config-file <PATH_TO_CONFIG_FILE> --output-file <PATH_TO_OUTPUT_FILE>
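
If you drive dbt-bouncer from a Python script, the command above can be assembled and run with subprocess. This is a sketch under assumptions: build_command and run_and_load are hypothetical helpers, and nothing is assumed about the structure of the output file or the meaning of the exit code, so the JSON is loaded and returned as-is.

```python
import json
import subprocess
from pathlib import Path


def build_command(config_file, output_file=None):
    """Build the dbt-bouncer command line using the two flags shown above."""
    command = ["dbt-bouncer", "--config-file", str(config_file)]
    if output_file is not None:
        command += ["--output-file", str(output_file)]
    return command


def run_and_load(config_file, output_file):
    """Run dbt-bouncer and return (return code, parsed output or None).

    Requires dbt-bouncer to be installed and on PATH. The output file's
    schema is not assumed; its content is returned unmodified.
    """
    completed = subprocess.run(build_command(config_file, output_file))
    path = Path(output_file)
    results = json.loads(path.read_text()) if path.is_file() else None
    return completed.returncode, results


# Only the command is printed here, so the example runs without dbt-bouncer.
print(build_command("dbt-bouncer.yml", "results.json"))
```

Passing the arguments as a list avoids shell quoting problems when paths contain spaces.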

Reporting bugs and contributing code

Want to report a bug or request a feature? Let us know and open an issue.
Want to help us build dbt-bouncer? Check out the Contributing Guide.

Code of Conduct

Everyone interacting in dbt-bouncer's codebase, issue trackers, chat rooms, and mailing lists is expected to follow the Code of Conduct.
