Skip to main content

Data quality, observability, semantic, and causality library

Project description

dqtlib

Open-source data quality, lineage, semantic layer & causality — for dbt, warehouses and data lakes.

pip-installable Python library for watching dbt-built warehouses and any SQL warehouse for statistical drift, anomalies, silent regressions, and explaining why metrics moved.

pip install dqtlib

The import name is dqt:

from dqt import Check, Runner, MemoryStore

Full documentation and examples: https://github.com/antonbarr-data/dqt

Quality

All 64 detectors are benchmarked against labeled synthetic datasets. Benchmark scripts and raw results live in the GitHub repo and are run on every release:

Median F1: 1.00 | Detectors with F1 >= 0.8: 52/64 | Detectors with F1 >= 0.6: 58/64

Per-detector breakdown: examples/benchmarks/results_summary.md

Reproduce by cloning the repo and running:

python scripts/run_benchmark_suite.py --quick
python scripts/generate_benchmark_summary.py

Adapters

Six adapters ship in v1.0. Nightly CI runs against live credentials for the cloud warehouses.

Adapter CI
PostgreSQL (local) bundled — no credentials needed
ClickHouse clickhouse
Snowflake snowflake
BigQuery bigquery
Databricks databricks
Local (DuckDB) bundled — no credentials needed

Detector documentation

64 statistical detectors across 10 groups — drift, outliers, time series, distribution, information theory, pattern, referential, schema, basic, and custom.

Every detector has a structured page at docs/algorithms/<group>/<slug>.md covering:

  • What it computes and its parameters
  • When it works well and when it fails (with concrete failure-mode table)
  • Default-threshold calibration — empirical FPR across six canonical data shapes (Normal, Lognormal, Poisson, Beta, Pareto, Exponential)
  • Recommended thresholds per data shape
  • Canonical citation and runnable Python API example

Browse the full catalog: docs/algorithms/README.md

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dqtlib-1.0.3.tar.gz (272.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dqtlib-1.0.3-py3-none-any.whl (307.8 kB view details)

Uploaded Python 3

File details

Details for the file dqtlib-1.0.3.tar.gz.

File metadata

  • Download URL: dqtlib-1.0.3.tar.gz
  • Upload date:
  • Size: 272.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.11 {"installer":{"name":"uv","version":"0.11.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":null,"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for dqtlib-1.0.3.tar.gz
Algorithm Hash digest
SHA256 d11df9a8ace28add7965a0ce4acfa0bce36415bc7e740dc637cdb7e9ffc14dd0
MD5 e2006a80c439ddb74ed0e4474a6c5831
BLAKE2b-256 ae63b74b9c7723b6e2fae08a6b01e98c651e8e79cd506e7284c5453dcca84462

See more details on using hashes here.

File details

Details for the file dqtlib-1.0.3-py3-none-any.whl.

File metadata

  • Download URL: dqtlib-1.0.3-py3-none-any.whl
  • Upload date:
  • Size: 307.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.11.11 {"installer":{"name":"uv","version":"0.11.11","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":null,"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for dqtlib-1.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 781b8f4bc8d34d942891a7a8fb70a1bb7dabf731c4cf9b80be9d8d1d758e62b3
MD5 4cc9f4b866d104173751c2715699792b
BLAKE2b-256 160bbcee7aa1a35f99809569a2c869e7a0e254a3f86c26a8fed2359c309ce678

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page