Data quality, observability, semantic, and causality library

dqtlib

Open-source data quality, lineage, semantic layer & causality — for dbt, warehouses and data lakes.

A pip-installable Python library that watches dbt-built warehouses, and any other SQL warehouse, for statistical drift, anomalies, and silent regressions, and explains why metrics moved.

pip install dqtlib

The import name is dqt:

from dqt import Check, Runner, MemoryStore
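Because the distribution name (dqtlib) differs from the import name (dqt), it can help to confirm both after installing. The sketch below uses only the standard library; `describe_install` is a hypothetical helper written for this illustration, not part of dqt.

```python
from importlib import metadata, util


def describe_install(dist_name: str = "dqtlib", import_name: str = "dqt") -> dict:
    """Report the installed distribution version and whether the module is importable.

    Hypothetical helper for illustration; it does not call any dqt API.
    """
    try:
        version = metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        version = None  # the distribution is not installed in this environment
    # find_spec returns None when no top-level module of that name is found
    importable = util.find_spec(import_name) is not None
    return {
        "distribution": dist_name,
        "version": version,
        "module": import_name,
        "importable": importable,
    }


if __name__ == "__main__":
    print(describe_install())
```

If `version` is set but `importable` is false, the usual cause is that the package was installed into a different environment than the interpreter running the check.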

Full documentation and examples: https://github.com/antonbarr-data/dqt

Quality

All 64 detectors are benchmarked against labeled synthetic datasets. Results are reproducible by anyone:

Median F1: 1.00 | Detectors with F1 >= 0.8: 52/64 | Detectors with F1 >= 0.6: 58/64
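For readers unfamiliar with how a summary line like this is derived, here is a minimal sketch of scoring binary anomaly flags against labels and rolling per-detector F1 values into the same three figures. It is not the project's benchmark code; the function names and the toy labels are assumptions made for illustration.

```python
from statistics import median


def f1_score(y_true, y_pred):
    """F1 for binary labels, where 1 means 'anomaly' and 0 means 'clean'."""
    pairs = list(zip(y_true, y_pred))
    tp = sum(1 for t, p in pairs if t and p)
    fp = sum(1 for t, p in pairs if not t and p)
    fn = sum(1 for t, p in pairs if t and not p)
    denom = 2 * tp + fp + fn
    # Convention chosen here: F1 is 0.0 when there are no positives at all
    return 2 * tp / denom if denom else 0.0


def summarize(f1_by_detector):
    """Median F1 plus counts of detectors clearing the 0.8 and 0.6 bars."""
    scores = list(f1_by_detector.values())
    return {
        "median_f1": median(scores),
        "f1_ge_0_8": sum(s >= 0.8 for s in scores),
        "f1_ge_0_6": sum(s >= 0.6 for s in scores),
        "total": len(scores),
    }


if __name__ == "__main__":
    labels = [0, 0, 1, 1, 0, 1]  # toy ground truth, not benchmark data
    flags = {
        "detector_a": [0, 0, 1, 1, 0, 1],
        "detector_b": [0, 1, 1, 0, 0, 1],
    }
    per_detector = {name: f1_score(labels, pred) for name, pred in flags.items()}
    print(summarize(per_detector))
```

A median of 1.00 only says that at least half of the detectors scored perfectly on their datasets, which is why the threshold counts are reported alongside it.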

Per-detector breakdown: examples/benchmarks/results_summary.md

Reproduce locally:

python scripts/run_benchmark_suite.py --quick
python scripts/generate_benchmark_summary.py

Adapters

Adapter      Nightly tests
PostgreSQL   postgres
ClickHouse   clickhouse
Snowflake    snowflake
BigQuery     bigquery
Databricks   databricks
MySQL        mysql
Redshift     redshift
DuckDB       duckdb
Trino        trino

Detector documentation

64 statistical detectors across 10 groups — drift, outliers, time series, distribution, information theory, pattern, referential, schema, basic, and custom.

Every detector has a structured page at docs/algorithms/<group>/<slug>.md covering:

  • What it computes and its parameters
  • When it works well and when it fails (with a concrete failure-mode table)
  • Default-threshold calibration — empirical FPR across six canonical data shapes (Normal, Lognormal, Poisson, Beta, Pareto, Exponential)
  • Recommended thresholds per data shape
  • Canonical citation and runnable Python API example
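The calibration idea in the list above can be sketched as follows: fit a detector on clean reference data, apply it to fresh clean data from the same distribution, and count how often it fires. Everything here is an assumption made for illustration: the 3-sigma z-score rule stands in for a real detector, the distribution parameters are arbitrary, and none of the names come from dqt.

```python
import math
import random
from statistics import mean, pstdev


def poisson(rng, lam):
    """Draw one Poisson variate with Knuth's multiplication method."""
    limit = math.exp(-lam)
    k, p = 0, 1.0
    while True:
        p *= rng.random()
        if p <= limit:
            return k
        k += 1


# The six shapes named above; parameters are arbitrary illustrative choices.
SHAPES = {
    "Normal": lambda rng: rng.gauss(0.0, 1.0),
    "Lognormal": lambda rng: rng.lognormvariate(0.0, 1.0),
    "Poisson": lambda rng: poisson(rng, 4.0),
    "Beta": lambda rng: rng.betavariate(2.0, 5.0),
    "Pareto": lambda rng: rng.paretovariate(3.0),
    "Exponential": lambda rng: rng.expovariate(1.0),
}


def empirical_fpr(sampler, threshold=3.0, n_ref=2000, n_test=2000, seed=0):
    """Fraction of clean test points flagged by a z-score rule fit on clean reference data."""
    rng = random.Random(seed)
    ref = [sampler(rng) for _ in range(n_ref)]
    mu, sigma = mean(ref), pstdev(ref)
    test = [sampler(rng) for _ in range(n_test)]
    # Every flag is a false positive, because the test data contains no anomalies
    flagged = sum(abs(x - mu) > threshold * sigma for x in test)
    return flagged / n_test


if __name__ == "__main__":
    for name, sampler in SHAPES.items():
        print(f"{name:12s} FPR at 3 sigma: {empirical_fpr(sampler):.4f}")
```

A rule tuned on Normal data will generally fire more often on heavy-tailed shapes such as Lognormal or Pareto, which is the motivation for recommending thresholds per data shape.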

Browse the full catalog: docs/algorithms/README.md
