Skip to main content

A CLI tool for working with data products and contracts

Project description

Turbine

Contract-driven data quality for data products — powered by ODCS and Soda Core.

pipeline coverage python ODCS

turbine check

How It Works

Contract ➜ Lint ➜ Check ➜ Score ➜ Flag ➜ Observe
  1. Contract — Define expectations in YAML using ODCS v3.1.0
  2. Lint — Validate contract schema before anything touches a database
  3. Check — Run quality checks against live data (SodaCL, SQL, Python, window, group)
  4. Score — Calculate a dimension-aware quality score
  5. Flag — Tag failing rows with bitmask flags for downstream filtering
  6. Observe — Export traces and metrics via OpenTelemetry

Features

  • YAML contracts — ODCS v3.1.0 with Soda extensions for quality checks
  • 13 check types — missing, duplicate, invalid, freshness, row_count, SQL, Python, typed multi-table Python, group, and window (zscore, spike, flatline)
  • Schema drift detection — Compare live database schemas against your contract
  • Dimension-aware scoring — Weight quality dimensions (completeness, accuracy, …) per check
  • Row-level flagging — Per-cell Roaring-bitmap matrix tracks which rows failed which checks across runs
  • Code generation — Scaffold SQLModel models and FastAPI routers from contracts
  • Dependency management — Lockfile-based contract dependency resolution
  • IDE support — Language server (LSP) with extensions for VSCode and JetBrains

Quick Start

Prerequisites: Python 3.13+ and uv

# Install with your database driver
uv add "turbine-data[snowflake]"    # or: postgres, duckdb

# Initialize the recommended src/{project_name} layout
uv run turbine init --defaults

# Copy .env.example to .env and fill your warehouse credentials
cp .env.example .env

# Validate the starter Contract
uv run turbine lint src/{project_name}/contracts/example.yml

# Run quality checks against the starter Datasource named default
uv run turbine check --datasource default src/{project_name}/contracts/example.yml

Supported Databases

Database Install extra
PostgreSQL turbine-data[postgres]
Snowflake turbine-data[snowflake]
DuckDB turbine-data[duckdb]

Documentation

Full docs live in docs/ — covering getting started, guides, concepts, and CLI reference.

Contributing

See the contributing guide for dev setup, testing, and code style.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

turbine_data-0.1.3.tar.gz (9.0 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

turbine_data-0.1.3-py3-none-any.whl (1.0 MB view details)

Uploaded Python 3

File details

Details for the file turbine_data-0.1.3.tar.gz.

File metadata

  • Download URL: turbine_data-0.1.3.tar.gz
  • Upload date:
  • Size: 9.0 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for turbine_data-0.1.3.tar.gz
Algorithm Hash digest
SHA256 b2b92beb481ae54425bc2a2c808857676802f90e06d22466189453db00510988
MD5 4f29566aa7875dc77addb29079eae20e
BLAKE2b-256 8d3b6d927a4c891c104f36fbd12e5fd197e3ca0e9ec205e2b32e0aa081f7d835

See more details on using hashes here.

File details

Details for the file turbine_data-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: turbine_data-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 1.0 MB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.0 {"installer":{"name":"uv","version":"0.10.0","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for turbine_data-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 c6c613dcf6831fb0b801934476ba54c35efa1665db4867d2eccce476632070b7
MD5 cd4040d834da046e8f36159f0ab27eea
BLAKE2b-256 3786eb2473844d299d16587dc1b7588e485e9232c254b4081379cf724eccbece

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page