Skip to main content

Dagster integration for the Rocky SQL transformation engine

Project description

dagster-rocky

Dagster integration for the Rocky SQL transformation engine.

dagster-rocky wraps the rocky CLI as a Dagster ConfigurableResource and exposes Rocky-managed tables as materializable Dagster assets — complete with check results, lineage, drift detection and quality metrics surfaced as native Dagster events.

Install

uv add dagster-rocky

You'll also need the Rocky CLI on your $PATH: https://github.com/rocky-data/rocky/releases?q=engine

Quick start (component)

Add a defs.yaml next to your other Dagster definitions:

type: dagster_rocky.RockyComponent
attributes:
  binary_path: rocky
  config_path: config/rocky.toml
  models_dir: models

Dagster's component loader will:

  1. Run rocky discover (and rocky compile, when models are present) and cache the result.
  2. Build one subset-aware multi_asset per Rocky group, with declared row_count / column_match / freshness checks per table.
  3. On materialization, shell out to rocky run --filter <key>=<value> for the selected subset and yield MaterializeResult + AssetCheckResult events with rich metadata.

Quick start (resource)

import dagster as dg
from dagster_rocky import RockyResource

rocky = RockyResource(
    binary_path="rocky",
    config_path="config/rocky.toml",
    timeout_seconds=3600,
)

defs = dg.Definitions(resources={"rocky": rocky})

Then in an asset:

@dg.asset
def acme_orders(rocky: RockyResource) -> dg.MaterializeResult:
    result = rocky.run(filter="tenant=acme")
    return dg.MaterializeResult(
        metadata={"tables_copied": result.tables_copied, "duration_ms": result.duration_ms},
    )

Public API

Symbol Purpose
RockyResource ConfigurableResource wrapping the Rocky CLI
RockyComponent State-backed Dagster component that loads Rocky tables as assets
RockyDagsterTranslator Subclass to customize asset key / group / tag mapping
RockyMetadataSet Namespaced metadata (source_id, strategy, watermark, …)
load_rocky_assets() Functional helper that returns AssetSpec for each Rocky table
emit_materializations() / emit_check_results() Convert a RunResult into Dagster events
check_metadata() Build a metadata mapping for a single Rocky CheckResult
cost_metadata_from_optimize() Extract per-model cost recommendations from OptimizeResult
parse_rocky_output() Auto-detect a Rocky JSON payload and return the matching Pydantic model

Documentation

Related projects

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dagster_rocky-1.19.0.tar.gz (341.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dagster_rocky-1.19.0-py3-none-any.whl (194.1 kB view details)

Uploaded Python 3

File details

Details for the file dagster_rocky-1.19.0.tar.gz.

File metadata

  • Download URL: dagster_rocky-1.19.0.tar.gz
  • Upload date:
  • Size: 341.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for dagster_rocky-1.19.0.tar.gz
Algorithm Hash digest
SHA256 f1c679eb16bd0c0bc38e4b70bf3aeec6af0e92fd22352c41b4254e27019dc817
MD5 792126ee23b38aff225a4c01bd53641a
BLAKE2b-256 fae16417a5f8777c6841ebfa09048fdec9962f3944b1d7dd324763ff4699b660

See more details on using hashes here.

Provenance

The following attestation bundles were made for dagster_rocky-1.19.0.tar.gz:

Publisher: dagster-release.yml on rocky-data/rocky

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file dagster_rocky-1.19.0-py3-none-any.whl.

File metadata

  • Download URL: dagster_rocky-1.19.0-py3-none-any.whl
  • Upload date:
  • Size: 194.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.13

File hashes

Hashes for dagster_rocky-1.19.0-py3-none-any.whl
Algorithm Hash digest
SHA256 37ec9487f21f9e02bac11e0076b9c9378e25076c8265d10ac913ac2abe14d2d6
MD5 b430d906d03003cd827686921dc382fe
BLAKE2b-256 fdfe301788d0ce6fe2b3666edb614efe94bbb479c483113dad1f0a52ec75dce0

See more details on using hashes here.

Provenance

The following attestation bundles were made for dagster_rocky-1.19.0-py3-none-any.whl:

Publisher: dagster-release.yml on rocky-data/rocky

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page