Skip to main content

Data observability for Databricks — freshness, volume, and schema-change monitoring

Project description

DashObserve — Databricks Library

CI PyPI License

Part of the Dashlibs suite — Databricks libraries built for business users.

Notebook-native data observability for Databricks — no external service, no agent to deploy.

  • Freshness monitoring — alert when a table's most recent timestamp value is older than expected
  • Volume monitoring — alert on row-count bounds, or on deviation from a rolling historical baseline
  • Schema-change detection — alert when columns are added, removed, or change type since the last run
  • Next-update prediction — estimate when a table will next be refreshed, based on its observed update cadence
  • Volume forecasting — project row counts over the next N days / weeks / months using a linear trend fitted to historical observations

All monitor results are appended to a Delta history table, which feeds baselines, schema-diff comparisons, and the forecasting models for future runs.

Installation

%pip install dash-observe

Quick Start

import dashobserve
dashobserve.launch()   # Opens interactive UI in your Databricks notebook

What it looks like

DashObserve UI

Python API

from dashobserve import MonitorConfig, run_monitors, run_forecast

# Monitor
cfg = MonitorConfig(
    table="catalog.schema.orders",
    freshness_column="updated_at", max_staleness_minutes=60,
    min_rows=1000, volume_tolerance_pct=20,
    track_schema=True,
)
report = run_monitors(cfg, history_table="catalog.schema.observe_history")
report.display()
print(report.summary())

# Forecast (requires history built up from prior monitor runs)
forecast = run_forecast(
    table="catalog.schema.orders",
    history_table="catalog.schema.observe_history",
    n_periods=4, period="weeks",
)
forecast.display()

Part of Dashlibs

Library Purpose
dash-dq Data Quality
dash-synthetic Synthetic Data Generation
dash-observe Data Observability (freshness, volume, schema)
dash-ml ML Model Monitoring
dash-ingest Data Ingestion
dash-gov Data Governance
dash-ontology Ontology & Lineage for AI
dash-ui Shared UI components (PyPI: dash-uis)

Quality & Contributing

  • 33 unit tests, zero Spark dependency to run them — pytest tests/ -v (freshness/volume/schema-diff checks are pure Python and fully covered; only the Spark/Delta glue in runner.py needs a live cluster)
  • Lint-clean (ruff check dashobserve/), PEP 561 typed (py.typed)
  • Every change ships through a reviewed pull request; CI (lint → test on Python 3.9–3.12 → build) gates every PR and every release
  • See CONTRIBUTING.md for dev setup, CHANGELOG.md for release history, SECURITY.md to report a vulnerability, and CODE_OF_CONDUCT.md

License

Apache 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dash_observe-0.1.5.tar.gz (122.9 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dash_observe-0.1.5-py3-none-any.whl (15.1 kB view details)

Uploaded Python 3

File details

Details for the file dash_observe-0.1.5.tar.gz.

File metadata

  • Download URL: dash_observe-0.1.5.tar.gz
  • Upload date:
  • Size: 122.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for dash_observe-0.1.5.tar.gz
Algorithm Hash digest
SHA256 12336cef553b656c12ecbc44723e89276b94bfa4d54f5d420185b76823c5cd2a
MD5 dad649860f4ea33a866ed25f1c8ee2dd
BLAKE2b-256 4ef349c3a89a4d6a648935e69e0d92031b258d7a1ae45cb20ab951af009a4399

See more details on using hashes here.

Provenance

The following attestation bundles were made for dash_observe-0.1.5.tar.gz:

Publisher: release.yml on dash-libs/dash-observe

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file dash_observe-0.1.5-py3-none-any.whl.

File metadata

  • Download URL: dash_observe-0.1.5-py3-none-any.whl
  • Upload date:
  • Size: 15.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for dash_observe-0.1.5-py3-none-any.whl
Algorithm Hash digest
SHA256 fdd2a4d584b1301670b54ecb5ef210e7480d4e4c9c35f11dcb018ef57c9c7f92
MD5 87b75d9f692621414033be06fdc68851
BLAKE2b-256 afa7f9144e8cf14842baa1e34a99348a49863d89385ae2170cbef666e60d6ff0

See more details on using hashes here.

Provenance

The following attestation bundles were made for dash_observe-0.1.5-py3-none-any.whl:

Publisher: release.yml on dash-libs/dash-observe

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page