Synthetic data generation for Databricks — with a built-in notebook UI

DashSynthetic — Databricks Library

Part of the Dashlibs suite — Databricks libraries built for business users.

Installation

%pip install dash-synthetic

Quick Start

import dashsynthetic
dashsynthetic.launch()   # Opens interactive UI in your Databricks notebook

The UI has two tabs:

  • Single Table — profile a source table/DataFrame/SQL query and generate synthetic data from it.
  • Multi-Table Relationships — define multiple tables, their primary keys, foreign keys, and master data columns (e.g. currency/country codes); the tool figures out the dependency order and generates every table with referentially valid foreign keys.
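
The dependency ordering mentioned above can be illustrated with a topological sort over the foreign-key graph. This is a minimal sketch using only the Python standard library (`graphlib`, Python 3.9+); the function name `generation_order` and its arguments are hypothetical, and nothing here is taken from how dash-synthetic implements it internally.

```python
from graphlib import CycleError, TopologicalSorter


def generation_order(tables, foreign_keys):
    """Return table names so that every parent table precedes its children.

    tables: all table names, including those without any foreign key.
    foreign_keys: iterable of (child_table, parent_table) pairs.
    Both arguments are hypothetical shapes chosen for this illustration.
    """
    sorter = TopologicalSorter()
    for name in tables:
        sorter.add(name)  # register tables that have no dependencies
    for child, parent in foreign_keys:
        sorter.add(child, parent)  # the child depends on its parent
    # static_order() raises graphlib.CycleError on circular references
    return list(sorter.static_order())


order = generation_order(["Customer", "Account"], [("Account", "Customer")])
print(order)  # ['Customer', 'Account']
```

Generating parents first means that every foreign-key value a child table needs already exists by the time that child is generated. A circular reference between tables has no valid order, which is why the sketch lets `CycleError` propagate instead of guessing.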

What it looks like

Single Table — profile a source and generate synthetic data from it:

[Screenshot: DashSynthetic single-table tab]

Multi-Table Relationships — define tables, primary/foreign keys, and master data columns:

[Screenshot: DashSynthetic multi-table relationships tab]

Python API

from dashsynthetic import RelationshipGraph, MultiTableGenerator

# Describe the source tables and how they relate.
graph = RelationshipGraph()
graph.add_table("Customer", table="catalog.schema.dim_customer", primary_key="customer_id")
# master_data_columns marks reference-style columns such as currency or country codes.
graph.add_table("Account", table="catalog.schema.fact_account", primary_key="account_id",
                master_data_columns=["currency_code"])
# Account.customer_id references Customer.customer_id.
graph.add_foreign_key("Account", "customer_id", "Customer", "customer_id")

# Set per-table row counts and, where wanted, an output table.
gen = MultiTableGenerator(graph)
gen.configure_table("Customer", n_rows=5000)
gen.configure_table("Account", n_rows=20000, output_table="catalog.schema.syn_account")
results = gen.run()   # {"Customer": df, "Account": df}, generated in dependency order
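
To make "referentially valid foreign keys" concrete, here is a small standalone sketch of the idea: child rows draw their foreign-key values only from keys that already exist in the parent. It uses plain Python lists of dicts in place of DataFrames, and the helpers `generate_parent`, `generate_child`, and `orphan_count` are hypothetical names for this illustration, not part of the dash-synthetic API or a description of its internals.

```python
import random


def generate_parent(n_rows, key_name):
    """Build parent rows with sequential primary keys 1..n_rows."""
    return [{key_name: i} for i in range(1, n_rows + 1)]


def generate_child(n_rows, key_name, fk_name, parent_rows, parent_key, seed=0):
    """Build child rows whose foreign key is sampled from existing parent keys."""
    rng = random.Random(seed)  # seeded so the sketch is reproducible
    parent_keys = [row[parent_key] for row in parent_rows]
    return [
        {key_name: i, fk_name: rng.choice(parent_keys)}
        for i in range(1, n_rows + 1)
    ]


def orphan_count(child_rows, fk_name, parent_rows, parent_key):
    """Count child rows whose foreign key has no matching parent key."""
    valid = {row[parent_key] for row in parent_rows}
    return sum(1 for row in child_rows if row[fk_name] not in valid)


customers = generate_parent(50, "customer_id")
accounts = generate_child(200, "account_id", "customer_id", customers, "customer_id")
orphans = orphan_count(accounts, "customer_id", customers, "customer_id")
print(orphans)  # 0
```

A check in the spirit of `orphan_count` is a reasonable sanity test to run against generated output, whichever tool produced it.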

Part of Dashlibs

Library          Purpose
dash-dq          Data Quality
dash-synthetic   Synthetic Data Generation
dash-observe     Data Observability (freshness, volume, schema)
dash-ml          ML Model Monitoring
dash-ingest      Data Ingestion
dash-gov         Data Governance
dash-relate      Ontology & Lineage for AI
dash-ui          Shared UI components (PyPI: dash-uis)

License

Apache 2.0
