Synthetic data generation for Databricks — with a built-in notebook UI

Dashsynthetic — Databricks Library

Part of the Dashlibs suite — Databricks libraries built for business users.

Installation

%pip install dash-synthetic

Quick Start

import dashsynthetic
dashsynthetic.launch()   # Opens interactive UI in your Databricks notebook

The UI has two tabs:

  • Single Table — profile a source table/DataFrame/SQL query and generate synthetic data from it.
  • Multi-Table Relationships — define multiple tables, their primary keys, foreign keys, and master data columns (e.g. currency/country codes); the tool figures out the dependency order and generates every table with referentially valid foreign keys.
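The dependency ordering and foreign-key handling described for the Multi-Table Relationships tab can be illustrated with a small standalone sketch. This is not dashsynthetic's implementation: the `SCHEMA` layout and the `dependency_order` and `generate` helpers are hypothetical names, and the approach assumed here (topological sort, then drawing each foreign key from the already generated parent keys) is just one plausible way to get referentially valid output.

```python
import random
from graphlib import TopologicalSorter  # standard library, Python 3.9+

# Hypothetical schema for illustration only (not dashsynthetic's format).
# Each foreign key is a (child_column, parent_table) pair.
SCHEMA = {
    "Customer": {"primary_key": "customer_id", "foreign_keys": []},
    "Account": {
        "primary_key": "account_id",
        "foreign_keys": [("customer_id", "Customer")],
    },
}


def dependency_order(schema):
    """Return table names with every parent listed before its children."""
    sorter = TopologicalSorter()
    for table, spec in schema.items():
        parents = [parent for _, parent in spec["foreign_keys"]]
        sorter.add(table, *parents)
    return list(sorter.static_order())


def generate(schema, n_rows, seed=0):
    """Generate rows per table, drawing foreign keys from parent keys."""
    rng = random.Random(seed)
    tables = {}
    for table in dependency_order(schema):
        spec = schema[table]
        # Parents are already generated, so their keys can be collected once.
        parent_keys = {
            column: [row[schema[parent]["primary_key"]] for row in tables[parent]]
            for column, parent in spec["foreign_keys"]
        }
        rows = []
        for i in range(1, n_rows[table] + 1):
            row = {spec["primary_key"]: i}
            for column, keys in parent_keys.items():
                row[column] = rng.choice(keys)  # always an existing parent key
            rows.append(row)
        tables[table] = rows
    return tables


tables = generate(SCHEMA, {"Customer": 5, "Account": 20})
```

Because children are generated only after their parents, every foreign key value is picked from keys that already exist, so no orphaned rows can appear. A circular dependency between tables would make `static_order()` raise `graphlib.CycleError` in this sketch.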

Python API

from dashsynthetic import RelationshipGraph, MultiTableGenerator

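# Declare each table with its source table and primary key.
graph = RelationshipGraph()
graph.add_table("Customer", table="catalog.schema.dim_customer", primary_key="customer_id")
graph.add_table("Account", table="catalog.schema.fact_account", primary_key="account_id",
                master_data_columns=["currency_code"])  # master data, e.g. currency codes
# Link Account.customer_id to Customer.customer_id.
graph.add_foreign_key("Account", "customer_id", "Customer", "customer_id")

# Set the number of rows to generate for each table.
gen = MultiTableGenerator(graph)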
gen.configure_table("Customer", n_rows=5000)
gen.configure_table("Account", n_rows=20000, output_table="catalog.schema.syn_account")
results = gen.run()   # {"Customer": df, "Account": df}, generated in dependency order
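
One way to sanity-check generated tables is to look for orphaned foreign keys, meaning child values with no matching parent key. The sketch below is a hypothetical helper, not part of dashsynthetic. Since the DataFrame type returned by `gen.run()` is not stated here, it works on plain Python lists of key values, and collecting the key columns into lists is left to the caller.

```python
def orphaned_keys(child_keys, parent_keys):
    """Return the sorted child key values that have no match in parent_keys.

    None values are skipped, on the assumption that a null foreign key
    means "no parent" rather than a broken reference.
    """
    parents = set(parent_keys)
    return sorted(
        {key for key in child_keys if key is not None and key not in parents}
    )


# Illustrative values only, not output from dashsynthetic.
customer_ids = [1, 2, 3]
account_customer_ids = [1, 1, 3, 2]
assert orphaned_keys(account_customer_ids, customer_ids) == []
```

An empty result means every non-null foreign key value points at an existing parent row.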

Part of Dashlibs

Library          Purpose
dash-dq          Data Quality
dash-synthetic   Synthetic Data Generation
dash-ml          ML Model Monitoring
dash-ingest      Data Ingestion
dash-gov         Data Governance
dash-relate      Ontology & Lineage for AI

License

Apache 2.0
