Synthetic data quality evaluation with Polars-native performance and interactive HTML reports.

dataxid-syntheval

Synthetic data quality evaluation — compare original and synthetic datasets with interactive HTML reports.

Quickstart

import polars as pl
from dataxid_syntheval import SynthEval

original = pl.read_csv("original.csv")
synthetic = pl.read_csv("synthetic.csv")

se = SynthEval(original=original, synthetic=synthetic)
se.to_html("report.html")

Programmatic access:

diffs = se.diff
diffs["column_diffs"]          # per-column stat deltas
diffs["alert_diff"]            # new / resolved alerts
diffs["distribution_overlays"] # histogram & frequency overlays
diffs["correlation_diffs"]     # correlation matrix differences
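
Since `se.diff` is indexed like a dictionary above, a small helper can report which of the four documented sections are present before a script relies on them. This is a minimal sketch under stated assumptions: `summarize_diff` is a hypothetical helper and not part of dataxid-syntheval, the diff object is assumed to support `in` and `[]` like a mapping, and the shape of each section's value is not documented here, so the code only reports its type and length. The stand-in dictionary exists purely to make the example runnable.

```python
from collections.abc import Mapping

# The four section names shown in the snippet above.
DOCUMENTED_KEYS = (
    "column_diffs",
    "alert_diff",
    "distribution_overlays",
    "correlation_diffs",
)


def summarize_diff(diffs: Mapping) -> dict:
    """Hypothetical helper: describe each documented section of a diff mapping.

    Returns {key: description}, where description is "missing" or the
    value's type name, plus its length when the value has one.
    """
    summary = {}
    for key in DOCUMENTED_KEYS:
        if key not in diffs:
            summary[key] = "missing"
            continue
        value = diffs[key]
        description = type(value).__name__
        if hasattr(value, "__len__"):
            description += f" (len={len(value)})"
        summary[key] = description
    return summary


# Stand-in data only; a real call would pass `se.diff` instead.
example = {"column_diffs": {"age": {}, "income": {}}, "alert_diff": []}
for key, description in summarize_diff(example).items():
    print(f"{key}: {description}")
```

Checking for missing sections first keeps a downstream script from failing with a `KeyError` if a given report does not include every section.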

Features

  • Column-level stat comparison — mean, std, median, min/max, missing %, distinct count and more
  • Alert change detection — new and resolved data quality alerts between profiles
  • Distribution overlays — proportion-based histograms and categorical frequency charts for fair comparison across different dataset sizes
  • Correlation matrix diffs — Pearson, Spearman, Kendall, Cramér's V, Phik
  • Interactive HTML report — tabbed column comparison, ECharts visualizations, lazy chart rendering
  • Built on dataxid-profiling and Polars
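
The proportion-based overlays listed above rest on a simple idea: raw counts are not comparable when the original and synthetic datasets have different numbers of rows, but per-category shares are. The sketch below illustrates that idea with the standard library only. It is not the library's implementation, and `proportions` and `proportion_gap` are hypothetical names introduced for illustration.

```python
from collections import Counter


def proportions(values):
    """Map each distinct value to its share of the rows (0.0 to 1.0)."""
    counts = Counter(values)
    total = sum(counts.values())
    if total == 0:
        return {}
    return {value: count / total for value, count in counts.items()}


def proportion_gap(original, synthetic):
    """Per-category difference in share: synthetic minus original.

    Categories missing from one side are treated as having share 0.0.
    """
    p_orig = proportions(original)
    p_synth = proportions(synthetic)
    # Sort by string form so mixed-type categories still order cleanly.
    categories = sorted(set(p_orig) | set(p_synth), key=str)
    return {c: p_synth.get(c, 0.0) - p_orig.get(c, 0.0) for c in categories}


# Illustrative data: 8 original rows vs. 4 synthetic rows.
original = ["a"] * 4 + ["b"] * 4
synthetic = ["a"] * 3 + ["b"] * 1

print(proportion_gap(original, synthetic))
```

In this made-up data the raw count of `a` drops from 4 to 3, yet its share rises from 0.50 to 0.75, which is the kind of shift a count-based overlay would hide.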

Installation

pip install dataxid-syntheval

Contributing

Contributions are welcome. See CONTRIBUTING.md for details.

License

Apache-2.0
