Synthetic data quality evaluation with Polars-native performance and interactive HTML reports.
Project description
dataxid-syntheval
Synthetic data quality evaluation — compare original and synthetic datasets with interactive HTML reports.
Quickstart
import polars as pl
from dataxid_syntheval import SynthEval
original = pl.read_csv("original.csv")
synthetic = pl.read_csv("synthetic.csv")
se = SynthEval(original=original, synthetic=synthetic)
se.to_html("report.html")
Programmatic access:
diffs = se.diff
diffs["column_diffs"] # per-column stat deltas
diffs["alert_diff"] # new / resolved alerts
diffs["distribution_overlays"] # histogram & frequency overlays
diffs["correlation_diffs"] # correlation matrix differences
Features
- Column-level stat comparison — mean, std, median, min/max, missing %, distinct count and more
- Alert change detection — new and resolved data quality alerts between profiles
- Distribution overlays — proportion-based histograms and categorical frequency charts for fair comparison across different dataset sizes
- Correlation matrix diffs — Pearson, Spearman, Kendall, Cramér's V, Phik
- Interactive HTML report — tabbed column comparison, ECharts visualizations, lazy chart rendering
- Built on dataxid-profiling and Polars
Installation
pip install dataxid-syntheval
Contributing
Contributions are welcome. See CONTRIBUTING.md for details.
Links
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file dataxid_syntheval-0.1.0.tar.gz.
File metadata
- Download URL: dataxid_syntheval-0.1.0.tar.gz
- Upload date:
- Size: 208.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.22 {"installer":{"name":"uv","version":"0.9.22","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
134a3bb95da92255680ab3e9550bf44a185598c7d972246a830b1247e453699b
|
|
| MD5 |
4bb6d78a59b72958fad173cf755edc73
|
|
| BLAKE2b-256 |
9b829160b6233d7a49557087f8962fb8714d3cd2b857b6936e8c91940f8cb25a
|
File details
Details for the file dataxid_syntheval-0.1.0-py3-none-any.whl.
File metadata
- Download URL: dataxid_syntheval-0.1.0-py3-none-any.whl
- Upload date:
- Size: 90.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.9.22 {"installer":{"name":"uv","version":"0.9.22","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
482c8b86d6feaea643a13e453014b999aced6983f624f45f6f44ff0e83c836e8
|
|
| MD5 |
d08a8f378f795fd73be17234dd0f371e
|
|
| BLAKE2b-256 |
b630f25df36e523bfdbc3dc07dbc3570d7a745deb3b35653a403d712a1faee9b
|