Skip to main content

YData SDK allows to use the *Data-Centric* tools from the YData ecosystem to accelerate AI development

Project description

YData SDK

YData Logo

pypi Pythonversion downloads


🎊 YData SDK for improved data quality everywhere!

ydata-sdk v0.1.0 is here! Create a YData account so you can start using today!


Documentation | More on YData

Overview

The YData SDK is an ecosystem of methods that allows users to, through a python interface, adopt a Data-Centric approach towards the AI development. The solution includes a set of integrated components for data ingestion, standardized data quality evaluation and data improvement, such as synthetic data generation, allowing an iterative improvement of the datasets used in high-impact business applications.

Synthetic data can be used as Machine Learning performance enhancer, to augment or mitigate the presence of bias in real data. Furthermore, it can be used as a Privacy Enhancing Technology, to enable data-sharing initiatives or even to fuel testing environments.

Under the YData SDK hood, you can find a set of algorithms and metrics based on statistics and deep learning based techniques, that will help you to accelerate your data preparation.

What you can expect:

YData SDK is composed by the following main modules:

  • Datasources

    • YData’s SDK includes several connectors for easy integration with existing data sources. It supports several storage types, like filesystems and RDBMS. Check the list of connectors.
    • SDK’s Datasources run on top of Dask, which allows it to deal with not only small workloads but also larger volumes of data.
  • Synthesizers

    • Simplified interface to train a generative model and learn in a data-driven manner the behavior, the patterns and original data distribution. Optimize your model for privacy or utility use-cases.
    • From a trained synthesizer, you can generate synthetic samples as needed and parametrise the number of records needed.
  • Synthetic data quality report Coming soon

    • An extensive synthetic data quality report that measures 3 dimensions: privacy, utility and fidelity of the generated data. The report can be downloaded in PDF format for ease of sharing and compliance purposes or as a JSON to enable the integration in data flows.
  • Profiling Coming soon

    • A set of metrics and algorithms summarizes datasets quality in three main dimensions: warnings, univariate analysis and a multivariate perspective.

Supported data formats

  • Tabular The RegularSynthesizer is perfect to synthesize high-dimensional data, that is time-independent with high quality results.
  • Time-Series The TimeSeriesSynthesizer is perfect to synthesize both regularly and not evenly spaced time-series, from smart-sensors to stock.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

ydata_sdk-0.13.1-py312-none-any.whl (146.6 kB view details)

Uploaded Python 3.12

ydata_sdk-0.13.1-py311-none-any.whl (151.3 kB view details)

Uploaded Python 3.11

ydata_sdk-0.13.1-py310-none-any.whl (124.0 kB view details)

Uploaded Python 3.10

ydata_sdk-0.13.1-py39-none-any.whl (122.9 kB view details)

Uploaded Python 3.9

ydata_sdk-0.13.1-py38-none-any.whl (123.1 kB view details)

Uploaded Python 3.8

File details

Details for the file ydata_sdk-0.13.1-py312-none-any.whl.

File metadata

  • Download URL: ydata_sdk-0.13.1-py312-none-any.whl
  • Upload date:
  • Size: 146.6 kB
  • Tags: Python 3.12
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for ydata_sdk-0.13.1-py312-none-any.whl
Algorithm Hash digest
SHA256 283ced4c5be0e37e1cf70c3491cc4e1aa0a073c8eb3246f350ebbced9d2027d8
MD5 cd5f2ef643e2a48fc5249bf7f0d759e5
BLAKE2b-256 5b41adfeadfe1f1bfa244e33be427a24d03509546225d4643a96d453d12f87c2

See more details on using hashes here.

File details

Details for the file ydata_sdk-0.13.1-py311-none-any.whl.

File metadata

  • Download URL: ydata_sdk-0.13.1-py311-none-any.whl
  • Upload date:
  • Size: 151.3 kB
  • Tags: Python 3.11
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for ydata_sdk-0.13.1-py311-none-any.whl
Algorithm Hash digest
SHA256 914400ee2ceb705cda1a8c958570d6baa85a5f07c5ab58d15f0aaec9c0b19fbd
MD5 97da6b9a3e307d0cf5a116065e3954be
BLAKE2b-256 b534579848f6cbcdd31428c585627b677a9f65957705cdaad4f12a48189f73b5

See more details on using hashes here.

File details

Details for the file ydata_sdk-0.13.1-py310-none-any.whl.

File metadata

  • Download URL: ydata_sdk-0.13.1-py310-none-any.whl
  • Upload date:
  • Size: 124.0 kB
  • Tags: Python 3.10
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for ydata_sdk-0.13.1-py310-none-any.whl
Algorithm Hash digest
SHA256 78d2db33e82d6d4671f5e541fadedd1d96fe27055350b1ffde2c838f9a6cc5a7
MD5 9e76c7574f1381cd9ec9cb9247cfe158
BLAKE2b-256 5d4c4a548e245fd9d4e51bc17a23863f8648a288abd095a12b25ae744cc3ba1e

See more details on using hashes here.

File details

Details for the file ydata_sdk-0.13.1-py39-none-any.whl.

File metadata

  • Download URL: ydata_sdk-0.13.1-py39-none-any.whl
  • Upload date:
  • Size: 122.9 kB
  • Tags: Python 3.9
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for ydata_sdk-0.13.1-py39-none-any.whl
Algorithm Hash digest
SHA256 d7e1c92ea035387b8649e1d9c5bf2df11d9c385316644bbe2a1df0773648826e
MD5 a4fefb2eb8217e3e075e977c64876f09
BLAKE2b-256 9f961c10a9820319070946e4cc3df3c2f3db9c5fe5bb9607b8a73e34ce5b6f1c

See more details on using hashes here.

File details

Details for the file ydata_sdk-0.13.1-py38-none-any.whl.

File metadata

  • Download URL: ydata_sdk-0.13.1-py38-none-any.whl
  • Upload date:
  • Size: 123.1 kB
  • Tags: Python 3.8
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for ydata_sdk-0.13.1-py38-none-any.whl
Algorithm Hash digest
SHA256 168616310785ccd0e6d66b1560db63cd805ba615076076396907b88b979a6f54
MD5 38bfd9956324cd5fb9607e02834fdd58
BLAKE2b-256 b17905d33c1b8faeea4e3dc273362d0564731821c88c414c381fe44cf2bc8e13

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page