Skip to main content

Synthesized SDK

Project description

Synthesized

Documentation PyPI codecov Quality Gate Status Technical Debt Supported Python Versions Supported OS


synthesize

Synthesized's Scientific Data Kit (SDK)

The SDK generates high quality, privacy-preserving datasets for machine learning and data science use cases. It's available on PyPi for a free 30-day trial.

Usage

A licence key is required to use the full version of the package. If you don't have one, a free 30-day trial licence key will be provided during the installation. See the comparison table in the documentation for details about the features included in the trial.

Please contact us for more information about obtaining a full licence key.

Installation

It is assumed that you have Python 3.7+ already installed on a Windows, Linux, or MacOS machine.

Before starting, ensure that pip and wheel are installed and up to date.

pip install -U pip wheel

Synthesized can then be installed directly with pip.

pip install synthesized

Setting the licence key

Once you have installed the package, you'll need a licence key to run the software. The quickest way to check if the SDK is working is by running the command:

synth-validate

The first time this is run you will be asked if you have a licence key. If you do not have one simply select "no" and the prompts will guide you in acquiring one by entering your email address.

asciicast

Once you have set your licence key, the SDK will briefly verify the installation was successful.

With the SDK installed you are now able to get synthesizing! Check out our quick start or user guides for ways that the SDK can be put to use.

Dependencies

Below are the minimum dependencies required to run the SDK.

Package Version
faker >=8.0
matplotlib >=3.4
numpy >=1.19.2
pandas >=1.2
prompt-toolkit >=3.0
PyYAML >=5.2
rsa >=4.7
rstr >=2.2
scikit_learn >=0.23
scipy >=1.5
seaborn >=0.11
synthesized_insight >=0.5
tensorflow_privacy <0.8, >=0.6
tensorflow_probability >=0.14.0
tensorflow >=2.6

Additional Technical Details

There is no explicit limit for the size of a dataset, this is limited by the size of the RAM on the machine running the processing. More information can be found in the [Benchmarks(https://docs.synthesized.io/sdk/latest/user_guide/benchmark) section.

The library can use a GPU but it is not required.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

synthesized-2.1-cp39-cp39-win_amd64.whl (5.3 MB view hashes)

Uploaded CPython 3.9 Windows x86-64

synthesized-2.1-cp39-cp39-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (34.2 MB view hashes)

Uploaded CPython 3.9 manylinux: glibc 2.17+ x86-64

synthesized-2.1-cp39-cp39-macosx_10_9_x86_64.whl (6.6 MB view hashes)

Uploaded CPython 3.9 macOS 10.9+ x86-64

synthesized-2.1-cp38-cp38-win_amd64.whl (5.3 MB view hashes)

Uploaded CPython 3.8 Windows x86-64

synthesized-2.1-cp38-cp38-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (34.9 MB view hashes)

Uploaded CPython 3.8 manylinux: glibc 2.17+ x86-64

synthesized-2.1-cp38-cp38-macosx_10_9_x86_64.whl (6.6 MB view hashes)

Uploaded CPython 3.8 macOS 10.9+ x86-64

synthesized-2.1-cp37-cp37m-win_amd64.whl (5.2 MB view hashes)

Uploaded CPython 3.7m Windows x86-64

synthesized-2.1-cp37-cp37m-manylinux_2_17_x86_64.manylinux2014_x86_64.whl (31.2 MB view hashes)

Uploaded CPython 3.7m manylinux: glibc 2.17+ x86-64

synthesized-2.1-cp37-cp37m-macosx_10_9_x86_64.whl (6.5 MB view hashes)

Uploaded CPython 3.7m macOS 10.9+ x86-64

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page