Skip to main content

Toolkit for generating fake datasets.

Project description

Fake Dataset

image

Toolkit for generating fake datasets.

How to Install

pip install fake-dataset

Usage

>>> from fake_dataset import columns, generators

>>> data_gen = generators.DataGenerator(
...    vehicle=columns.CategoricalRandomColumn(categories=["car", "bus", "bicycle"], missing_rate=(0.2, 0.5), na_value="NA"),
...    year=columns.IntegerRandomColumn(values_range=(1950, 2010), missing_rate=(0.1, 0.2)),
...    value=columns.FloatRandomColumn(values_range=(10e4, 10e5), missing_rate=(0.0, 0.0)),
...    )

>>> data_gen.sample(3)
           value vehicle  year
0  823994.355388     car  2002
1  903007.903927      NA  1952
2  435372.320886      NA  None

Credits

This package was created with Cookiecutter and the giswqs/pypackage project template.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

fake_dataset-0.0.2.tar.gz (4.4 kB view hashes)

Uploaded Source

Built Distribution

fake_dataset-0.0.2-py2.py3-none-any.whl (4.9 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page