Toolkit for generating fake datasets.
Project description
Fake Dataset
Toolkit for generating fake datasets.
- Free software: MIT license
- Documentation: https://matheusfsa.github.io/fake_dataset
How to Install
pip install fake-dataset
Usage
>>> from fake_dataset import columns, generators
>>> data_gen = generators.DataGenerator(
... vehicle=columns.CategoricalRandomColumn(categories=["car", "bus", "bicycle"], missing_rate=(0.2, 0.5), na_value="NA"),
... year=columns.IntegerRandomColumn(values_range=(1950, 2010), missing_rate=(0.1, 0.2)),
... value=columns.FloatRandomColumn(values_range=(10e4, 10e5), missing_rate=(0.0, 0.0)),
... )
>>> data_gen.sample(3)
value vehicle year
0 823994.355388 car 2002
1 903007.903927 NA 1952
2 435372.320886 NA None
Credits
This package was created with Cookiecutter and the giswqs/pypackage project template.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
fake_dataset-0.0.2.tar.gz
(4.4 kB
view hashes)
Built Distribution
Close
Hashes for fake_dataset-0.0.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f055e5801312ccd2cd044fe891f8000d9af9b68dc85ebb18e7b569156fbe86a9 |
|
MD5 | 1895afe15f9c68a3d357ba038b4e8f3f |
|
BLAKE2b-256 | 54264b195d3b85b3bac3a7c3c728dbe8eb1799b1707615548f6870418ac9310b |