an elegant datasets factory
Project description
an elegant datasets factory
Free software: MIT license
Documentation: https://rawbuilder.readthedocs.io.
Features
Schema oriented datasets builder
How to Use it
Schema
The Schema is a JSON object that describes three main components.
The model names, the column names, and the data source per column.
Note the below code-block, The model name is “Student”, and it contain 4 properties [id,first_name,email,math_test_results].
Each property of the model “student” has its data source description.
The builder will use all the information in the schema to build the required tasks or data sets.
Student data model example
Data types to can use in the schema
int: build a list of integers between 1 and requested dataset size.
decrement: build a list of decremented integers between the requested size and 1.
random_int: build a list of random integers between 0 and 100 by default.
first_name: build a list of first names.
last_name: build a list of last names.
email: build a list of fake emails.
Data Modifiers
Combine Data Modifiers to the above data types, it can adjust values, change the data nature, and gives more control over the final output.
Modifiers syntax is simple:
Use the modifier between to generate random integer column between 0 and 30:*
All Modifiers
1) between
Use this modifier to set the high-end and low-end for a specific data type.
Syntax:
Supported with: random_int:
History
0.0.4 (2021-11-13)
Data modifiers
0.0.3 (2021-11-05)
Migrate to JSON
Generate simple datasets
0.0.2 (2021-11-05)
Proof of concept
0.0.1 (2021-10-24)
First release on PyPI.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for rawbuilder-0.0.4-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 658dd5f761d5c1913be7e83dc80ac9c9ac30f6f0bee5a2e0387fa28298f9f96b |
|
MD5 | b601d240355800bd54b0d2f244a786be |
|
BLAKE2b-256 | 0b2281b2e857fd04fb43e17469f90c3958d419281fd3d06cd945f24642a727d8 |