Python package: genesynth
[![Python packages](https://github.com/sterling312/genesynth/actions/workflows/github-actions-unittests.yaml/badge.svg)](https://github.com/sterling312/genesynth/actions/workflows/github-actions-unittests.yaml)
# genesynth

A library to synthetically generate data for declarative data structures.
# install

```
pip install -r requirements.txt
```
# example

```
>>> import asyncio
>>> from genesynth.orchestration import *
>>> pipe = Orchestration.read_yaml('tests/graph.yaml')
>>> pipe.run()
>>> asyncio.run(pipe.root.save('graph.csv'))
```
```
$ pip install genesynth
$ python -m genesynth.cli -f tests/test.yaml --stdout
```
# project status
## supported features

* load yaml as configuration file
* arbitrary row size support
* data type mapping with configurable parameters
* JSON (semi-structured data) support
* improved data type support
* foreign relationship support
* DOT file graph
* table graph
* built-in orchestrator using graph
* thread and process support
* intermediary data temporary cache
* graph visualization
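Foreign relationships combined with a table graph usually mean that a referenced table has to be generated before the table that points to it. The following is a minimal sketch of that ordering idea, using the standard library's `graphlib` (Python 3.9+) and not genesynth's own orchestrator. The table names and the `generation_order` and `generate` helpers are hypothetical and are not part of the genesynth API.

```python
import random
from graphlib import TopologicalSorter

# Hypothetical table graph: each table maps to the set of tables it references.
dependencies = {
    "users": set(),
    "products": set(),
    "orders": {"users", "products"},
}


def generation_order(deps):
    """Return table names so that referenced tables come before referencing ones."""
    return list(TopologicalSorter(deps).static_order())


def generate(deps, rows=3, seed=0):
    """Generate toy rows per table, drawing foreign keys from already generated parents."""
    rng = random.Random(seed)
    data = {}
    for table in generation_order(deps):
        parents = sorted(deps[table])
        data[table] = [
            # Each foreign key column picks the id of an existing parent row.
            {"id": i, **{f"{p}_id": rng.choice(data[p])["id"] for p in parents}}
            for i in range(1, rows + 1)
        ]
    return data
```

Calling `generate(dependencies)` returns a dictionary of row lists in which every `users_id` and `products_id` in `orders` refers to a row that already exists in its parent table.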
## key features to add

* add yaml validator
* fix header support
* additional output file formats (JSON, PSQL dump, CSV with quotes, etc.)
* add support for JSON arrays
* improve constraint support
* add support for quoted strings
* add support for statistical distributions via kernel convolution
* optimize orchestration and disk cache efficiency
* optimize thread/process based generation
* convert serial to an autoincrement constraint on the integer type
* convert password to a constraint on the string type
## nice to have features to add

* support external scheduler
* support NLP based text generation
* support sklearn
* support integration with pytorch embedding
* support for object reference via $ref
* fix compatibility with [JSON schema array notation](https://json-schema.org/understanding-json-schema/reference/array.html#items)
* fix the case where a JSON array child object appears as separate items
Hashes for genesynth-0.2.1-py3-none-any.whl

Algorithm | Hash digest
---|---
SHA256 | 2bb58f23e2b9bba2e661529b2a12b455f0ffc272e7e8c0957bde09e874b3e6d9
MD5 | 1470cb4e156eb2a690f87b1eae0a38df
BLAKE2b-256 | 749459adbe2ec41ea197f73a215d964f422549b23cba70f5eab234edad1f73cc
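
A downloaded wheel can be checked against the SHA256 digest listed above. This is a minimal sketch using only the standard library's `hashlib`. The `sha256_of` and `verify` helper names are illustrative, and the file path in the usage note assumes the wheel was saved to the current directory.

```python
import hashlib

# SHA256 digest published for genesynth-0.2.1-py3-none-any.whl (copied from the table above).
EXPECTED_SHA256 = "2bb58f23e2b9bba2e661529b2a12b455f0ffc272e7e8c0957bde09e874b3e6d9"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest matches the expected value."""
    return sha256_of(path) == expected.lower()
```

For example, `verify("genesynth-0.2.1-py3-none-any.whl")` should return `True` for an intact copy of that wheel, and `False` if the file was corrupted or altered.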