Analytic generation of datasets with specified statistical characteristics.
Project description
Analytic generation of datasets with specified statistical characteristics.
Introduction
analytics-dataset provides a set of functionality to enable the specification and generation of a wide range of datasets with specified statistical characteristics. Specification to include the predictor matrix and the response vector. Check the analyticsdf documentation for more details. Examples include:
- High correlation and multi-collinearity among predictor variables
- Interaction effects between variables
- Skewed distributions of predictor and response variables
- Nonlinear relationships between predictor and response variables
Research existing automate dataset functionality
- Sklearn Make Datasets functionality
- MIT Synthetic Data Vault project
- MIT Data to AI Lab
- datacebo
- 2016 IEEE conference paper, The Synthetic Data Vault.
Public Package
This repo has published beta packages on both Pypi and Anaconda
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
analyticsdf-0.0.7.tar.gz
(10.2 kB
view hashes)
Built Distribution
Close
Hashes for analyticsdf-0.0.7-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c3736232d8e279b2427a1578ef85f8f72365f720e0aacef44e7525cccaf96caa |
|
MD5 | 09d7c85d0128a50b7888f3bd83ad2206 |
|
BLAKE2b-256 | 26e7863a97f29bc2cdbd0ee6a9f934c00a5fc122fd20cd522eeb71361e5eea5a |