Analytic generation of datasets with specified statistical characteristics.
Project description
Analytic generation of datasets with specified statistical characteristics.
Introduction
analytics-dataset provides a set of functionality to enable the specification and generation of a wide range of datasets with specified statistical characteristics. Specification to include the predictor matrix and the response vector
Examples include:
- High correlation and multi-collinearity among predictor variables
- Interaction effects between variables
- Skewed distributions of predictor and response variables
- Nonlinear relationships between predictor and response variables
Research existing automate dataset functionality
- Sklearn Make Datasets functionality
- MIT Synthetic Data Vault project
- MIT Data to AI Lab
- datacebo
- 2016 IEEE conference paper, The Synthetic Data Vault.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
analyticsdf-0.0.4.3.tar.gz
(3.9 kB
view hashes)
Built Distribution
Close
Hashes for analyticsdf-0.0.4.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1cfa32865601f57c4478dfeb21a1f503a12e620229bb7f444590d5ac4ffde9f7 |
|
MD5 | f761b52db273a2878d6da7bf86ca7748 |
|
BLAKE2b-256 | 00fbb7da9c1be29029dcaec5e72d129588c0f82e44585acc3ff1f473f89e69c1 |