A generator of random SQL SELECT queries mainly to compare responses from a real dataset against that from a synthetic dataset.
Project description
Fuzzy SQL
The package generates semantically and syntactically correct random SELECT SQL Statements. It is developed by EHIL mainly to check the query response from synthetic data against that of real data. The package supports both tabular and longitudinal datasets. Table shapes, variable names and relations in both real and synthetic datasets shall be identical.
To install:
pip install fuzzy-sql
For further details, please refer to the Documentation.
Detailed examples comprising three sample datasets are provided under the examples folder in the repository. To generate the random queries, you first need to download the sample data by running 0.0-download_data.ipynb. You may then proceed with the remaining notebooks to construct the necessary databases and generate the random queries. For details, please refer to the usage/code-examples subsection in the Documentation.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for fuzzy_sql-2.0.5-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 615ca3e8c58f8e86a211767038013e4550a1e325e0eef4df9989f86f5f87e41e |
|
MD5 | d7ae001f598f1bdcde0807f37e4a5212 |
|
BLAKE2b-256 | 08da4a8bc122a279ba8cd60f503a3b33b0b7a9aabbd15d7425399aca4a869ab0 |