A generator of random SQL SELECT queries, mainly for comparing query responses from a real dataset against those from a synthetic dataset.

Fuzzy SQL

The package generates syntactically and semantically correct random SQL SELECT statements. It is developed by EHIL, mainly to check query responses from synthetic data against those from real data. The package supports both tabular and longitudinal datasets. Table shapes, variable names, and relations must be identical in the real and synthetic datasets.
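The idea of running one random query against both a real and a synthetic dataset can be illustrated with a minimal sketch. It uses only Python's built-in sqlite3 module and does not use or represent the fuzzy-sql API. The table name patients, its columns, the sample rows, and the helper functions make_db, random_query, and run_on_both are all hypothetical and chosen for illustration only.

```python
import random
import sqlite3


def make_db(rows):
    """Build an in-memory database with a hypothetical 'patients' table."""
    conn = sqlite3.connect(":memory:")
    conn.execute("CREATE TABLE patients (age INTEGER, sex TEXT, los INTEGER)")
    conn.executemany("INSERT INTO patients VALUES (?, ?, ?)", rows)
    return conn


def random_query(rng):
    """Assemble a random but valid aggregate SELECT over the shared schema."""
    agg = rng.choice(["COUNT(*)", "AVG(los)", "MAX(age)"])
    op = rng.choice(["<", ">=", "<>"])
    threshold = rng.randint(20, 80)
    return (
        f"SELECT sex, {agg} FROM patients "
        f"WHERE age {op} {threshold} GROUP BY sex ORDER BY sex"
    )


def run_on_both(real, synthetic, query):
    """Run the same query on both databases and return both result sets."""
    return real.execute(query).fetchall(), synthetic.execute(query).fetchall()


# Illustrative data only: both tables share the same shape and column names.
real_db = make_db([(34, "F", 3), (61, "M", 7), (45, "F", 2), (72, "M", 9)])
synthetic_db = make_db([(36, "F", 4), (58, "M", 6), (47, "F", 2), (70, "M", 8)])

rng = random.Random(42)  # fixed seed so the same queries can be regenerated
for _ in range(3):
    query = random_query(rng)
    real_rows, synthetic_rows = run_on_both(real_db, synthetic_db, query)
    print(query)
    print("  real:     ", real_rows)
    print("  synthetic:", synthetic_rows)
```

Because both databases share the same table shape and column names, every generated query is valid on both, which is why the real and synthetic schemas have to match.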

To install:

pip install fuzzy-sql

For further details, please refer to the Documentation.

Detailed examples covering three sample datasets are provided in the examples folder of the repository. To generate the random queries, first download the sample data by running 0.0-download_data.ipynb. Then work through the remaining notebooks to construct the necessary databases and generate the random queries. For details, refer to the usage/code-examples subsection of the Documentation.

Download files

Source Distribution

fuzzy-sql-2.0.5.tar.gz (23.2 kB)

Built Distribution

fuzzy_sql-2.0.5-py3-none-any.whl (23.5 kB, Python 3)
