Sample SQL datasets

datasets

This package makes it easier to work with standard SQL sample datasets.

It comes with 4 datasets:

Installation

The package can be installed with:

pip install qrlew-datasets

The library assumes:

  • either that PostgreSQL is installed,
  • or that Docker is installed and can spawn PostgreSQL containers.
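
To see which of these two assumptions holds on a given machine, a minimal sketch along the following lines can help. It is not part of qrlew-datasets: the function name available_backends is hypothetical, and looking for the psql and docker executables on PATH is only an approximation of "installed" (it does not check that a server is running or that the Docker daemon is reachable).

```python
import shutil


def available_backends(which=shutil.which):
    """Report which of the two supported setups appears to be present.

    Hypothetical helper, not part of the library: it only checks that the
    `psql` and `docker` executables can be found on PATH.
    """
    return {
        "postgresql": which("psql") is not None,
        "docker": which("docker") is not None,
    }


if __name__ == "__main__":
    for name, found in available_backends().items():
        print(f"{name}: {'found' if found else 'not found'}")
```

If neither entry is reported as found, one of the two setups described in the next sections has to be put in place before using the library.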

PostgreSQL in a container

The library spawns the containers automatically; no setup is needed.

Without docker installed

Set up a PostgreSQL server as described in https://colab.research.google.com/github/tensorflow/io/blob/master/docs/tutorials/postgresql.ipynb

You can choose the port to use; the commands below use 5433.

# Inspired by https://colab.research.google.com/github/tensorflow/io/blob/master/docs/tutorials/postgresql.ipynb#scrollTo=YUj0878jPyz7
sudo apt-get -y -qq update
sudo apt-get -y -qq install postgresql-14
# Start postgresql server
# sudo sed -i "s/#port = 5432/port = 5433/g" /etc/postgresql/14/main/postgresql.conf
sudo sed -i "s/port = 5432/port = 5433/g" /etc/postgresql/14/main/postgresql.conf
sudo service postgresql start
# Set password
sudo -u postgres psql -U postgres -c "ALTER USER postgres PASSWORD 'pyqrlew-db'"
# Install python packages
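
Once the server has been started, it can be useful to confirm that something is listening on the chosen port before going further. The sketch below uses only the Python standard library. The port, user and password come from the commands above; the host localhost, the default postgres database and the helper names connection_url and port_is_open are assumptions made for illustration, not part of the library.

```python
import socket

# Port, user and password are taken from the setup commands above.
# The host and the database name are assumptions.
HOST = "localhost"
PORT = 5433
USER = "postgres"
PASSWORD = "pyqrlew-db"


def connection_url(host=HOST, port=PORT, user=USER, password=PASSWORD,
                   database="postgres"):
    """Build a libpq-style connection URL for the local server."""
    return f"postgresql://{user}:{password}@{host}:{port}/{database}"


def port_is_open(host=HOST, port=PORT, timeout=1.0):
    """Return True if a TCP connection to host:port can be opened."""
    try:
        with socket.create_connection((host, port), timeout=timeout):
            return True
    except OSError:
        return False


if __name__ == "__main__":
    print(connection_url())
    print("server reachable:", port_is_open())
```

An open port only shows that some process accepts connections there. Checking the password still requires a real client such as psql.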

Testing without Docker when Docker is installed

You can simulate the absence of docker by running this code inside a container.

First run:

docker run --name test -d -i -t -v .:/datasets ubuntu:22.04

Then run:

docker exec -it test bash

Building the .sql dumps

To build the datasets, first enter the project's Poetry environment (run poetry install beforehand if the requirements are not installed yet):

poetry shell

You can then build the datasets with:

python -m datasets.build

You may also need to install the requirements of some database drivers, such as mysqlclient: https://pypi.org/project/mysqlclient/

