Skip to main content

Data7 streams CSV/Parquet datasets over HTTP from SQL queries.

Project description

Data7 ⚡ Open your data in minutes

Pronounced data·set (7 like sept in French).

GitHub Actions Workflow Status PyPI - Version

The idea 💡

TL;DR Data7 is a high performance web server that generates dynamic datasets (in CSV or Parquet formats) from existing databases and streams them over HTTP 🎉

A quick example

Let say you have a restaurant table in your PostgreSQL database, and you want to make this table an always-up-to-date dataset that can be easily used by the rest of the world.

All you have to do is to initialize your project:

data7 init

✍️ Edit configuration files:

# settings.yaml
production:
  host: "https://data7.wonderful-places.org"
  port: 80

# .secrets.yaml
production:
  DATABASE_URL: "postgresql+asyncpg://user:pass@server:port/wonderful-places"

# data7.yaml
production:
  datasets:
    - basename: restaurants
      query: "SELECT * FROM restaurant"

🏎️ Fire up the data7 server:

data7 run

💥 Your dataset is available at:

Documentation

The complete documentation of the project is avaiable at: https://jmaupetit.github.io/data7/

License

This work is released under the MIT License.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

data7-0.5.0.tar.gz (9.4 kB view hashes)

Uploaded Source

Built Distribution

data7-0.5.0-py3-none-any.whl (10.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page