Skip to main content

Python package that loads data from the web and deploys a corresponding external table definition, so that the data can be queried using standard SQL.

Project description

vestapol

vestapol is a Python package that loads data from the web and deploys a corresponding external table definition, so that the data can be queried using standard SQL.

"Vestapol" is an open D Major tuning for the guitar. It is named after a 19th-century composition distributed in some of the earliest instructional guides for guitar.

Usage

from vestapol.web_resources.csv_resource import CSVResource
from vestapol.destinations.gcp_destination import GoogleCloudPlatform


nyt_covid_data_2022 = CSVResource(
    name="nyt_covid19_us_counties_2022",
    base_url="https://raw.githubusercontent.com/",
    endpoint="nytimes/covid-19-data/master/rolling-averages/us-counties-2022.csv",
    version="v0",
    skip_leading_rows=1,
)

destination = GoogleCloudPlatform()

nyt_covid_data_2022.load(destination)
tablename = destination.create_table(nyt_covid_data_2022)


from google.cloud import bigquery

client = bigquery.Client()
query = f"""
    select date, state, county, cases_avg_per_100k
    from `{tablename}`
    where requested_at = '{nyt_covid_data_2022.requested_at}'
    limit 5
"""
query_job = client.query(query)
for row in query_job.result():
    print(row)

Prerequisites

Installation of this project requires Poetry 1.2+ and Python version 3.9+.

Older version of poetry can be updated by running:

poetry self update
poetry --version

Installation

Install vestapol and its dependencies by running:

poetry install

Testing

Run tests with the following command:

poetry run pytest

Environment Variables

  • GCS_BUCKET_NAME: the Google Cloud Storage bucket where data is loaded (e.g. inq-warehouse-waligob)
  • GCS_ROOT_PREFIX: the GCS prefix where data is loaded (e.g. data_catalog)
  • GBQ_PROJECT_ID: the BigQuery project identifier (e.g. inq-warehouse)
  • GBQ_DATASET_ID: the BigQuery dataset where external tables will be created (e.g. data_catalog_waligob)
  • GBQ_DATASET_LOCATION: the BigQuery dataset location (e.g. US)
  • GOOGLE_APPLICATION_CREDENTIALS=: location of the GCS service account keyfile (e.g. ~/inq-warehouse-f0962a57089e-inf.json)

Publishing to PyPI

Instructions for pushing new versions of vestapol to PyPI:

  1. Update CHANGELOG.md. Include Additions, Fixes, and Changes.

  2. Update project version using either a valid PEP 440 string or a valid bump rule following Semantic Versioning.

    poetry version <version string or bump rule>
  1. Create a release and check the CD Pipeline action to ensure that the project was built and published to PyPI successfully.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vestapol-0.0.20.tar.gz (11.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vestapol-0.0.20-py3-none-any.whl (15.4 kB view details)

Uploaded Python 3

File details

Details for the file vestapol-0.0.20.tar.gz.

File metadata

  • Download URL: vestapol-0.0.20.tar.gz
  • Upload date:
  • Size: 11.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.10.2 Linux/5.15.0-1039-azure

File hashes

Hashes for vestapol-0.0.20.tar.gz
Algorithm Hash digest
SHA256 0884975fca2cd96394f0c9bd5bf1159fdaac704e12a36591573c6d0b64f1ec23
MD5 adaee7721b538e573c482490aaf1e829
BLAKE2b-256 c4ee17308dbc9f6d1e61804f8b61f299da4211ba2601365c9559e3d344f7dc7d

See more details on using hashes here.

File details

Details for the file vestapol-0.0.20-py3-none-any.whl.

File metadata

  • Download URL: vestapol-0.0.20-py3-none-any.whl
  • Upload date:
  • Size: 15.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.5.1 CPython/3.10.2 Linux/5.15.0-1039-azure

File hashes

Hashes for vestapol-0.0.20-py3-none-any.whl
Algorithm Hash digest
SHA256 6b711150c9120bd9d57f9cef43e3235e06ecade84b235b5a9769e7c13fbd2756
MD5 7c106bf3d703d7a8c80b1399eb85f3c9
BLAKE2b-256 2fa7733de0af01db35d968dba5c986978ac18faaa337fd41215cc99b46ae6ab9

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page