GCP utility will give additional custom features over gcs
Project description
# GCP Utility tool
We can backfill any kind of bigquery sql to specific destination table with powerful python jinja templating feature. By default it can support date templates in your query. It will automatically convert to date template to date value in your query while executing.
Following are the available date templates you can use it in your query.
```bash
{{ds}} - YYYY-DD-MM date string
{{ds_nodash}} - YYYYDDMM date string
{{ds_yesterday}} - YYYY-DD-MM yesterday date string
{{ds_yesterday_nodash}} - YYYYDDMM yesterday date string
{{ds_tomorrow}} - YYYY-DD-MM tomorrow date string
{{ds_tomorrow_nodash}} - YYYYDDMM tomorrow date string
```
## Getting Started
### Prerequisites
What things you need to install the software and how to install them
```
python 2.7
google-cloud==0.32.0
Jinja2==2.10
```
### Installing
Use virtualenv to Install the dependancy packages
```
git clone https://gitlab.com/manojnuk50/GCPUtility
cd GCPUtility
virtualenv venv
activate venv/bin/activate
pip install -e .
```
-h command will return the help content which contains all the available options
```
bq_client -h
```
End with an example of getting some data out of the system or using it for a little demo
## Available option
```bash
-h, --help show this help message and exit
-v, --version show program's version number and exit
-q QUERY, --query QUERY
provide valid bigquery sql
-f, --no-flattern Flattern results
-p PROJECT, --project_id PROJECT
provide valid project id
-ssql SSQL, --standard-sql SSQL
Mention if using Standard sql
-d DESTINATION_TABLE, --destination-table DESTINATION_TABLE
<projectname>:<datasetid>.<tableid> provide valid
destination project-id
-w WRITE_DESPOSITION, --write-desposition WRITE_DESPOSITION
Write disposition value
-qf QUERY_FILE, --query-file QUERY_FILE
provide bigquery sql filepath
-t TEMPLATE, --template TEMPLATE
provide template values
-tf TEMPLATE_FILE, --template-file TEMPLATE_FILE
provide template file path
-s START_DATE, --start-date START_DATE
Provide valid startdate (YYYY-MM-DD)
-e END_DATE, --end-date END_DATE
Provide valid end date (YYYY-MM-DD)
-sf SERVICE_ACCOUNT_PATH, --service-account-file-path SERVICE_ACCOUNT_PATH
provide valid path of service account json file
```
### passing raw query
you can pass the raw query to script that need to be execute
```
bq_client -q "select * from [sample_proj:datasetid.tableid]" -p destination_projectid -d dest_datasetid.tableid
```
### Using sql file
if your query is quite long, you can save it to text file and pass the path of it
```
bq_client --query-file "/path/sample.sql" -p destination_projectid -d dest_datasetid.tableid```
```
### Passing date parameter
In default it will substitute current date into template fields. but if you want to change the date value you can pass manually using start date parameter
```
bq_client --query-file sample.sql -p projectid -d dest_datasetid.tableid -s 2018-02-02
```
### Backfill
Pass start-date and end-date to script, it will backfill the data.
```
bq_client --query-file sample.sql -p projectid -d dest_datasetid.tableid -s 2018-02-02 -e 2018-02-10
```
We can backfill any kind of bigquery sql to specific destination table with powerful python jinja templating feature. By default it can support date templates in your query. It will automatically convert to date template to date value in your query while executing.
Following are the available date templates you can use it in your query.
```bash
{{ds}} - YYYY-DD-MM date string
{{ds_nodash}} - YYYYDDMM date string
{{ds_yesterday}} - YYYY-DD-MM yesterday date string
{{ds_yesterday_nodash}} - YYYYDDMM yesterday date string
{{ds_tomorrow}} - YYYY-DD-MM tomorrow date string
{{ds_tomorrow_nodash}} - YYYYDDMM tomorrow date string
```
## Getting Started
### Prerequisites
What things you need to install the software and how to install them
```
python 2.7
google-cloud==0.32.0
Jinja2==2.10
```
### Installing
Use virtualenv to Install the dependancy packages
```
git clone https://gitlab.com/manojnuk50/GCPUtility
cd GCPUtility
virtualenv venv
activate venv/bin/activate
pip install -e .
```
-h command will return the help content which contains all the available options
```
bq_client -h
```
End with an example of getting some data out of the system or using it for a little demo
## Available option
```bash
-h, --help show this help message and exit
-v, --version show program's version number and exit
-q QUERY, --query QUERY
provide valid bigquery sql
-f, --no-flattern Flattern results
-p PROJECT, --project_id PROJECT
provide valid project id
-ssql SSQL, --standard-sql SSQL
Mention if using Standard sql
-d DESTINATION_TABLE, --destination-table DESTINATION_TABLE
<projectname>:<datasetid>.<tableid> provide valid
destination project-id
-w WRITE_DESPOSITION, --write-desposition WRITE_DESPOSITION
Write disposition value
-qf QUERY_FILE, --query-file QUERY_FILE
provide bigquery sql filepath
-t TEMPLATE, --template TEMPLATE
provide template values
-tf TEMPLATE_FILE, --template-file TEMPLATE_FILE
provide template file path
-s START_DATE, --start-date START_DATE
Provide valid startdate (YYYY-MM-DD)
-e END_DATE, --end-date END_DATE
Provide valid end date (YYYY-MM-DD)
-sf SERVICE_ACCOUNT_PATH, --service-account-file-path SERVICE_ACCOUNT_PATH
provide valid path of service account json file
```
### passing raw query
you can pass the raw query to script that need to be execute
```
bq_client -q "select * from [sample_proj:datasetid.tableid]" -p destination_projectid -d dest_datasetid.tableid
```
### Using sql file
if your query is quite long, you can save it to text file and pass the path of it
```
bq_client --query-file "/path/sample.sql" -p destination_projectid -d dest_datasetid.tableid```
```
### Passing date parameter
In default it will substitute current date into template fields. but if you want to change the date value you can pass manually using start date parameter
```
bq_client --query-file sample.sql -p projectid -d dest_datasetid.tableid -s 2018-02-02
```
### Backfill
Pass start-date and end-date to script, it will backfill the data.
```
bq_client --query-file sample.sql -p projectid -d dest_datasetid.tableid -s 2018-02-02 -e 2018-02-10
```
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Close
Hashes for gcp_utility-1.1.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | aa0d52b82aa498cadd5abe6cea4d832fd11c8cdcca8e951cc15fc706a7e52e58 |
|
MD5 | 0852cbb682154228560a02e0646b79c4 |
|
BLAKE2b-256 | bdc3d556fdd04be2b05570c4eab66695ddc539cae453e35978c61ca9e09d9d07 |