Skip to main content

Pyplatform provides wrapper functions for using Google BigQuery as datawarehouse and analytics platform.The package enables creating data pipelines involving Google Cloud, Microsoft Azure, O365, and Tableau Server as source and destination.

Project description

Pyplatform is a data analytics platform built around Google BigQuery. This package provides wrapper functions for interacting with cloud services and creating data pipelines involving Google Cloud, Microsoft Azure, O365, and Tableau Server as source and destination.

the platorm architecture:

  • enables fast and scalable SQL datawarehousing service
  • abstracts away the infrastuture by builiding data pipelines with serverless compute solutions in python runtime environments
  • simplifies development environment by using jupyter lab as the main tool

Installation

pip install pyplatform

Setting up development environment

git clone https://github.com/mhadi813/pyplatform
cd pyplatform
conda env create -f pyplatform_dev.yml

Authentication and environment variables

Credential file path can be set a environment varible in conda activation script. Please refer to conda documentation for enviroment variables

import os
#TODO: update path to credential files
# see ``secrets`` folder for credential tamplates
# see functions ``doc string`` for authentication methods at run-time
os.environ['GOOGLE_APPLICATION_CREDENTIALS'] = './secrets/dummy_gcp_service_account_credentials.json'
os.environ["AZURE_CREDENTIALS"]= './secrets/dummy_ms_azure_credentials.json' 
os.environ['TABLEAU_SERVER_CREDENTIALS']='./secrets/dummy_tableau_server_credentials.json'
os.environ['PIVOTAL_CREDENTIALS']='./secrets/dummy_pivotal_credentials.json'

os.environ['DATASET'] = 'default_bigquery_dataset_name'
os.environ['STORAGE_BUCKET'] = 'default_storage_bucket_id'

Usage

common data pipeline architectures:

- Http sources

- On-prem servers

- Bigquery integration with Azure Logic Apps

- Event driven ETL process

- Streaming pipelines

Exploring modules

from pyplatform.common import *
show_me()

import pyplatform as pyp
show_me(pyp)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyplatform-2020.7.3.tar.gz (445.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyplatform-2020.7.3-py3-none-any.whl (14.8 kB view details)

Uploaded Python 3

File details

Details for the file pyplatform-2020.7.3.tar.gz.

File metadata

  • Download URL: pyplatform-2020.7.3.tar.gz
  • Upload date:
  • Size: 445.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.7

File hashes

Hashes for pyplatform-2020.7.3.tar.gz
Algorithm Hash digest
SHA256 474dd71bcc3c1c4e36925c9251f0791687e200bec4c1263bb0769db73f2ab6ae
MD5 eb05f51791465ce410815b2baa446781
BLAKE2b-256 3ccdc8f0682d977560e6e7253aa01d34a5f6c7e8d500249c969ff1d59094111d

See more details on using hashes here.

File details

Details for the file pyplatform-2020.7.3-py3-none-any.whl.

File metadata

  • Download URL: pyplatform-2020.7.3-py3-none-any.whl
  • Upload date:
  • Size: 14.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.6.0 requests-toolbelt/0.9.1 tqdm/4.48.2 CPython/3.7.7

File hashes

Hashes for pyplatform-2020.7.3-py3-none-any.whl
Algorithm Hash digest
SHA256 524e7d61caaa0f9fd7fe5951b2d8806db1eda2dd50743b90d766d6ec29740683
MD5 8373675a884a6411acfd592303098871
BLAKE2b-256 ca95b8799c78e12c712cf31c91cad68bb06e093109460247026610c9f3d38938

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page