Pyplatform is a data analytics platform built around Google BigQuery. This package provides wrapper functions for interacting with cloud services and creating data pipelines using Google Cloud, Microsoft Azure, O365, and Tableau Server as source and destination.

Project description

The platform architecture:

  • enables a fast and scalable SQL data warehousing service
  • abstracts away the infrastructure by building data pipelines with serverless compute solutions in Python runtime environments
  • simplifies the development environment by using JupyterLab as the main tool

Installation

pip install pyplatform

Setting up development environment

git clone https://github.com/mhadi813/pyplatform
cd pyplatform
conda env create -f pyplatform_dev.yml

Authentication and environment variables

The credential file paths can be set as environment variables in the conda environment activation script or in a bash profile. See the conda documentation on environment variables and environment activation scripts.

import os
# if the env activation script has not been created, update the paths to the credential files
# see the ``secrets`` folder for credential templates
# see each function's docstring for the authentication methods it supports
os.environ['GOOGLE_APPLICATION_CREDENTIALS'] = './secrets/dummy_gcp_service_account_credentials.json'
os.environ['AZURE_CREDENTIALS'] = './secrets/dummy_ms_azure_credentials.json'
os.environ['TABLEAU_SERVER_CREDENTIALS'] = './secrets/dummy_tableau_server_credentials.json'
os.environ['PIVOTAL_CREDENTIALS'] = './secrets/dummy_pivotal_credentials.json'

os.environ['DATASET'] = 'default_bigquery_dataset_name'
os.environ['STORAGE_BUCKET'] = 'default_storage_bucket_id'
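
A mistyped credential path usually surfaces only when a function first tries to authenticate, so it can help to check the variables up front. The following is a minimal sketch and not part of pyplatform: the helper name `missing_credentials` is hypothetical, and it assumes only the variable names shown above.

```python
import os
from pathlib import Path

# Variable names taken from the snippet above; trim the tuple to the services you use.
CREDENTIAL_VARS = (
    "GOOGLE_APPLICATION_CREDENTIALS",
    "AZURE_CREDENTIALS",
    "TABLEAU_SERVER_CREDENTIALS",
    "PIVOTAL_CREDENTIALS",
)


def missing_credentials(names=CREDENTIAL_VARS, environ=None):
    """Return {variable: reason} for each variable that is unset or points to no file.

    Hypothetical helper for illustration; pyplatform does not provide it.
    """
    environ = os.environ if environ is None else environ
    problems = {}
    for name in names:
        path = environ.get(name)
        if not path:
            problems[name] = "not set"
        elif not Path(path).is_file():
            problems[name] = f"file not found: {path}"
    return problems


if __name__ == "__main__":
    # Print one line per problem; no output means every path resolves to a file.
    for name, reason in missing_credentials().items():
        print(f"{name}: {reason}")
```

An empty result means every listed variable is set and resolves to an existing file. It does not confirm that the credentials themselves are valid.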

Usage

Common usage patterns:

- HTTP sources

- On-prem sources with VPN requirement

- BigQuery integration with Azure Logic Apps

- Event-driven ETL process

- Streaming pipelines
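
To make the event-driven pattern concrete, here is a minimal sketch of the first step such a pipeline might take: turning a Cloud Storage object event into a source URI and a destination table. It does not use any pyplatform function. The names `plan_load` and `handle_event` and the rule that maps a file name to a table name are assumptions for illustration. The only external fact relied on is that a Cloud Storage event payload carries `bucket` and `name` keys.

```python
import os


def plan_load(event, default_dataset=None):
    """Map a Cloud Storage object event to a source URI and a destination table.

    Hypothetical helper: the file-name-to-table rule is an illustrative assumption.
    """
    # Fall back to the DATASET variable described under "Authentication and environment variables".
    dataset = default_dataset or os.environ.get("DATASET", "default_bigquery_dataset_name")
    bucket, name = event["bucket"], event["name"]
    # Use the file name without folders or extension as the table name.
    table = os.path.splitext(os.path.basename(name))[0]
    return {
        "source_uri": f"gs://{bucket}/{name}",
        "destination": f"{dataset}.{table}",
    }


def handle_event(event, context=None):
    """Entry point shaped like a background function: (event, context)."""
    plan = plan_load(event)
    # The actual load call is left out; it depends on the client or wrapper you use.
    print(f"would load {plan['source_uri']} into {plan['destination']}")
    return plan
```

Keeping the planning step free of network calls makes it easy to test locally before wiring in a real load job.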

Exploring the modules

from pyplatform.common import *
show_me()

import pyplatform as pyp
show_me(pyp)
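
`show_me` is pyplatform's own helper, and its output is not described here. As a generic alternative that works for any installed package, the sketch below lists a package's direct submodules using only the standard library. The function name `list_submodules` is hypothetical, and `json` is used as a stand-in so the example runs without pyplatform installed.

```python
import importlib
import pkgutil


def list_submodules(package_name):
    """Return the sorted names of the modules directly inside a package."""
    package = importlib.import_module(package_name)
    # Only packages have __path__; plain modules contain no submodules.
    search_path = getattr(package, "__path__", None)
    if search_path is None:
        return []
    return sorted(info.name for info in pkgutil.iter_modules(search_path))


if __name__ == "__main__":
    # "json" is a standard-library stand-in; replace it with "pyplatform" once installed.
    print(list_submodules("json"))
```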

