Google Cloud Storage and Azure Storage functions
Project description
Pyplatform is a data analytics platform architeture built around Google BigQuery in a hybrid cloud environment.
the platorm:
- provides fast, scalable and reliable SQL database solution
- abstracts away the infrastuture by builiding data pipelines with serverless compute solutions in python runtime environments
- simplifies development environment by using jupyter lab as the main tool
Installation
pip install pyplatform
Setting up development environment
git clone https://github.com/mhadi813/pyplatform
cd pyplatform
conda env create -f pyplatform_dev.yml
Environment variables
import os
os.environ['GOOGLE_APPLICATION_CREDENTIALS'] = 'path/to/default_service_account.json'
os.environ['DATASET'] = 'default_bigquery_dataset_name'
os.environ['STORAGE_BUCKET'] = 'default_storage_bucket_id'
Usage
common data pipeline architectures:
- Http sources
- On-prem servers
- Bigquery integration with Azure Logic Apps
- Event driven ETL process
- Streaming pipelines
Exploring modules
import pyplatform as pyp
pyp.show_me()
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for pyplatform-datalake-0.0.0.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2db827a7bedff5dfa94a9ca0c2e29216e31674dc3dfb2e4042d4a1675d399b13 |
|
MD5 | 20dcaefc187fe77f9529b274060212c5 |
|
BLAKE2b-256 | 7c3fb99491bfe98271b77189899637aaab03e0b68442113fc0645b0fcbc53a74 |
Close
Hashes for pyplatform_datalake-0.0.0-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e2bafe967d9b23930a7509f2bef5c812fb42755dfc8cc7bd4b9a632415feb031 |
|
MD5 | 7fbc6e6dd14e7d868b166e81be4fe7ad |
|
BLAKE2b-256 | 618c438a40ee32113edcc18fda9558ed0d7df0e31aec2089326bb06c4c9f2842 |