Data engineering & Data science Pipeline Framework
Project description
PyGyver
PyGyver is a user-friendly python package for data integration and manipulation.
Named after MacGyver, title character in the TV series MacGyver, and Python, the main language used in the repository.
Installation
PyPi
PyGyver is available on PyPi.
pip install pygyver
Setup
Most APIs requires access token files to authentificate and perform tasks such as creating or deleting objects. Those files need to be generated prior to using pygyver
and stored in the environment you are executing your code against. The package make use of environment variables, and some of the below might need be supplied in your environment:
# Access token path
GOOGLE_APPLICATION_CREDENTIALS=path_to_google_access_token.json
FACEBOOK_APPLICATION_CREDENTIALS=path_to_facebook_access_token.json
# Default values
BIGQUERY_PROJECT=your-gcs-project
GCS_PROJECT=your-gcs-project
GCS_BUCKET=your-gcs-bucket
# Optional
PROJECT_ROOT=path_to_where_your_code_lives
Modules
PyGyver is structured around several modules available in the etl
folder. Here is a summary table of those modules:
Module name | Descrition | Documentation |
---|---|---|
dw |
Perform task against the Google Cloud BigQuery API | dw.md |
facebook |
Perform task against the Facebook Marketing API | facebook.md |
gooddata |
Perform task against the GoodData API | - |
gs |
Perform task against the Google Sheet API | - |
lib |
Store utilities used by other modules | - |
pipeline |
Utility to build data pipelines via YAML definition | pipeline.md |
prep |
Data transformation - ML pipelines | - |
storage |
Perform task against the AWS S3 and Google Cloud Storage API | storage.md |
toolkit |
Sets of tools for data manipulation | - |
In order to load BigQueryExecutor
from the dw
module, you can run:
from pygyver.etl.dw import BigQueryExecutor
Contributing
To get started...
Step 1
- 👯 Clone this repo to your local machine using
git@github.com:madedotcom/pygyver.git
Step 2
- HACK AWAY! 🔨🔨🔨
The team follows TDD to develop new features on pygyver
.
Tests can be found in pygyver/tests
.
Step 3
- 🔃 Create a new pull request and request review from team members. Where applicable, a test should be added with the code change.
FAQ
- How to release a new version to PyPi?
- Merge your changes to
master
branch - Create a new release using
https://github.com/madedotcom/pygyver/releases
- Merge your changes to
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pygyver-0.1.1.42.tar.gz
.
File metadata
- Download URL: pygyver-0.1.1.42.tar.gz
- Upload date:
- Size: 63.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 036e837607c2a6952927fdf54bae24b50cd034aaf4b85945a0300e7ba546320d |
|
MD5 | 69fb21ff05c89925e59aadd2dea000d1 |
|
BLAKE2b-256 | e6c0cb05b7eef1faeda980138bf515390099bff1ac59fbc9c8e1a819da00d2f1 |
File details
Details for the file pygyver-0.1.1.42-py3-none-any.whl
.
File metadata
- Download URL: pygyver-0.1.1.42-py3-none-any.whl
- Upload date:
- Size: 52.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 96dcaf42fffe80c01bafedf46d7978d2f8470c44b3075fa79de7333f580f9555 |
|
MD5 | d66b3b674fc942f4fca261a7a51a20ba |
|
BLAKE2b-256 | d9c221a405f78613c98419bfa0adac0d90e91d2d5c7f267d7b2f730cb7e7b096 |