Skip to main content

A compiliation of data pipeline scripts in Python

Project description

About

This is a compilation of my data pipeline scripts written in Python.

Conventions

Each pipeline function is an executable python file that accepts flags to modify the specific configurations of the pipeline (i.e. MSSQL DB Name, GCS Bucket Name).

When loading data from a third-party source you can set the temporary destination of the data to the data/ folder. After the data has been successfully ingested remove the file from the data folder.

Folder Structure

  • config/
    • contains any specific configurations that need to be modified within the Docker container
  • data/
    • a temporary landing zone for any data that is ingested from a third-party source
  • functions/
    • contains all pipeline functions
  • functions/utils/
    • contains all reusable code and can be organized futher as either a Source (where data is pulled from), or a Target (where data is placed)

Setup

  1. Create and activate a python virtual environment.
python3 -m venv venv
source /venv/bin/activate
  1. Install python dependencies
pip install -r requirements.txt

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pypeline_functions-0.1.0.tar.gz (12.4 kB view details)

Uploaded Source

Built Distribution

pypeline_functions-0.1.0-py3-none-any.whl (17.9 kB view details)

Uploaded Python 3

File details

Details for the file pypeline_functions-0.1.0.tar.gz.

File metadata

  • Download URL: pypeline_functions-0.1.0.tar.gz
  • Upload date:
  • Size: 12.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.5

File hashes

Hashes for pypeline_functions-0.1.0.tar.gz
Algorithm Hash digest
SHA256 a5e517fd3b50a5ab90ef5097f1d23d07ed29a6dc691893b4668a664fbefff6ba
MD5 e3b2225bacd57037147f10f169a87291
BLAKE2b-256 0b3584818100e3cec12bbc433a24ed01c17822985bfc290783254b165b85acac

See more details on using hashes here.

File details

Details for the file pypeline_functions-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for pypeline_functions-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ccac55717cd242d0d3679d24c68c12c643b5e5b642bb5c4956b6196e5e4d45ce
MD5 4687865806fa96042c161caf9c64cc8b
BLAKE2b-256 e8832f9dafffbf1b8ff27c9255769359b887488b6702e9ad8f269a40ab142695

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page