Skip to main content

ETL for Python

Project description

ETL using python

Python package for extracting data from a source, transforming it and loading it to a destination, with validation in between.

The provided ETL pipeline provides useful functionality on top of the usual operations:

  • Extract: Extract data from multiples sources, in parallel (using threads).
  • Validate: Validate the extracted data, to make sure it matches what will be required by the transform step, using pandera schemas. This provide early fail if there is any unexpected change in the sources.
  • Transform: Define the logic for transformation of the data, making it reusable, and allowing multiple data frames as input and multiple data frames as output.
  • Validate again: Validate the transformed data, to make sure it matches your expectation, and what the destination will require.
  • Load: Load multiple data, each to one or more destination, and load diferent data to diferent destinations in parallel (using threads).

Installation

The package is available at PyPI, so you can install it using pip:

pip install extralo

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

extralo-0.12.3.tar.gz (11.9 kB view details)

Uploaded Source

Built Distribution

extralo-0.12.3-py3-none-any.whl (13.6 kB view details)

Uploaded Python 3

File details

Details for the file extralo-0.12.3.tar.gz.

File metadata

  • Download URL: extralo-0.12.3.tar.gz
  • Upload date:
  • Size: 11.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for extralo-0.12.3.tar.gz
Algorithm Hash digest
SHA256 7ae697b9401409874fc30b2b7eb36945140e90f45e3dd4ac04e78c6404d64cc0
MD5 bfbb2330820500351d613cc0dfc23c30
BLAKE2b-256 5147d8c5c471bd5659b42b741f482e3109ee9ae4ab0169dc1e9e95a1bdd03074

See more details on using hashes here.

File details

Details for the file extralo-0.12.3-py3-none-any.whl.

File metadata

  • Download URL: extralo-0.12.3-py3-none-any.whl
  • Upload date:
  • Size: 13.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.0 CPython/3.12.5

File hashes

Hashes for extralo-0.12.3-py3-none-any.whl
Algorithm Hash digest
SHA256 db578f1bb929fc36eeede7f7040a98ba0cf303956be6c8d893d8e6186452e097
MD5 5fa3c6bb186d191cdc0a1b777aa4a4cb
BLAKE2b-256 98998feb5619d247891659af12853a3dbc9c940f001138ca7ac79d1889e0c7c1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page