Skip to main content

AUTO1 ETL Pipeline

Project description

Auto1 ETL

Auto1 ETL is simple etl Pipeline, which extracts the data from given source dataFile and returns the transformed data.

Directory Structure

│   .gitignore
│   challenge_me.txt
│   README.md
│   sample.py
│   setup.py
│
├───.vscode
│       launch.json
│
└───auto1
    │   etl.py
    │   __init__.py
    │
    ├───helper
    │   │   helper.py
  • auto1/etl.py: Etl script
  • helper/helper.py: helper script with mapper and utility functions required for etl process.
  • challenge_me.txt: input dataFile to perform transformation
  • sample.py: sampy script to import etl class as module

Instructions

  • install the package as pip install auto1

  • import auto1.Etl in your python file

  • Initialize the Etl constructor with source dataFile path

  • Invoke the methods in order as below

    from auto1.etl import Etl
    etl = Etl('challenge_me.txt')
    etl.load(etl.dataFile)
    output = etl.transform(etl.dataFile)
    print(output)
    
  • load method creates a temporary dataFile, which is prepared for transformation.

  • transformation methods returns the transformed records as nested list and deletes the temporary dataFile created by load method.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

auto1-0.0.1.tar.gz (3.7 kB view hashes)

Uploaded Source

Built Distribution

auto1-0.0.1-py3-none-any.whl (4.2 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page