Skip to main content

Extract Transform Load (ETL) toolkit for python

Project description

Toolkit for doing data integration related work, using connected

transformations. Unlike java based tools like talend or pentaho data-integration, this is a DIY framework, and if you’re looking for a WYSIWIG ETL engine, you should probably go back to the previously cited ones.

Create a harness.

>>> from rdc.etl.harness.threaded import ThreadedHarness as Harness
>>> harness = Harness()

Create some data transformations.

>>> from rdc.etl.transform.extract import Extract
>>> extract = Extract(stream_data=({'foo': 'bar'}, {'foo': 'baz'}))
>>> from rdc.etl.transform.simple import SimpleTransform
>>> transform = SimpleTransform()
>>> transform.add('foo').filter('upper')
>>> from rdc.etl.transform.util import Log
>>> load = Log()

Tie everything together.

>>> harness.add_chain(extract, transform, load)

Run.

>>> harness()

This is a work in progress, the 1.0 API may change until 1.0 is released.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rdc.etl-1.0.0a2.tar.gz (26.0 kB view details)

Uploaded Source

File details

Details for the file rdc.etl-1.0.0a2.tar.gz.

File metadata

  • Download URL: rdc.etl-1.0.0a2.tar.gz
  • Upload date:
  • Size: 26.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for rdc.etl-1.0.0a2.tar.gz
Algorithm Hash digest
SHA256 fb7e79a64b771cea9a003c968ca5b90ea16c2529f1f20efe8a4c8e878bd3183b
MD5 e8e774cfcb7717bb91165a8844984ea9
BLAKE2b-256 ffd6a56ad7b1cf2704152d46e6d3448dda70e6e07c953b76a322c48fd22740e8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page