Skip to main content

Extract Transform Load (ETL) toolkit for python

Project description

Toolkit for doing data integration related work, using connected

transformations. Unlike java based tools like talend or pentaho data-integration, this is a DIY framework, and if you’re looking for a WYSIWIG ETL engine, you should probably go back to the previously cited ones.

Not so relevant example:

>>> from rdc.etl.harness.threaded import ThreadedHarness as Harness
>>> harness = Harness()
>>> from rdc.etl.transform.extract import Extract
>>> extract = Extract(stream_data=({'foo': 'bar'}, {'foo': 'baz'}))
>>> from rdc.etl.transform.simple import SimpleTransform
>>> transform = SimpleTransform()
>>> transform.add('foo').filter('upper')
>>> from rdc.etl.transform.util import Log
>>> load = Log()
>>> harness.chain_add(extract, transform, load)
>>> harness()

This is a work in progress, although it it used for a few different production systems, it may or may not fit your need, and you should expect to have to dive into the code for now, as neither documentation or tests are there to help.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rdc.etl-1.0.0a1.tar.gz (23.0 kB view details)

Uploaded Source

File details

Details for the file rdc.etl-1.0.0a1.tar.gz.

File metadata

  • Download URL: rdc.etl-1.0.0a1.tar.gz
  • Upload date:
  • Size: 23.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for rdc.etl-1.0.0a1.tar.gz
Algorithm Hash digest
SHA256 a441a157d22a5912d67f459b6812a00ee73aa61a55149a15189f3722e207d52a
MD5 714ded5d4ad1b96812888e9872d3d297
BLAKE2b-256 f3e68e8519f761ed51f967fbcfe819f8bba38f9b620bee9b6500a068bd5b7fa0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page