Extract Transform Load (ETL) toolkit for Python
Project description
A toolkit for data integration work, built from connected transformations. Unlike Java-based tools such as Talend or Pentaho Data Integration, this is a DIY framework: if you're looking for a WYSIWYG ETL engine, you should probably go back to those tools.
Create a harness.
>>> from rdc.etl.harness.threaded import ThreadedHarness as Harness
>>> harness = Harness()
Create some data transformations.
>>> from rdc.etl.transform.extract import Extract
>>> extract = Extract(stream_data=({'foo': 'bar'}, {'foo': 'baz'}))
>>> from rdc.etl.transform.simple import SimpleTransform
>>> transform = SimpleTransform()
>>> transform.add('foo').filter('upper')
>>> from rdc.etl.transform.util import Log
>>> load = Log()
Tie everything together.
>>> harness.add_chain(extract, transform, load)
Run.
>>> harness()
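To make the idea of "connected transformations" concrete, here is a minimal sketch in plain Python that does not use rdc.etl at all. The function names (`extract`, `upper_field`, `log`, `run_chain`) are hypothetical, the sketch runs in a single thread rather than in a threaded harness, and it assumes that the `'upper'` filter above upper-cases the value of the `foo` field.

```python
def extract(rows):
    """Yield a copy of each input row (stands in for an extract step)."""
    for row in rows:
        yield dict(row)


def upper_field(rows, field):
    """Yield rows with the given field upper-cased (assumed 'upper' filter)."""
    for row in rows:
        out = dict(row)
        out[field] = str(out[field]).upper()
        yield out


def log(rows):
    """Print each row and pass it along unchanged (stands in for a load step)."""
    for row in rows:
        print(row)
        yield row


def run_chain(source, *steps):
    """Feed the output of each step into the next, then drain the stream."""
    stream = source
    for step in steps:
        stream = step(stream)
    return list(stream)


data = ({'foo': 'bar'}, {'foo': 'baz'})
result = run_chain(
    extract(data),
    lambda rows: upper_field(rows, 'foo'),
    log,
)
```

Each step is a generator, so rows flow through the chain one at a time instead of being materialized between steps.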
This is a work in progress: the API may change until 1.0 is released.
Download files
Source Distribution
rdc.etl-1.0.0a2.tar.gz (26.0 kB)
File details
Details for the file rdc.etl-1.0.0a2.tar.gz.
File metadata
- Download URL: rdc.etl-1.0.0a2.tar.gz
- Upload date:
- Size: 26.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | fb7e79a64b771cea9a003c968ca5b90ea16c2529f1f20efe8a4c8e878bd3183b |
| MD5 | e8e774cfcb7717bb91165a8844984ea9 |
| BLAKE2b-256 | ffd6a56ad7b1cf2704152d46e6d3448dda70e6e07c953b76a322c48fd22740e8 |
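The published digests can be used to check a downloaded archive. The following is a minimal sketch using only the standard library `hashlib` module; the helper names `sha256_of_file` and `matches_expected` are hypothetical, and the path you pass in is assumed to be wherever you saved the source distribution.

```python
import hashlib

# SHA256 digest listed for rdc.etl-1.0.0a2.tar.gz in the table above.
EXPECTED_SHA256 = (
    'fb7e79a64b771cea9a003c968ca5b90ea16c2529f1f20efe8a4c8e878bd3183b'
)


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, 'rb') as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b''):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected one."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_expected('rdc.etl-1.0.0a2.tar.gz')` would return `True` only if the local file's digest equals the listed one.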