Lightweight but versatile Python framework for multi-stage information processing

Project description

data-plumber

data-plumber is a lightweight but versatile Python framework for multi-stage information processing. It lets you construct processing pipelines from atomic building blocks as well as by recombining existing pipelines. Forks enable more complex (i.e. non-linear) orders of execution. Pipelines can also be collected into arrays that are executed at once with the same input data.

Minimal usage example

Consider a scenario where the contents of a dictionary have to be validated and a suitable error message has to be generated. Specifically, a valid input dictionary is expected to have a key "data" whose value is a list of integers. A suitable pipeline might look like this:

>>> from data_plumber import Stage, Pipeline, Previous
>>> pipeline = Pipeline(
        Stage(
            primer=lambda **kwargs: "data" in kwargs,
            status=lambda primer, **kwargs: 0 if primer else 1,
            message=lambda primer, **kwargs: "" if primer else "missing key"
        ),
        Stage(
            requires={Previous: 0},
            primer=lambda data, **kwargs: isinstance(data, list),
            status=lambda primer, **kwargs: 0 if primer else 1,
            message=lambda primer, **kwargs: "" if primer else "bad type"
        ),
        Stage(
            requires={Previous: 0},
            primer=lambda data, **kwargs: all(isinstance(i, int) for i in data),
            status=lambda primer, **kwargs: 0 if primer else 1,
            message=lambda primer, **kwargs: "validation success" if primer else "bad type in data"
        ),
        exit_on_status=1
    )
>>> pipeline.run(**{}).stages
[('missing key', 1)]
>>> pipeline.run(**{"data": 1}).stages
[('', 0), ('bad type', 1)]
>>> pipeline.run(**{"data": [1, "2", 3]}).stages
[('', 0), ('', 0), ('bad type in data', 1)]
>>> pipeline.run(**{"data": [1, 2, 3]}).stages
[('', 0), ('', 0), ('validation success', 0)]
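
To make the control flow of the example explicit, the following standalone sketch reproduces the same three checks in plain Python. It does not use data-plumber and is not how the library is implemented; the names `CHECKS`, `run_checks`, and `last_message` are hypothetical and exist only for illustration. The early `break` stands in for what `exit_on_status=1` appears to do in the outputs above.

```python
# Standalone sketch: mirrors the three checks above without importing data_plumber.
# CHECKS, run_checks and last_message are hypothetical names, not library API.

CHECKS = [
    # (predicate on the input kwargs, message on success, message on failure)
    (lambda kwargs: "data" in kwargs, "", "missing key"),
    (lambda kwargs: isinstance(kwargs["data"], list), "", "bad type"),
    (
        lambda kwargs: all(isinstance(i, int) for i in kwargs["data"]),
        "validation success",
        "bad type in data",
    ),
]


def run_checks(**kwargs):
    """Return a list of (message, status) tuples, stopping at the first failure."""
    results = []
    for predicate, ok_message, error_message in CHECKS:
        passed = predicate(kwargs)
        results.append((ok_message if passed else error_message, 0 if passed else 1))
        if not passed:
            break  # assumed counterpart of exit_on_status=1
    return results


def last_message(results):
    """Pick the message of the final executed check, e.g. for an error response."""
    return results[-1][0]
```

Calling `last_message(run_checks(data=1))` yields the message of the check that failed, which is the kind of value a caller would typically report back to the user.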

Download files

Source Distribution

data-plumber-1.4.0.tar.gz (10.5 kB)

Uploaded Source

Built Distribution

data_plumber-1.4.0-py3-none-any.whl (12.9 kB)

Uploaded Python 3
