Skip to main content

Easy pipeline to process datas.

Project description

dl_data_pipeline

dl_data_pipeline is a flexible and dynamic Python package for building data processing pipelines. It allows you to create pipelines using input nodes, chain them through various processing functions, and validate the outputs at different stages. The package supports deferred execution for dynamic graph building.

Installation

You can install dl_data_pipeline via pip:

pip install dl_data_pipeline

Or, you can install it from the source:

git clone https://github.com/sacha-renault/DataPipeline.git
cd DataPipeline
pip install -r requirements_dist.txt
pip install python3 setup.py sdist bdist_wheel

Basic example

Here’s a quick example to demonstrate how to set up and run a basic pipeline:

# Import the necessary module
import dl_data_pipeline as dp
from dl_data_pipeline.process_functions import process_2d

# Define the inputs for the pipeline
input_node1 = dp.InputNode(name="1")

# Pass the input through functions to create the graph
x = process_2d.open_rgb_image(input_node1)
out1 = process_2d.padding_2d(x, (256,256), fill_value = 0.0)

# Create the pipeline by specifying the inputs and outputs
pipe = dp.Pipeline(inputs=[input_node1], outputs=[out1])

# Call the pipeline with the required inputs and get the outputs
img = pipe("path/to/image.png")

documentation

For more advanced usage, including adding multiple inputs/outputs, custom validators, and deferred execution, please check the full documentation here.

Contributing

We welcome anyone interested in adding their own preprocess functions to the package! If you have an idea or useful transformation you'd like to share, feel free to contribute by submitting a pull request. Your contributions are always appreciated!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dl_data_pipeline-0.1.0.tar.gz (28.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dl_data_pipeline-0.1.0-py3-none-any.whl (21.6 kB view details)

Uploaded Python 3

File details

Details for the file dl_data_pipeline-0.1.0.tar.gz.

File metadata

  • Download URL: dl_data_pipeline-0.1.0.tar.gz
  • Upload date:
  • Size: 28.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for dl_data_pipeline-0.1.0.tar.gz
Algorithm Hash digest
SHA256 e263fa00a404d2632d2a3a6c3812b60dfda6c820a49b56f157a7f5a795dd9a7d
MD5 e3ac11905406d7a5a07a44465138cf69
BLAKE2b-256 0bf2e635a4927d1859efcfecdbeae419e3c15ceca59b10b6afa7a8b501c3f92a

See more details on using hashes here.

File details

Details for the file dl_data_pipeline-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for dl_data_pipeline-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 c8e8f30f17a70f028f3e7ae39559192571feae85b46f61efa17d9c7578ca159a
MD5 18748f2ba2c3ba49999aa2e413d1da19
BLAKE2b-256 279f1d1f308e6020df1d35887b0db3f6025bb5cb232764206902f86bb510c5fd

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page