Skip to main content

Easy pipeline to process datas.

Project description

dl_data_pipeline

dl_data_pipeline is a flexible and dynamic Python package for building data processing pipelines. It allows you to create pipelines using input nodes, chain them through various processing functions, and validate the outputs at different stages. The package supports deferred execution for dynamic graph building.

Installation

You can install dl_data_pipeline via pip:

pip install dl_data_pipeline

Or, you can install it from the source:

git clone https://github.com/sacha-renault/DataPipeline.git
cd DataPipeline
pip install -r requirements_dist.txt
pip install python3 setup.py sdist bdist_wheel

Basic example

Here’s a quick example to demonstrate how to set up and run a basic pipeline:

# Import the necessary module
import dl_data_pipeline as dp
from dl_data_pipeline.process_functions import process_2d

# Define the inputs for the pipeline
input_node1 = dp.InputNode(name="1")

# Pass the input through functions to create the graph
x = process_2d.open_rgb_image(input_node1)
out1 = process_2d.padding_2d(x, (256,256), fill_value = 0.0)

# Create the pipeline by specifying the inputs and outputs
pipe = dp.Pipeline(inputs=[input_node1], outputs=[out1])

# Call the pipeline with the required inputs and get the outputs
img = pipe("path/to/image.png")

documentation

For more advanced usage, including adding multiple inputs/outputs, custom validators, and deferred execution, please check the full documentation here.

Contributing

We welcome anyone interested in adding their own preprocess functions to the package! If you have an idea or useful transformation you'd like to share, feel free to contribute by submitting a pull request. Your contributions are always appreciated!

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dl_data_pipeline-0.1.1.tar.gz (30.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

dl_data_pipeline-0.1.1-py3-none-any.whl (23.4 kB view details)

Uploaded Python 3

File details

Details for the file dl_data_pipeline-0.1.1.tar.gz.

File metadata

  • Download URL: dl_data_pipeline-0.1.1.tar.gz
  • Upload date:
  • Size: 30.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.6

File hashes

Hashes for dl_data_pipeline-0.1.1.tar.gz
Algorithm Hash digest
SHA256 ccfd9ba9ba07d38839204caeb7a6fb976af8c44e6acd9bb062bc74e9cc64c21c
MD5 433f1ca84d86b8303c705986d21b206f
BLAKE2b-256 5b76c73f57bbcbaae5c95bca75e98ddd18e6e5488e2c1af4281753af175a27f1

See more details on using hashes here.

File details

Details for the file dl_data_pipeline-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for dl_data_pipeline-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 33e5628ef0a8a01de1d13fd518b121f64c366084af3ff430d072d5f88178c7d2
MD5 e5b9fccd836029e3fb74c8e823ddd607
BLAKE2b-256 fe0da632ad9ceedd7858d16f6326a75718ba6cb633ee1cfdbf88411c964c51b8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page