Easy pipeline to process data.
Project description
dl_data_pipeline
dl_data_pipeline is a flexible and dynamic Python package for building data processing pipelines. It allows you to create pipelines using input nodes, chain them through various processing functions, and validate the outputs at different stages. The package supports deferred execution for dynamic graph building.
Installation
You can install dl_data_pipeline via pip:
pip install dl_data_pipeline
Or, you can install it from the source:
git clone https://github.com/sacha-renault/DataPipeline.git
cd DataPipeline
pip install -r requirements_dist.txt
python3 setup.py sdist bdist_wheel
pip install dist/*.whl
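After either installation route, it can be useful to confirm that the package is visible to the interpreter you intend to use. The snippet below is a small sketch that relies only on the standard library (`importlib.metadata`, Python 3.8+); the helper name `installed_version` is made up for this example and is not part of dl_data_pipeline.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is missing.

    Hypothetical helper for illustration; not part of dl_data_pipeline.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


found = installed_version("dl_data_pipeline")
if found is None:
    print("dl_data_pipeline is not installed in this environment")
else:
    print(f"dl_data_pipeline {found} is installed")
```

If the first message is printed, the most likely cause is that `pip` and `python` point at different environments.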
Basic example
Here’s a quick example to demonstrate how to set up and run a basic pipeline:
# Import the necessary modules
import dl_data_pipeline as dp
from dl_data_pipeline.process_functions import process_2d
# Define the inputs for the pipeline
input_node1 = dp.InputNode(name="1")
# Pass the input through functions to create the graph
x = process_2d.open_rgb_image(input_node1)
out1 = process_2d.padding_2d(x, (256, 256), fill_value=0.0)
# Create the pipeline by specifying the inputs and outputs
pipe = dp.Pipeline(inputs=[input_node1], outputs=[out1])
# Call the pipeline with the required inputs and get the outputs
img = pipe("path/to/image.png")
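In the example above, calling a processing function on a node does not process anything yet; it only extends the graph, and the work happens when the pipeline is called. The following is a minimal, self-contained sketch of that deferred-execution idea in plain Python. It is not the package's implementation: `Node`, `deferred`, and `TinyPipeline` are hypothetical names invented for this illustration, and the real classes may behave differently.

```python
from typing import Any, Callable, Dict, Optional, Sequence, Tuple


class Node:
    """Hypothetical graph node: remembers a function and its parent nodes."""

    def __init__(self, func: Optional[Callable[..., Any]] = None,
                 parents: Tuple["Node", ...] = ()) -> None:
        self.func = func
        self.parents = parents

    def evaluate(self, feed: Dict["Node", Any]) -> Any:
        # Input nodes read their value from the feed; other nodes compute
        # their value from their parents only when asked to.
        if self in feed:
            return feed[self]
        if self.func is None:
            raise ValueError("input node was not given a value")
        return self.func(*(parent.evaluate(feed) for parent in self.parents))


def deferred(func: Callable[..., Any]) -> Callable[..., Node]:
    """Make func record a new node when called on nodes, instead of running."""
    def wrapper(*nodes: Node) -> Node:
        return Node(func, nodes)
    return wrapper


class TinyPipeline:
    """Hypothetical pipeline: binds values to inputs, then evaluates outputs."""

    def __init__(self, inputs: Sequence[Node], outputs: Sequence[Node]) -> None:
        self.inputs = list(inputs)
        self.outputs = list(outputs)

    def __call__(self, *values: Any) -> Any:
        if len(values) != len(self.inputs):
            raise ValueError("expected one value per input node")
        feed = dict(zip(self.inputs, values))
        results = [out.evaluate(feed) for out in self.outputs]
        return results[0] if len(results) == 1 else tuple(results)


@deferred
def double(x):
    return x * 2


@deferred
def add(a, b):
    return a + b


a, b = Node(), Node()
out = add(double(a), b)          # builds the graph; nothing is computed yet
pipe = TinyPipeline(inputs=[a, b], outputs=[out])
result = pipe(3, 4)              # runs the graph: 3 * 2 + 4
print(result)                    # 10
```

The sketch re-evaluates a node each time it is reached, so a node shared by several outputs would run more than once; caching results per call would avoid that.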
Documentation
For more advanced usage, including multiple inputs and outputs, custom validators, and deferred execution, please see the full documentation.
Contributing
We welcome anyone interested in adding their own preprocess functions to the package! If you have an idea or useful transformation you'd like to share, feel free to contribute by submitting a pull request. Your contributions are always appreciated!
Download files
File details
Details for the file dl_data_pipeline-0.1.1.tar.gz.
File metadata
- Download URL: dl_data_pipeline-0.1.1.tar.gz
- Upload date:
- Size: 30.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | ccfd9ba9ba07d38839204caeb7a6fb976af8c44e6acd9bb062bc74e9cc64c21c |
| MD5 | 433f1ca84d86b8303c705986d21b206f |
| BLAKE2b-256 | 5b76c73f57bbcbaae5c95bca75e98ddd18e6e5488e2c1af4281753af175a27f1 |
File details
Details for the file dl_data_pipeline-0.1.1-py3-none-any.whl.
File metadata
- Download URL: dl_data_pipeline-0.1.1-py3-none-any.whl
- Upload date:
- Size: 23.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/5.1.1 CPython/3.12.6
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 33e5628ef0a8a01de1d13fd518b121f64c366084af3ff430d072d5f88178c7d2 |
| MD5 | e5b9fccd836029e3fb74c8e823ddd607 |
| BLAKE2b-256 | fe0da632ad9ceedd7858d16f6326a75718ba6cb633ee1cfdbf88411c964c51b8 |
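The SHA256 digests listed above can be used to check a manually downloaded file before installing it. The sketch below uses only the standard library (`hashlib`, `pathlib`); the helper names `sha256_of_file` and `verify_download` are made up for this example, and the local file paths are whatever you saved the downloads as.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file details listed on this page.
EXPECTED_SHA256 = {
    "dl_data_pipeline-0.1.1.tar.gz":
        "ccfd9ba9ba07d38839204caeb7a6fb976af8c44e6acd9bb062bc74e9cc64c21c",
    "dl_data_pipeline-0.1.1-py3-none-any.whl":
        "33e5628ef0a8a01de1d13fd518b121f64c366084af3ff430d072d5f88178c7d2",
}


def sha256_of_file(path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path) -> bool:
    """Compare a downloaded file against the digest listed for its file name."""
    path = Path(path)
    expected = EXPECTED_SHA256.get(path.name)
    if expected is None:
        raise KeyError(f"no known digest for {path.name}")
    return sha256_of_file(path) == expected
```

For example, `verify_download("dl_data_pipeline-0.1.1.tar.gz")` returns `True` only if the file in the current directory matches the listed digest.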