
nolead

A lightweight pipeline orchestration library, inspired by Luigi.

Features

  • Simple task definition with the @Task() decorator
  • Automatic dependency resolution with the uses() function
  • Clean task completion with the done() function
  • A single entry point to run entire pipelines with run_task()
  • Parameter passing between tasks, similar to Luigi
  • Task dependency visualization, including parameter information

Installation

pip install nolead

Quick Example

from nolead import Task, run_task, uses, done

@Task()
def fetch_data():
    print("Fetching data...")
    return [1, 2, 3, 4, 5]

@Task()
def process_data():
    print("Processing data...")
    # Get result from the dependent task
    data = uses(fetch_data)
    # Process the data
    processed_data = [x * 2 for x in data]
    return done(processed_data)

@Task()
def save_results():
    print("Saving results...")
    # Get result from the dependent task
    processed_data = uses(process_data)
    # Save the results
    print(f"Results saved: {processed_data}")
    return done(True)

if __name__ == "__main__":
    # Execute the pipeline by running the final task
    result = run_task(save_results)
    print(f"Pipeline completed with result: {result}")

Advanced Usage

Named Tasks

Tasks can be given explicit names and then referenced by name:

@Task(name="fetch_users")
def fetch_users():
    # ... implementation ...
    return users

# Later, refer to the task by name
users = uses("fetch_users")
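
Because tasks are registered under their names, a named task can also be run directly by name via run_task() (a minimal sketch; the returned user list is illustrative):

from nolead import Task, run_task

@Task(name="fetch_users")
def fetch_users():
    # Illustrative placeholder for a real data source query
    return [{"id": 1, "name": "Ada"}]

# Execute the task by its registered name
users = run_task("fetch_users")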

Parameter Passing

Tasks can accept parameters, which can be passed in several ways:

  1. Default parameters in function definitions:

@Task(name="process_data")
def process_data(batch_size=100, validate=False):
    # Use parameters in task logic
    print(f"Processing with batch size {batch_size}")
    # ...

  2. Parameters passed when calling uses():

@Task(name="generate_report")
def generate_report():
    # Pass parameters to the upstream task
    data = uses("process_data", batch_size=200, validate=True)
    # ...

  3. Parameters passed when running a task directly:

# Run with custom parameters
result = run_task("process_data", batch_size=500, validate=True)

Parameters are passed down to the task function and cached based on their values. Each unique combination of a task and parameters is cached separately.
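
As a short sketch of what this caching means in practice (the task bodies here are illustrative; the per-parameter caching itself is the behaviour described above):

from nolead import Task, done, run_task, uses

@Task(name="expensive_fetch")
def expensive_fetch(n=10):
    # With per-parameter caching, this should print once per unique n
    print(f"Computing for n={n}")
    return done(list(range(n)))

@Task(name="consumer")
def consumer():
    a = uses("expensive_fetch", n=5)  # executes the task
    b = uses("expensive_fetch", n=5)  # same parameters: served from cache
    c = uses("expensive_fetch", n=7)  # new parameter value: executes again
    return done((a, b, c))

run_task("consumer")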

Check out the examples/parameter_example.py file for more detailed examples of parameter passing.

Pipeline Visualization

NoLead supports visualization of task dependencies, including the parameters passed between tasks:

from nolead import generate_dependency_graph

# Generate a DOT file for the entire pipeline
generate_dependency_graph("pipeline.dot", output_format="dot")

# Generate a text representation of the dependencies
generate_dependency_graph("pipeline.txt", output_format="text")

Graphical Visualization

To render the DOT file as an image, you can use Graphviz:

dot -Tpng pipeline.dot -o pipeline.png
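
Graphviz supports other output formats as well, for example SVG:

dot -Tsvg pipeline.dot -o pipeline.svg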

For example, this pipeline with parameters:

@Task(name="fetch_data")
def fetch_data(source="database", limit=10):
    # ... implementation ...
    return data

@Task(name="process_data")
def process_data(transformation="double"):
    # Pass specific parameters to the upstream task
    data = uses("fetch_data", source="api", limit=5)
    # ... implementation ...
    return done(result)

@Task(name="analyze_data")
def analyze_data(method="sum"):
    # Pass specific parameters to the upstream task
    data = uses("process_data", transformation="square")
    # ... implementation ...
    return done(result)

@Task(name="format_results")
def format_results(format_type="text"):
    # Pass specific parameters to the upstream task
    value = uses("analyze_data", method="avg")
    # ... implementation ...
    return done(result)
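
To execute this pipeline end to end, run the final task; any parameter not overridden at a uses() call falls back to its default (following the run_task() usage shown earlier):

# Triggers analyze_data -> process_data -> fetch_data with the
# parameters specified at each uses() call
result = run_task("format_results")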

When rendered with Graphviz, the resulting graph shows the parameters passed between tasks:

[Image: Pipeline Visualization Example]

Parallel Task Execution

NoLead supports running multiple tasks in parallel and visualizing parallel task groups in the pipeline graph.

from nolead import Task, uses, parallel, run_task

@Task()
def fetch_data():
    # ... implementation ...
    return data

@Task()
def process_data():
    raw_data = uses(fetch_data)
    # ... implementation ...
    return processed_data

@Task()
def calculate_sum():
    data = uses(process_data)
    # ... implementation ...
    return {"sum": sum(data)}

@Task()
def calculate_average():
    data = uses(process_data)
    # ... implementation ...
    return {"average": sum(data) / len(data)}

@Task()
def generate_report():
    # Run calculations in parallel
    results = parallel([calculate_sum, calculate_average])

    # Access results from each parallel task
    sum_result = results["calculate_sum"]["sum"]
    avg_result = results["calculate_average"]["average"]

    return {"summary": f"Sum: {sum_result}, Average: {avg_result}"}

Parallel Task Visualization

Parallel tasks are visualized with special styling in the dependency graph:

  1. In DOT format (Graphviz):

    • Parallel tasks are grouped in a subgraph with a dashed border
    • Edges connecting to parallel task groups have a dashed, bold style
  2. In text format:

    • Parallel tasks are listed as a separate section
    • Tasks that are part of parallel groups are marked with [parallel]

Example of a pipeline with parallel tasks visualized with Graphviz:

[Image: Parallel Tasks Visualization]

The visualization clearly shows which tasks are executed in parallel (calculate_sum and calculate_average), making the pipeline structure easier to understand.

Parallel Task Result Format

When using parallel(), the results are returned as a dictionary where:

  • Keys are the task names
  • Values are the return values from each task

results = parallel([task1, task2])
# Results will be:
# {
#   "task1": {"result": "data from task1"},
#   "task2": {"result": "data from task2"}
# }
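
Because the keys are task names, the results can also be consumed generically (a short sketch assuming the dictionary format above):

results = parallel([calculate_sum, calculate_average])
for task_name, value in results.items():
    print(f"{task_name} -> {value}")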

For more detailed information on parallel task execution, see the parallel tasks documentation and our detailed example.

Text-Based Visualization

Alternatively, you can use the text-based visualization, generated with generate_dependency_graph("pipeline.txt", output_format="text") as shown above:

Pipeline Dependency Graph
=========================

Parallel Task Groups:
  Group 1: calculate_average, calculate_sum

Task: calculate_average
  Dependencies:
    - process_data

Task: calculate_sum
  Dependencies:
    - process_data

Task: fetch_data
  Dependencies: None

Task: generate_report
  Dependencies:
    - calculate_average [parallel]
    - calculate_sum [parallel]

Task: process_data
  Dependencies:
    - fetch_data

Development

This project uses several development tools to ensure code quality:

  • Ruff: For linting and code formatting
  • Mypy: For static type checking
  • Pytest: For unit testing

Development Setup

# Install development dependencies
make deps

Running Tests and Checks

# Run all checks (lint, type check, tests)
make all

# Run individual checks
make lint    # Run linting
make check   # Run type checking
make test    # Run unit tests

# Clean up project
make clean

License

Apache 2.0 License
