Orchestrate graphs of callables in Python with automatic dependency resolution, parallel execution, retries, timeouts, and HTML email alerts on failure — zero dependencies

Maintainers

oliverm91

Project description

Processes: Smart Task Orchestration

Fast & Lightweight

Run a list of Python callables that depend on each other — in parallel when possible, with per-task log files and optional HTML email notification on failure. Zero dependencies. Pure Python 3.11+.

✨ Why Processes?

🔗 Declare what depends on what: write your tasks in any order; the runtime sorts them so every dependency runs first.
⚡ Run in parallel when you can: independent tasks run together on a thread pool; parallel mode switches on automatically for workflows with 10+ tasks.
🛡️ One failure doesn't stop the rest: a failed task causes only the tasks that depend on it to be skipped, and every other part of the workflow keeps running.
📝 One log file per task: share a single log across the whole run, or keep them separate for easier debugging.
📧 Email alerts when something breaks: pass an EmailChannel built from an SMTPConfig to report.notify(...) and get a styled HTML email with the traceback, task context, and the list of tasks that were skipped.
🧰 Modern, strictly-typed Python 3.11+: from __future__ import annotations, mypy --strict clean, dict[str, TaskResult], set[str], | unions.

⚙️ How it works

A Process holds a list of Tasks. At construction it validates names, types, dependency references, and detects cycles — raising before anything runs.
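The construction-time checks can be approximated with the standard library. This is a minimal sketch rather than the library's own code: `validate` and the plain dict-of-dependencies format are hypothetical stand-ins for Process and Task, and built-in exceptions stand in for DependencyNotFoundError and CircularDependencyError.

```python
from graphlib import CycleError, TopologicalSorter


def validate(deps: dict[str, list[str]]) -> None:
    """Raise before anything runs if the graph is malformed.

    `deps` maps a task name to the names it depends on
    (a hypothetical format used only for this sketch).
    """
    for name, upstream in deps.items():
        if " " in name:
            raise ValueError(f"task name {name!r} contains a space")
        for dep in upstream:
            if dep not in deps:
                raise LookupError(f"{name!r} depends on unknown task {dep!r}")
    try:
        # prepare() inspects the whole graph and raises CycleError on a cycle.
        TopologicalSorter(deps).prepare()
    except CycleError as exc:
        # args[1] holds the list of nodes that form the cycle.
        raise ValueError(f"circular dependency: {exc.args[1]}") from exc


validate({"load": [], "enrich": ["load"]})  # well-formed graph: no exception
```

Failing at construction means a typo in a dependency name surfaces immediately, not halfway through a long run.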

When you call process.run(), tasks are topologically sorted and scheduled: dependencies first, independent tasks in parallel.
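To illustrate "dependencies first, independent tasks in parallel", here is a sketch built on `graphlib` and a thread pool. It shows the general technique only. The function name `run_in_dependency_order` and the `deps`/`funcs` dictionaries are assumptions made for this example, and the library's scheduler may work differently.

```python
from concurrent.futures import FIRST_COMPLETED, ThreadPoolExecutor, wait
from graphlib import TopologicalSorter


def run_in_dependency_order(deps, funcs, max_workers=4):
    """Run funcs[name]() for every task, starting each one as soon as
    all of its dependencies have finished."""
    sorter = TopologicalSorter(deps)
    sorter.prepare()
    finished_order = []
    with ThreadPoolExecutor(max_workers=max_workers) as pool:
        running = {}  # future -> task name
        while sorter.is_active():
            # Every task whose dependencies are done can start now.
            for name in sorter.get_ready():
                running[pool.submit(funcs[name])] = name
            done, _ = wait(running, return_when=FIRST_COMPLETED)
            for future in done:
                name = running.pop(future)
                future.result()    # re-raise here if the task failed
                finished_order.append(name)
                sorter.done(name)  # unblocks the tasks that depend on it
    return finished_order


deps = {"load_a": [], "load_b": [], "join": ["load_a", "load_b"]}
funcs = {name: (lambda: None) for name in deps}
order = run_in_dependency_order(deps, funcs)
print(order[-1])  # join
```

The two loaders may finish in either order, but `join` always comes last because it is only released once both are marked done.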

A TaskDependency can forward an upstream result directly into a downstream function, as a positional or keyword argument. process.run() returns a ProcessExecutionReport whose successes, errored, and skipped mappings are available for inspection.

🚀 Quick start

A short "hello pipeline": one upstream task feeding a downstream one. It passes parallel=True, but because enrich depends on load, the two tasks still run one after the other.

from processes import Process, Task, TaskDependency


def load_users() -> list[dict]:
    return [{"id": 1}, {"id": 2}, {"id": 3}]


def enrich(users: list[dict]) -> list[dict]:
    return [{**u, "name": f"user-{u['id']}"} for u in users]


tasks = [
    Task("load", load_users, "run.log"),
    Task(
        "enrich",
        enrich,
        "run.log",
        dependencies=[TaskDependency("load", use_result_as_additional_args=True)],
    ),
]

with Process(tasks) as p:
    result = p.run(parallel=True)

print(result.successes["enrich"].result)
# [{'id': 1, 'name': 'user-1'}, {'id': 2, 'name': 'user-2'}, {'id': 3, 'name': 'user-3'}]

🧪 End-to-end example

A realistic mini-pipeline: fetch two sources in parallel, transform them, aggregate, and notify — with per-task log files, result piping, and one task deliberately failing to show fault isolation.


import logging
from pathlib import Path

from processes import EmailChannel, HTMLEmailStyle, Process, SMTPConfig, Task, TaskDependency

LOG_DIR = Path("logs")
LOG_DIR.mkdir(exist_ok=True)


# --- 1. Two independent "fetch" tasks that run in parallel -----------------
def fetch_orders() -> list[dict]:
    logging.info("querying orders API")
    return [{"order_id": 1, "amount": 42.0}, {"order_id": 2, "amount": 17.5}]


def fetch_inventory() -> list[dict]:
    logging.info("querying inventory API")
    return [{"sku": "A-1", "qty": 12}, {"sku": "B-2", "qty": 3}]


# --- 2. Two transforms that consume the upstream results -------------------
def total_revenue(orders: list[dict]) -> float:
    total = sum(o["amount"] for o in orders)
    logging.info("revenue computed: %s", total)
    return total


def stock_value(inventory: list[dict], *, price_per_unit: float = 10.0) -> float:
    value = sum(i["qty"] for i in inventory) * price_per_unit
    logging.info("stock value: %s", value)
    return value


# --- 3. An aggregator that joins the two branches --------------------------
def build_report(*, revenue: float, stock: float) -> str:
    return f"daily-report | revenue={revenue:.2f} stock={stock:.2f}"


# --- 4. A flaky notifier that ALWAYS fails — to show fault isolation -------
def notify_slack(report: str) -> None:
    raise RuntimeError("slack webhook returned 503")


# --- 5. A sibling task that does NOT depend on notify and still runs -------
def archive_report(report: str) -> str:
    out = LOG_DIR / "report.txt"
    out.write_text(report)
    return str(out)


# --- 6. Optional: SMTP config so failures page on-call --------------------
smtp = SMTPConfig(
    mailhost=("smtp.example.com", 587),
    fromaddr="alerts@example.com",
    toaddrs=["oncall@example.com"],
    credentials=("user", "pass"),
    secure=(),
)

tasks = [
    Task("fetch_orders",   fetch_orders,   LOG_DIR / "fetch_orders.log"),
    Task("fetch_inventory", fetch_inventory, LOG_DIR / "fetch_inventory.log"),

    Task(
        "compute_revenue",
        total_revenue,
        LOG_DIR / "compute_revenue.log",
        dependencies=[TaskDependency("fetch_orders", use_result_as_additional_args=True)],
    ),
    Task(
        "compute_stock",
        stock_value,
        LOG_DIR / "compute_stock.log",
        kwargs={"price_per_unit": 7.25},
        dependencies=[
            TaskDependency(
                "fetch_inventory",
                use_result_as_additional_kwargs=True,
                additional_kwarg_name="inventory",
            )
        ],
    ),

    Task(
        "build_report",
        build_report,
        LOG_DIR / "build_report.log",
        dependencies=[
            TaskDependency("compute_revenue", use_result_as_additional_kwargs=True,
                           additional_kwarg_name="revenue"),
            TaskDependency("compute_stock",    use_result_as_additional_kwargs=True,
                           additional_kwarg_name="stock"),
        ],
    ),

    # notify_slack fails on purpose. archive_report is a *sibling*
    # of notify_slack (both depend on build_report), so it has no
    # dependency on the failed task and runs normally — the rest of
    # the workflow is not blackholed by one broken step.
    Task(
        "notify_slack",
        notify_slack,
        LOG_DIR / "notify_slack.log",
        dependencies=[TaskDependency("build_report", use_result_as_additional_args=True)],
    ),
    Task(
        "archive_report",
        archive_report,
        LOG_DIR / "archive_report.log",
        dependencies=[TaskDependency("build_report", use_result_as_additional_args=True)],
    ),
]

with Process(tasks) as process:
    result = process.run(parallel=True)
    result.notify(EmailChannel(smtp), only_errors=True)  # one report email for the failed task(s)

print("passed:", sorted(result.successes))
# archive_report, build_report, compute_revenue, compute_stock, fetch_inventory, fetch_orders
print("failed:", sorted(set(result.errored) | set(result.skipped)))
# notify_slack
print("report:", result.successes["build_report"].result)
# daily-report | revenue=59.50 stock=108.75

The failing notify_slack task does not abort the run. archive_report is a sibling of the failed task (both depend on the successful build_report), so it runs unaffected. Calling result.notify(EmailChannel(smtp), only_errors=True) then delivers a single report email covering the failed task, with its traceback and any downstream tasks that were skipped because of it (none in this example, since nothing depends on notify_slack).

📚 API Reference


`Task`

Task(
    name: str,
    func: Callable[..., Any],
    log_path: str | None = None,
    args: tuple = (),
    kwargs: dict | None = None,
    dependencies: list[TaskDependency] | None = None,
    traced_vars_frame_filter: str | None = None,
    timeout: float | None = None,
    retries: int | None = 0,
    retry_on: tuple[type[Exception], ...] | None = None,
)

name — unique within the Process; no spaces.
log_path — the file this task logs to (INFO level, format %(asctime)s - %(name)s - %(levelname)s - %(message)s), with the structured failure context appended on error. None (the default) means no file logging; a NullHandler is attached instead. A Task does not send notifications — error notification is delegated to ProcessExecutionReport.notify.
func — the callable; receives func(*args, **kwargs) after result-injection.
traced_vars_frame_filter — substring selecting which traceback frame's locals are captured into the failure context (and thus into both the logfile and any report notification). None (default) captures the outermost user frame.
timeout — seconds allowed per attempt; None means no limit. When the timeout fires the underlying thread is detached (Python threading limitation).
retries — additional attempts after the first failure; 0 or None means a single attempt. Defaults to 0.
retry_on — tuple of exception types that trigger a retry. When retries >= 1 and retry_on is None, defaults to (ConnectionError, TimeoutError) at call time.
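The note on timeout reflects a general Python constraint: a thread cannot be forcibly stopped from outside. A minimal sketch of a per-attempt timeout under that constraint follows. `call_with_timeout` is a hypothetical helper written for this example, not part of the library.

```python
import threading
import time


def call_with_timeout(func, timeout):
    """Run func() in a daemon thread and stop waiting after `timeout` seconds."""
    outcome = {}

    def target():
        try:
            outcome["result"] = func()
        except Exception as exc:
            outcome["error"] = exc

    worker = threading.Thread(target=target, daemon=True)
    worker.start()
    worker.join(timeout)
    if worker.is_alive():
        # The thread cannot be killed: it is abandoned and keeps running.
        raise TimeoutError(f"no result after {timeout} s")
    if "error" in outcome:
        raise outcome["error"]
    return outcome["result"]


print(call_with_timeout(lambda: 7, timeout=1.0))  # 7

try:
    call_with_timeout(lambda: time.sleep(0.5), timeout=0.05)
except TimeoutError as exc:
    print("timed out:", exc)
```

Because the abandoned thread keeps running, a task function with side effects may still complete them after the timeout has been reported.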

`TaskDependency`

TaskDependency(
    task_name: str,
    use_result_as_additional_args: bool = False,
    use_result_as_additional_kwargs: bool = False,
    additional_kwarg_name: str = "",
)

use_result_as_additional_args=True — upstream result appended as the next positional arg.
use_result_as_additional_kwargs=True with a non-empty additional_kwarg_name — upstream result injected as a keyword arg.
Both flags can be combined (positional first, then kwarg).
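The injection rules can be pictured as a small function that assembles the final call. This is an illustrative sketch: `build_call` and the `(task_name, as_arg, kwarg_name)` tuples are hypothetical simplifications of TaskDependency, used so the example runs without the library.

```python
def build_call(args, kwargs, dependencies, upstream_results):
    """Return the (args, kwargs) a downstream function would receive.

    `dependencies` is a list of (task_name, as_arg, kwarg_name) tuples,
    a hypothetical flattening of TaskDependency for this sketch.
    """
    call_args = list(args)
    call_kwargs = dict(kwargs)
    for task_name, as_arg, kwarg_name in dependencies:
        result = upstream_results[task_name]
        if as_arg:
            call_args.append(result)          # next positional argument
        if kwarg_name:
            call_kwargs[kwarg_name] = result  # keyword argument
    return tuple(call_args), call_kwargs


def stock_value(inventory, *, price_per_unit=10.0):
    return sum(i["qty"] for i in inventory) * price_per_unit


args, kwargs = build_call(
    (),
    {"price_per_unit": 7.25},
    [("fetch_inventory", False, "inventory")],
    {"fetch_inventory": [{"sku": "A-1", "qty": 12}, {"sku": "B-2", "qty": 3}]},
)
print(stock_value(*args, **kwargs))  # 108.75
```

The usage mirrors the compute_stock task from the end-to-end example: the task's own kwargs and the injected upstream result end up in the same call.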

`Process`

Process(tasks: list[Task])  # validates types, names, deps, cycles

process.run(parallel: bool | None = None, max_workers: int = 4) -> ProcessExecutionReport

Raises DependencyNotFoundError, CircularDependencyError, TypeError, ValueError on construction if the workflow is malformed.
parallel=None auto-parallelises when len(tasks) >= 10; max_workers=1 is always sequential.
Use as a context manager — it cleans up FileHandlers on exit.
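The mode-selection rule for run() can be written out as a tiny decision function. This is a sketch of the rule as documented here. `should_run_parallel` is a hypothetical name, and the precedence of max_workers=1 over parallel=True is inferred from "always sequential".

```python
def should_run_parallel(n_tasks, parallel=None, max_workers=4):
    """Decide between sequential and parallel execution."""
    if max_workers == 1:
        return False          # a single worker is always sequential
    if parallel is None:
        return n_tasks >= 10  # auto mode: parallel from 10 tasks upward
    return parallel           # explicit True/False wins otherwise


print(should_run_parallel(3))                 # False
print(should_run_parallel(10))                # True
print(should_run_parallel(3, parallel=True))  # True
```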

`TaskResult`

TaskResult(
    status: TaskStatus,
    result: Any,
    exception: Exception | None,
    error_data: ErrorData | None = None,
    elapsed_seconds: float = 0.0,  # wall-clock time across all attempts
    attempts: int = 0,             # attempts actually executed (0 if never run)
)

status — TaskStatus.PENDING | SUCCESS | ERRORED | SKIPPED.
worked — True if status == TaskStatus.SUCCESS.

`ProcessExecutionReport`

report.entries    # dict[str, TaskReportEntry] — one entry per task, in topological order
report.successes  # dict[str, TaskReportEntry] — entries with status == TaskStatus.SUCCESS
report.errored    # dict[str, TaskReportEntry] — entries with status == TaskStatus.ERRORED
report.skipped    # dict[str, TaskReportEntry] — entries with status == TaskStatus.SKIPPED

A TaskReportEntry carries name, function, args, kwargs, status (TaskStatus.SUCCESS | ERRORED | SKIPPED), elapsed_seconds, attempts, plus result (set when SUCCESS) and error: ErrorData | None (set when ERRORED).

with Process([load_task, apply_task, notify_task]) as process:
    report = process.run()

for name, entry in report.entries.items():
    if entry.status is TaskStatus.ERRORED:
        print(f"{name} failed after {entry.attempts} attempt(s): {entry.error.exception}")

Deliver the report through one or more channels with notify:

report.notify(
    *channels: ReportChannel,
    only_errors: bool = False,        # restrict the payload to ERRORED tasks
    tasks: list[str] | None = None,   # restrict to these task names (case-insensitive)
    show_warnings: bool = True,       # warn (not raise) if a channel fails
)

only_errors and tasks compose (both filters apply). A failing channel never aborts the others. Built-in channels: EmailChannel (HTML email) and WebhookChannel (JSON POST).
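The two behaviours described here, composing filters and isolating channel failures, can be sketched with plain functions. The names `select_entries` and `deliver`, the status strings, and the use of bare callables as channels are all assumptions for illustration. The real notify works on TaskReportEntry objects and ReportChannel instances.

```python
import warnings


def select_entries(entries, only_errors=False, tasks=None):
    """Apply both filters; `entries` maps task name -> status string."""
    wanted = None if tasks is None else {t.lower() for t in tasks}
    return {
        name: status
        for name, status in entries.items()
        if (not only_errors or status == "ERRORED")
        and (wanted is None or name.lower() in wanted)
    }


def deliver(channels, payload, show_warnings=True):
    """Call every channel; a failing one warns instead of aborting the rest."""
    delivered = 0
    for send in channels:
        try:
            send(payload)
            delivered += 1
        except Exception as exc:
            if show_warnings:
                warnings.warn(f"channel {send!r} failed: {exc}")
    return delivered


entries = {"load": "SUCCESS", "Notify": "ERRORED", "archive": "ERRORED"}
print(select_entries(entries, only_errors=True, tasks=["notify", "load"]))
# {'Notify': 'ERRORED'}
```

Catching per channel means an SMTP outage does not prevent a webhook from firing, which is usually what you want from an alerting path.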

`ErrorData`

ErrorData(
    task_name: str,
    function: str,
    args: tuple[Any, ...],
    kwargs: dict[str, Any],
    downstream_impact: list[str],
    exception: str,
    traceback_str: str,
    traced_vars: dict[str, str],
    traced_vars_location: str,
)

Structured failure context for a single task, available via TaskResult.error_data and TaskReportEntry.error.

`SMTPConfig`

SMTPConfig(
    mailhost,                      # (host, port)
    fromaddr,
    toaddrs,                       # list[str]
    credentials=None,              # (username, password) | None
    secure=None,                   # () = STARTTLS; omit for no encryption
    timeout=5,
)

`HTMLEmailStyle`

HTMLEmailStyle(
    palette="neutral",             # neutral | catppuccin | neobones | slate
    language="en",                 # en | es | pt | fr | de | it
)

`ReportContent`

ReportContent(
    show_traceback=True,    # include each failure's full traceback
    show_traced_vars=True,  # include each failure's traced local variables
)

Per-channel selection of how much per-task detail a report notification includes. Pass the same instance to several channels for uniform content, or give each its own.

`EmailChannel`

EmailChannel(
    smtp_config: SMTPConfig,
    style: HTMLEmailStyle | None = None,    # defaults to HTMLEmailStyle()
    content: ReportContent | None = None,   # defaults to ReportContent()
)

A ReportChannel that delivers a finished report as a styled HTML email when passed to report.notify(...). The body lists every task with its status and, for each failure, the exception, traceback, downstream impact, and traced variables (subject to content).

Traced Variables

On failure, each task captures the local variables of the outermost user frame in the traceback — i.e. the last frame that is not inside site-packages or your virtualenv — into both its logfile and the report email. A file:line reference next to the section shows exactly where those values were captured. Point this at a different frame per task with Task(traced_vars_frame_filter=…).
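A rough sketch of how locals can be pulled from a traceback frame with the standard library follows. Several details are assumptions: this version skips frames whose file path contains "site-packages", keeps the deepest remaining frame, and matches the hypothetical `frame_filter` substring against the function name. The library's actual selection rule may differ.

```python
import traceback


def capture_locals(exc, frame_filter=None):
    """Return ({name: repr(value)}, "file:line") for one traceback frame."""
    chosen = None
    for frame, lineno in traceback.walk_tb(exc.__traceback__):
        code = frame.f_code
        if "site-packages" in code.co_filename:
            continue  # skip third-party frames
        if frame_filter is None or frame_filter in code.co_name:
            chosen = (frame, lineno)  # keep the last (deepest) match
    if chosen is None:
        return {}, ""
    frame, lineno = chosen
    traced = {name: repr(value) for name, value in frame.f_locals.items()}
    return traced, f"{frame.f_code.co_filename}:{lineno}"


def parse(raw):
    number = int(raw)  # raises ValueError for "abc"
    return number


def job():
    payload = "abc"
    return parse(payload)


try:
    job()
except ValueError as exc:
    deepest, _ = capture_locals(exc)
    filtered, _ = capture_locals(exc, frame_filter="job")

print(deepest)   # {'raw': "'abc'"}
print(filtered)  # {'payload': "'abc'"}
```

Storing `repr()` strings instead of live objects keeps the captured context safe to log and serialise, which matches the `dict[str, str]` type of ErrorData.traced_vars.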

`WebhookChannel`

WebhookChannel(
    webhook_config: WebhookConfig,
    content: ReportContent | None = None,   # defaults to ReportContent()
)

A ReportChannel that POSTs the finished report as JSON when passed to report.notify(...).

WebhookConfig(
    url: str,
    headers: dict[str, str] = {},  # merged with default Content-Type: application/json
    timeout: int = 5,
    secret: str | None = None,  # HMAC-SHA256 signs the body when set
    extra_payload: dict[str, Any] = {},  # extra top-level keys merged into the JSON body
    nest_under: str | None = None,  # nest the generic fields under this key, e.g. "data"
)

Transport configuration for WebhookChannel. When the report is delivered, POSTs a generic JSON payload to url — an entries object mapping each task name to its status, function, elapsed_seconds, attempts, and (for failures) an error block with exception, traceback, downstream_impact, and traced_vars (subject to ReportContent). Not coupled to any specific service (Slack, Discord, etc.).

extra_payload keys are merged into the JSON body and take precedence over the generic fields on collision — useful for service-specific routing data (e.g. a Telegram chat_id or a Slack channel/username override) without subclassing.

nest_under nests the generic fields under a single key instead of leaving them top-level, e.g. nest_under="data" combined with extra_payload={"chat_id": "..."} produces {"data": {"entries": {...}}, "chat_id": "..."}. extra_payload keys always stay top-level and still take precedence on collision. None (the default) keeps the flat payload shape.
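The merge rules for extra_payload and nest_under can be captured in a few lines. This is a sketch of the documented behaviour with a hypothetical helper name, `build_payload`, not the library's internal function.

```python
def build_payload(generic, extra_payload=None, nest_under=None):
    """Combine the generic report fields with caller-supplied extras."""
    # Either nest the generic fields under one key or keep them flat.
    body = {nest_under: dict(generic)} if nest_under else dict(generic)
    # Extra keys always land at the top level and win on collision.
    body.update(extra_payload or {})
    return body


generic = {"entries": {"notify_slack": {"status": "ERRORED"}}}

print(build_payload(generic, {"chat_id": "42"}, nest_under="data"))
# {'data': {'entries': {'notify_slack': {'status': 'ERRORED'}}}, 'chat_id': '42'}

print(build_payload(generic, {"entries": "overridden"}))
# {'entries': 'overridden'}
```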

If secret is set, the request carries an X-Signature-SHA256 header with the hex-encoded hmac.new(secret, body, hashlib.sha256) digest of the JSON body, so receivers can verify the payload wasn't tampered with.
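On the receiving side, verification amounts to recomputing the digest over the raw request body and comparing it with the header. The sketch below assumes the secret is UTF-8 encoded and that the header carries the bare hex digest with no prefix. Neither detail is confirmed by this page, so check them against real requests.

```python
import hashlib
import hmac


def sign(secret: str, body: bytes) -> str:
    """Hex-encoded HMAC-SHA256 of the raw request body."""
    return hmac.new(secret.encode("utf-8"), body, hashlib.sha256).hexdigest()


def is_authentic(secret: str, body: bytes, header_value: str) -> bool:
    """Check a received X-Signature-SHA256 header value against the body."""
    # compare_digest avoids leaking information through timing differences.
    return hmac.compare_digest(sign(secret, body), header_value)


body = b'{"entries": {}}'
signature = sign("s3cret", body)

print(is_authentic("s3cret", body, signature))                 # True
print(is_authentic("s3cret", b'{"entries": {"x": 1}}', signature))  # False
```

Verify against the exact bytes received. Re-serialising the parsed JSON can change whitespace or key order and produce a different digest.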

Fault-tolerance rules in detail

When a task raises:

The exception is caught and stored in TaskResult.exception; the task's entry in the report gets status == TaskStatus.ERRORED.
Every task that depends on it (directly or indirectly) is skipped — its entry gets status == TaskStatus.SKIPPED without running.
Every other independent part of the workflow keeps running. With parallel=True they keep running concurrently on the worker pool.
After run() returns, ProcessExecutionReport.errored and ProcessExecutionReport.skipped let you distinguish root failures from cascade skips for triage or alerting.
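The cascade rule, skipping everything that depends on a failed task directly or indirectly, is a reachability question on the inverted dependency graph. A sketch follows, using the task names from the end-to-end example. `downstream_of` and the dict format are hypothetical.

```python
def downstream_of(failed, deps):
    """Names of all tasks that depend on `failed`, directly or indirectly."""
    # Invert the graph: task -> tasks that list it as a dependency.
    dependants = {name: set() for name in deps}
    for name, upstream in deps.items():
        for dep in upstream:
            dependants[dep].add(name)
    skipped, stack = set(), [failed]
    while stack:
        for child in dependants[stack.pop()]:
            if child not in skipped:
                skipped.add(child)
                stack.append(child)
    return skipped


deps = {
    "fetch_orders": [],
    "fetch_inventory": [],
    "compute_revenue": ["fetch_orders"],
    "compute_stock": ["fetch_inventory"],
    "build_report": ["compute_revenue", "compute_stock"],
    "notify_slack": ["build_report"],
    "archive_report": ["build_report"],
}

print(sorted(downstream_of("notify_slack", deps)))  # []
print(sorted(downstream_of("build_report", deps)))  # ['archive_report', 'notify_slack']
```

A failure in a leaf task such as notify_slack skips nothing, while a failure in fetch_orders would take out the whole revenue branch and everything after the join.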

When a task has retries >= 1, a failure matching retry_on triggers another attempt before the task is declared failed and its dependants are skipped. This gives transient errors (network blips, connection resets) a chance to resolve without aborting downstream work.
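The retry behaviour can be sketched as a loop around the call. `run_with_retries` is a hypothetical helper. The default exception tuple follows the retry_on description in the API reference, and everything else is an illustration, not the library's code.

```python
def run_with_retries(func, retries=0, retry_on=None):
    """Call func(); retry up to `retries` extra times on matching errors.

    Returns (result, attempts). Exceptions outside `retry_on` propagate
    immediately without another attempt.
    """
    if retry_on is None:
        retry_on = (ConnectionError, TimeoutError)
    attempts = 0
    while True:
        attempts += 1
        try:
            return func(), attempts
        except retry_on:
            if attempts > (retries or 0):
                raise  # out of attempts: the task is declared failed


calls = {"n": 0}


def flaky():
    calls["n"] += 1
    if calls["n"] < 3:
        raise ConnectionError("connection reset")
    return "ok"


print(run_with_retries(flaky, retries=2))  # ('ok', 3)
```

Limiting retries to a tuple of transient error types keeps genuine bugs, such as a ValueError from bad input, from being retried pointlessly.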

This makes the library a good fit for fan-out / fan-in pipelines, "best-effort" notifications, and any workflow where one broken step should not blackhole the rest.

Scope & limitations

Processes is not a distributed scheduler — there are no workers on remote machines, no SLA monitoring, no web UI. If you need any of those, reach for a full orchestrator. If you want a small, fast, dependency-aware pipeline that just runs in a single process, this is it.

Advanced configuration

Shared log file: pass the same log_path to every Task for a single combined run.log; pass distinct paths for per-task isolation.
Auto-parallel: Process.run() with no argument runs sequentially for small workflows and switches to parallel for len(tasks) >= 10. Pass parallel=True or parallel=False to force the mode.
Result inspection: iterate report.successes.items() to log or post-process every successful task; iterate set(report.errored) | set(report.skipped) for triage.
Re-raising: the library itself does not raise on partial failure, so a try/except around process.run() will not catch task errors. If you need a non-zero exit code, check report.errored and report.skipped after the run and raise or exit yourself.
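Since the library does not raise on partial failure, one way to get a non-zero exit code is to inspect the report after the run. The sketch below uses a SimpleNamespace as a stand-in for ProcessExecutionReport so that it runs without the library installed. `exit_code` is a hypothetical helper.

```python
from types import SimpleNamespace


def exit_code(report) -> int:
    """0 when every task succeeded, 1 if anything errored or was skipped."""
    return 1 if (report.errored or report.skipped) else 0


# Stand-ins for a ProcessExecutionReport, only for demonstration.
failed_run = SimpleNamespace(errored={"notify_slack": object()}, skipped={})
clean_run = SimpleNamespace(errored={}, skipped={})

print(exit_code(failed_run))  # 1
print(exit_code(clean_run))   # 0
```

In a real script the result of process.run() would take the place of the stand-in, followed by something like `raise SystemExit(exit_code(report))`.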

📦 Installation

From PyPI:

pip install processes

Or straight from the repository (pure Python, no build step):

pip install git+https://github.com/oliverm91/processes.git

Requires Python 3.11+.

📄 License & contributing

Released under the MIT License — see docs for full API details.

Contributions welcome — see CONTRIBUTING.md for the workflow, style, and commit-message conventions used by this project.


Release history

Version	Release date
7.0.4	Jun 22, 2026
7.0.1	Jun 22, 2026 (this version)
7.0.0	Jun 21, 2026
3.1.1	Jun 14, 2026
3.1.0	Jun 14, 2026
3.0.1	Jun 14, 2026
2.0.1	Jun 13, 2026
1.0.5	Jan 19, 2026
1.0.4	Jan 19, 2026
1.0.3	Jan 19, 2026
1.0.2	Jan 19, 2026

Download files


Source Distribution

processes-7.0.1.tar.gz (261.2 kB)

Uploaded Jun 22, 2026 Source

Built Distribution


processes-7.0.1-py3-none-any.whl (45.9 kB)

Uploaded Jun 22, 2026 Python 3

File details

Details for the file processes-7.0.1.tar.gz.

File metadata

Download URL: processes-7.0.1.tar.gz
Upload date: Jun 22, 2026
Size: 261.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for processes-7.0.1.tar.gz
Algorithm	Hash digest
SHA256	`0e83b843e3ebd3bd0cf66b505b2c610c083cdab9983f1c3c169e7fb6170ed28c`
MD5	`5920551eea4956edb0d14f8c5ecd673d`
BLAKE2b-256	`af9f4f1a7406b473addced458a45faec72a5f6e9e372d2a95781836a1cc7662a`


Provenance

The following attestation bundles were made for processes-7.0.1.tar.gz:

Publisher: publish.yml on oliverm91/processes

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: processes-7.0.1.tar.gz
- Subject digest: 0e83b843e3ebd3bd0cf66b505b2c610c083cdab9983f1c3c169e7fb6170ed28c
- Sigstore transparency entry: 1912711517
- Sigstore integration time: Jun 22, 2026
Source repository:
- Permalink: oliverm91/processes@53ce801293ec471af917acd5ce72dbc39ac66c07
- Branch / Tag: refs/tags/v7.0.1
- Owner: https://github.com/oliverm91
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@53ce801293ec471af917acd5ce72dbc39ac66c07
- Trigger Event: release

File details

Details for the file processes-7.0.1-py3-none-any.whl.

File metadata

Download URL: processes-7.0.1-py3-none-any.whl
Upload date: Jun 22, 2026
Size: 45.9 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for processes-7.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`dbbeaaeabb0d5e806aa0d6447c9b77812e488fd276b395d85d4448f58dc7b121`
MD5	`b7745f72943c59a52cf817829c03f3e9`
BLAKE2b-256	`aabd19a56eec231717c007eefe15b4f9aec392df6213b70f8ff43a50d7e2aa6e`


Provenance

The following attestation bundles were made for processes-7.0.1-py3-none-any.whl:

Publisher: publish.yml on oliverm91/processes

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: processes-7.0.1-py3-none-any.whl
- Subject digest: dbbeaaeabb0d5e806aa0d6447c9b77812e488fd276b395d85d4448f58dc7b121
- Sigstore transparency entry: 1912711602
- Sigstore integration time: Jun 22, 2026
Source repository:
- Permalink: oliverm91/processes@53ce801293ec471af917acd5ce72dbc39ac66c07
- Branch / Tag: refs/tags/v7.0.1
- Owner: https://github.com/oliverm91
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@53ce801293ec471af917acd5ce72dbc39ac66c07
- Trigger Event: release
