A Python implementation of durable, event-sourced workflows inspired by Vercel Workflow
PyWorkflow
Distributed, durable workflow orchestration for Python
Build long-running, fault-tolerant workflows with automatic retry, sleep/delay capabilities, and complete observability. PyWorkflow uses event sourcing and Celery for production-grade distributed execution.
What is PyWorkflow?
PyWorkflow is a workflow orchestration framework that enables you to build complex, long-running business processes as simple Python code. It handles the hard parts of distributed systems: fault tolerance, automatic retries, state management, and horizontal scaling.
Key Features
- Distributed by Default: All workflows execute across Celery workers for horizontal scaling
- Durable Execution: Event sourcing ensures workflows can recover from any failure
- Auto Recovery: Automatic workflow resumption after worker crashes with event replay
- Time Travel: Sleep for minutes, hours, or days with automatic resumption
- Fault Tolerant: Automatic retries with configurable backoff strategies
- Zero-Resource Suspension: Workflows suspend without holding resources during sleep
- Production Ready: Built on battle-tested Celery and Redis
- Fully Typed: Complete type hints and Pydantic validation
- Observable: Structured logging with workflow context
Quick Start
Installation
Basic installation (File and Memory storage backends):
pip install pyworkflow-engine
With optional storage backends:
# Redis backend (includes Redis as Celery broker)
pip install pyworkflow-engine[redis]
# SQLite backend
pip install pyworkflow-engine[sqlite]
# PostgreSQL backend
pip install pyworkflow-engine[postgres]
# All storage backends
pip install pyworkflow-engine[all]
# Development (includes all backends + dev tools)
pip install pyworkflow-engine[dev]
Prerequisites
For distributed execution (recommended for production):
PyWorkflow uses Celery for distributed task execution. You need a message broker:
Option 1: Redis (recommended)
# Install Redis support
pip install pyworkflow-engine[redis]
# Start Redis
docker run -d -p 6379:6379 redis:7-alpine
# Start Celery worker(s)
celery -A pyworkflow.celery.app worker --loglevel=info
# Start Celery Beat (for automatic sleep resumption)
celery -A pyworkflow.celery.app beat --loglevel=info
Or use the CLI to set up Docker infrastructure:
pyworkflow setup
Option 2: Other brokers (RabbitMQ, etc.)
# Celery supports multiple brokers
# Configure via environment: CELERY_BROKER_URL=amqp://localhost
For local development/testing:
# No broker needed - use in-process execution
pyworkflow configure --runtime local
See DISTRIBUTED.md for the complete deployment guide.
Your First Workflow
from pyworkflow import workflow, step, start, sleep

@step()
async def send_welcome_email(user_id: str):
    # This runs on any available Celery worker
    print(f"Sending welcome email to user {user_id}")
    return f"Email sent to {user_id}"

@step()
async def send_tips_email(user_id: str):
    print(f"Sending tips email to user {user_id}")
    return f"Tips sent to {user_id}"

@workflow()
async def onboarding_workflow(user_id: str):
    # Send welcome email immediately
    await send_welcome_email(user_id)

    # Sleep for 1 day - workflow suspends, zero resources used
    await sleep("1d")

    # Automatically resumes after 1 day!
    await send_tips_email(user_id)

    return "Onboarding complete"

# Start workflow - executes across Celery workers
run_id = start(onboarding_workflow, user_id="user_123")
print(f"Workflow started: {run_id}")
What happens:
- Workflow starts on a Celery worker
- Welcome email is sent
- Workflow suspends after calling sleep("1d")
- Worker is freed to handle other tasks
- After 1 day, Celery Beat automatically schedules resumption
- Workflow resumes on any available worker
- Tips email is sent
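If you need to check on a run afterwards, the configured storage backend exposes the run record. A minimal sketch, reusing the get_run() accessor shown in the Testing section further below; how you obtain the storage object depends on your configuration, and the helper name inspect() is illustrative:

import asyncio

async def inspect(storage, run_id: str) -> None:
    # `storage` is whichever backend you passed to PyWorkflow's configure()
    run = await storage.get_run(run_id)  # same accessor used in the Testing section
    print(run.status.value)              # e.g. "completed" once the workflow finishes

# asyncio.run(inspect(storage, run_id))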
Core Concepts
Workflows
Workflows are the top-level orchestration functions. They coordinate steps, handle business logic, and can sleep for extended periods.
from pyworkflow import workflow, start

@workflow(name="process_order", max_duration="1h")
async def process_order(order_id: str):
    """
    Process a customer order.

    This workflow:
    - Validates the order
    - Processes payment
    - Creates shipment
    - Sends confirmation
    """
    order = await validate_order(order_id)
    payment = await process_payment(order)
    shipment = await create_shipment(order)
    await send_confirmation(order)
    return {"order_id": order_id, "status": "completed"}

# Start the workflow
run_id = start(process_order, order_id="ORD-123")
Steps
Steps are the building blocks of workflows. Each step is an isolated, retryable unit of work that runs on Celery workers.
import httpx

from pyworkflow import step, RetryableError, FatalError

@step(max_retries=5, retry_delay="exponential")
async def call_external_api(url: str):
    """
    Call an external API with automatic retry.

    Retries up to 5 times with exponential backoff if it fails.
    """
    try:
        async with httpx.AsyncClient() as client:
            response = await client.get(url)
        if response.status_code == 404:
            # Don't retry - resource doesn't exist
            raise FatalError("Resource not found")
        if response.status_code >= 500:
            # Retry - server error
            raise RetryableError("Server error", retry_after="30s")
        return response.json()
    except httpx.NetworkError:
        # Retry with exponential backoff
        raise RetryableError("Network error")
Sleep and Delays
Workflows can sleep for any duration. During sleep, the workflow suspends and consumes zero resources.
from pyworkflow import workflow, sleep

@workflow()
async def scheduled_reminder(user_id: str):
    # Send immediate reminder
    await send_reminder(user_id, "immediate")

    # Sleep for 1 hour
    await sleep("1h")
    await send_reminder(user_id, "1 hour later")

    # Sleep for 1 day
    await sleep("1d")
    await send_reminder(user_id, "1 day later")

    # Sleep for 1 week
    await sleep("7d")
    await send_reminder(user_id, "1 week later")

    return "All reminders sent"
Supported formats:
- Duration strings: "5s", "10m", "2h", "3d"
- Timedelta: timedelta(hours=2, minutes=30)
- Datetime: datetime(2025, 12, 25, 9, 0, 0)
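A short sketch showing all three forms inside one workflow (the workflow name here is illustrative):

from datetime import datetime, timedelta

from pyworkflow import workflow, sleep

@workflow()
async def sleep_format_demo():
    await sleep("10m")                            # duration string
    await sleep(timedelta(hours=2, minutes=30))   # timedelta
    await sleep(datetime(2025, 12, 25, 9, 0, 0))  # absolute datetime
    return "done"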
Architecture
Event-Sourced Execution
PyWorkflow uses event sourcing to achieve durable, fault-tolerant execution:
- All state changes are recorded as events in an append-only log
- Deterministic replay enables workflow resumption from any point
- Complete audit trail of everything that happened in the workflow
Event Types (16 total):
- Workflow: started, completed, failed, suspended, resumed
- Step: started, completed, failed, retrying
- Sleep: created, completed
- Logging: info, warning, error, debug
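For intuition, the event log for the onboarding workflow above would conceptually contain entries like these. The field names below are illustrative only, not the exact storage schema:

# Hypothetical event shapes; the real schema may differ.
events = [
    {"type": "workflow.started", "run_id": "run_abc", "args": {"user_id": "user_123"}},
    {"type": "step.started", "step_name": "send_welcome_email"},
    {"type": "step.completed", "step_name": "send_welcome_email", "result": "Email sent to user_123"},
    {"type": "sleep.created", "duration": "1d"},
    {"type": "sleep.completed"},
    {"type": "step.completed", "step_name": "send_tips_email", "result": "Tips sent to user_123"},
    {"type": "workflow.completed", "result": "Onboarding complete"},
]

On resumption, completed steps are not re-executed; their recorded results are replayed from the log instead.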
Distributed Execution
┌─────────────────────────────────────────────┐
│              Your Application               │
│                                             │
│         start(my_workflow, args)            │
│                    │                        │
└────────────────────┼────────────────────────┘
                     │
                     ▼
                ┌─────────┐
                │  Redis  │ ◄──── Message Broker
                └─────────┘
                     │
        ┌────────────┼────────────┐
        ▼            ▼            ▼
     ┌──────┐     ┌──────┐     ┌──────┐
     │Worker│     │Worker│     │Worker│ ◄──── Horizontal Scaling
     └──────┘     └──────┘     └──────┘
        │            │            │
        └────────────┴────────────┘
                     │
                     ▼
               ┌──────────┐
               │ Storage  │ ◄──── Event Log (File/Redis/PostgreSQL)
               └──────────┘
Storage Backends
PyWorkflow supports pluggable storage backends:
| Backend | Status | Installation | Use Case |
|---|---|---|---|
| File | ✅ Complete | Included | Development, single-machine |
| Memory | ✅ Complete | Included | Testing, ephemeral workflows |
| SQLite | ✅ Complete | pip install pyworkflow-engine[sqlite] | Embedded, local persistence |
| PostgreSQL | ✅ Complete | pip install pyworkflow-engine[postgres] | Production, enterprise |
| Redis | 📋 Planned | pip install pyworkflow-engine[redis] | High-performance, distributed |
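Backends are selected through configure(). A minimal sketch using the in-memory backend, which also appears in the Testing section below; the other backends are assumed to follow the same pattern with their own classes under pyworkflow.storage:

from pyworkflow import configure
from pyworkflow.storage.memory import InMemoryStorageBackend

# In-memory backend: suited to tests and ephemeral workflows.
configure(storage=InMemoryStorageBackend(), default_durable=True)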
Advanced Features
Parallel Execution
Use Python's native asyncio.gather() for parallel step execution:
import asyncio

from pyworkflow import workflow, step

@step()
async def fetch_user(user_id: str):
    # Fetch user data
    return {"id": user_id, "name": "Alice"}

@step()
async def fetch_orders(user_id: str):
    # Fetch user orders
    return [{"id": "ORD-1"}, {"id": "ORD-2"}]

@step()
async def fetch_recommendations(user_id: str):
    # Fetch recommendations
    return ["Product A", "Product B"]

@workflow()
async def dashboard_data(user_id: str):
    # Fetch all data in parallel
    user, orders, recommendations = await asyncio.gather(
        fetch_user(user_id),
        fetch_orders(user_id),
        fetch_recommendations(user_id),
    )
    return {
        "user": user,
        "orders": orders,
        "recommendations": recommendations,
    }
Error Handling
PyWorkflow distinguishes between retryable and fatal errors:
from pyworkflow import FatalError, RetryableError, step

@step(max_retries=3, retry_delay="exponential")
async def process_payment(amount: float):
    try:
        # Attempt payment
        result = await payment_gateway.charge(amount)
        return result
    except InsufficientFundsError:
        # Don't retry - user doesn't have enough money
        raise FatalError("Insufficient funds")
    except PaymentGatewayTimeoutError:
        # Retry - temporary issue
        raise RetryableError("Gateway timeout", retry_after="10s")
    except Exception as e:
        # Unknown error - retry with backoff
        raise RetryableError(f"Unknown error: {e}")
Retry strategies:
retry_delay="fixed"- Fixed delay between retries (default: 60s)retry_delay="exponential"- Exponential backoff (1s, 2s, 4s, 8s, ...)retry_delay="5s"- Custom fixed delay
Auto Recovery
Workflows automatically recover from worker crashes:
from pyworkflow import workflow, step, sleep

@workflow(
    recover_on_worker_loss=True,  # Enable recovery (default for durable)
    max_recovery_attempts=5,      # Max recovery attempts
)
async def resilient_workflow(data_id: str):
    data = await fetch_data(data_id)  # Completed steps are skipped on recovery
    await sleep("10m")                # Sleep state is preserved
    return await process_data(data)   # Continues from here after crash
What happens on worker crash:
- Celery detects worker loss, requeues task
- New worker picks up the task
- Events are replayed to restore state
- Workflow resumes from last checkpoint
Configure globally:
import pyworkflow
pyworkflow.configure(
default_recover_on_worker_loss=True,
default_max_recovery_attempts=3,
)
Or via config file:
# pyworkflow.config.yaml
recovery:
  recover_on_worker_loss: true
  max_recovery_attempts: 3
Idempotency
Prevent duplicate workflow executions with idempotency keys:
from pyworkflow import start
# Same idempotency key = same workflow
run_id_1 = start(
process_order,
order_id="ORD-123",
idempotency_key="order-ORD-123"
)
# This will return the same run_id, not start a new workflow
run_id_2 = start(
process_order,
order_id="ORD-123",
idempotency_key="order-ORD-123"
)
assert run_id_1 == run_id_2 # True!
Observability
PyWorkflow includes structured logging with automatic context:
from pyworkflow import configure_logging
# Configure logging
configure_logging(
level="INFO",
log_file="workflow.log",
json_logs=True, # JSON format for production
show_context=True # Include run_id, step_id, etc.
)
# Logs automatically include:
# - run_id: Workflow execution ID
# - workflow_name: Name of the workflow
# - step_id: Current step ID
# - step_name: Name of the step
Testing
PyWorkflow keeps the same API in tests, which run with local in-process execution and the in-memory storage backend:
import pytest

from pyworkflow import workflow, step, start, configure, reset_config
from pyworkflow.storage.memory import InMemoryStorageBackend

@step()
async def my_step(x: int):
    return x * 2

@workflow()
async def my_workflow(x: int):
    result = await my_step(x)
    return result + 1

@pytest.fixture(autouse=True)
def setup_storage():
    reset_config()
    storage = InMemoryStorageBackend()
    configure(storage=storage, default_durable=True)
    yield storage
    reset_config()

@pytest.mark.asyncio
async def test_my_workflow(setup_storage):
    storage = setup_storage
    run_id = await start(my_workflow, 5)

    # Get workflow result
    run = await storage.get_run(run_id)
    assert run.status.value == "completed"
Production Deployment
Docker Compose
version: '3.8'

services:
  redis:
    image: redis:7-alpine
    ports:
      - "6379:6379"

  worker:
    build: .
    command: celery -A pyworkflow.celery.app worker --loglevel=info
    depends_on:
      - redis
    deploy:
      replicas: 3  # Run 3 workers

  beat:
    build: .
    command: celery -A pyworkflow.celery.app beat --loglevel=info
    depends_on:
      - redis

  flower:
    build: .
    command: celery -A pyworkflow.celery.app flower --port=5555
    ports:
      - "5555:5555"
Start everything using the CLI:
pyworkflow setup
See DISTRIBUTED.md for the complete deployment guide, including Kubernetes.
Examples
Check out the examples/ directory for complete working examples:
- basic_workflow.py - Complete example with retries, errors, and sleep
- distributed_example.py - Multi-worker distributed execution example
Project Status
✅ Status: Production Ready (v1.0)
Completed Features:
- ✅ Core workflow and step execution
- ✅ Event sourcing with 16 event types
- ✅ Distributed execution via Celery
- ✅ Sleep primitive with automatic resumption
- ✅ Error handling and retry strategies
- ✅ File storage backend
- ✅ Structured logging
- ✅ Comprehensive test coverage (68 tests)
- ✅ Docker Compose deployment
- ✅ Idempotency support
Next Milestones:
- 📋 Redis storage backend
- 📋 PostgreSQL storage backend
- 📋 Webhook integration
- 📋 Web UI for monitoring
- 📋 CLI management tools
Contributing
Contributions are welcome!
Development Setup
# Clone repository
git clone https://github.com/QualityUnit/pyworkflow
cd pyworkflow
# Install with Poetry
poetry install
# Run tests
poetry run pytest
# Format code
poetry run black pyworkflow tests
poetry run ruff check pyworkflow tests
# Type checking
poetry run mypy pyworkflow
Documentation
- Distributed Deployment Guide - Production deployment with Docker Compose and Kubernetes
- Examples - Working examples and patterns
- API Reference (Coming soon)
- Architecture Guide (Coming soon)
License
Apache License 2.0 - See LICENSE file for details.
Links
- Documentation: https://docs.pyworkflow.dev
- GitHub: https://github.com/QualityUnit/pyworkflow
- Issues: https://github.com/QualityUnit/pyworkflow/issues