Skip to main content

Durable execution framework for Python

Project description

PyErgon

PyPI version Python 3.11+

A minimal durable execution framework for Python, based on SQLite and Redis.

PyErgon stores method invocations in a database. When restarting a flow after a failure (e.g., machine crash), it resumes from the last successful step, driving it to completion. Flows can suspend awaiting external signals (human approval, webhook callbacks) or timers, and automatically resume when ready.

Source code: https://github.com/richinex/pyergon Bug reports: https://github.com/richinex/pyergon/issues

What is PyErgon?

PyErgon is a durable execution framework that:

  • Survives crashes: Flows resume from last successful step after machine/process failures
  • Suspends awaiting signals: Flows pause for external events (approvals, webhooks) and automatically resume
  • Caches expensive steps: Completed steps never re-execute on retry
  • Multiple backends: InMemory, SQLite or Redis
  • Supports complex orchestration: Child flows, parallel DAG execution, timers, retry policies

It provides:

  • @flow and @step decorators for durable workflows
  • Automatic retry with exponential backoff
  • External signal coordination
  • Event-driven worker architecture
  • Type-safe APIs with protocols

Installation

From PyPI (Recommended)

pip install pyergon

Requires Python 3.11 or higher.

From Source

# Clone repository
git clone https://github.com/richinex/pyergon.git
cd pyergon

# Install with uv
uv sync

# Or with pip
pip install -e .

Quick Start

import asyncio
from dataclasses import dataclass
from pyergon import flow, flow_type, step, Executor
from pyergon.storage.sqlite import SqliteExecutionLog

@dataclass
@flow_type
class OrderProcessor:
    order_id: str
    amount: float

    @step
    async def validate(self):
        print(f"[{self.order_id}] Validating...")
        return self.amount > 0

    @step
    async def process_payment(self):
        print(f"[{self.order_id}] Processing ${self.amount}...")
        return f"payment-{self.order_id}"

    @flow
    async def run(self):
        await self.validate()
        return await self.process_payment()

async def main():
    # Setup storage
    storage = await SqliteExecutionLog.in_memory()

    # Execute workflow
    order = OrderProcessor("ORD-001", 100.0)
    executor = Executor(order, storage, "order-001")
    outcome = await executor.run(lambda o: o.run())

    print(f"Result: {outcome.result}")

    await storage.close()

asyncio.run(main())

Distributed Workers with Timers

from dataclasses import dataclass
from pyergon import flow, flow_type, step, Scheduler, Worker
from pyergon.storage.sqlite import SqliteExecutionLog
from pyergon.executor.timer import schedule_timer_named
from pyergon.core import TaskStatus

@dataclass
@flow_type
class TimedOrderProcessor:
    order_id: str

    @step
    async def wait_for_fraud_check(self):
        print(f"[{self.order_id}] Waiting for fraud check...")
        await schedule_timer_named(2.0, f"fraud-{self.order_id}")
        print(f"[{self.order_id}] Fraud check complete")

    @flow
    async def process(self):
        await self.wait_for_fraud_check()
        return "completed"

async def main():
    # Setup
    storage = SqliteExecutionLog("distributed.db")
    await storage.connect()

    scheduler = Scheduler(storage)

    # Start workers with timer processing
    worker1 = Worker(storage, "worker-1", enable_timers=True)
    await worker1.register(TimedOrderProcessor)
    handle1 = await worker1.start()

    worker2 = Worker(storage, "worker-2", enable_timers=True)
    await worker2.register(TimedOrderProcessor)
    handle2 = await worker2.start()

    # Schedule flows
    for i in range(3):
        order = TimedOrderProcessor(f"ORD-{i:03d}")
        await scheduler.schedule(order)

    # Wait for completion
    await asyncio.sleep(5)

    # Shutdown
    await handle1.shutdown()
    await handle2.shutdown()
    await storage.close()

asyncio.run(main())

Advanced Features

Waiting for External Signals ("Human in the Loop")

Flows can suspend awaiting external signals (approvals, webhooks, callbacks):

from pyergon.executor.signal import await_external_signal

@dataclass
@flow_type
class DocumentApprovalFlow:
    document_id: str

    @flow
    async def process(self):
        await self.validate_document()

        # Suspend until manager approves
        decision = await self.await_manager_approval()

        if not decision.approved:
            raise ManagerRejectionError(decision.reason)

        await self.publish_document()
        return "approved"

    @step
    async def await_manager_approval(self):
        signal_name = f"manager_approval_{self.document_id}"
        decision_bytes = await await_external_signal(signal_name)
        return pickle.loads(decision_bytes)

Resume the flow externally:

worker = Worker(storage, "worker-1")
worker = worker.with_signals(signal_source, poll_interval=0.5)
await worker.register(DocumentApprovalFlow)
await worker.start()

# Later, from webhook or API:
await signal_source.send_signal("manager_approval_DOC-001", decision_data)

Child Flow Invocation

Parent flows can invoke child flows and await their results:

@dataclass
@flow_type(invokable=str)
class PaymentFlow:
    order_id: str
    amount: float

    @flow(retry_policy=RetryPolicy.STANDARD)
    async def process(self):
        await self.charge_card()
        return f"PAYMENT-{self.order_id}"

@dataclass
@flow_type
class OrderFlow:
    order_id: str
    amount: float

    @flow
    async def process(self):
        await self.validate_order()

        # Invoke child flow and await result
        payment_flow = PaymentFlow(self.order_id, self.amount)
        payment_id = await self.invoke(payment_flow).result()

        return f"Order complete: {payment_id}"

Retry Policies

Control retry behavior for transient vs permanent errors:

class ApiTimeoutError(RetryableError):
    def is_retryable(self) -> bool:
        return True  # Will be retried

class ItemNotFoundError(RetryableError):
    def is_retryable(self) -> bool:
        return False  # Fails immediately

@flow(retry_policy=RetryPolicy.STANDARD)
async def process_order(self):
    await self.check_inventory()  # Retries on ApiTimeoutError
    await self.process_payment()   # Fails fast on ItemNotFoundError

Examples

Complete examples in examples/ directory:

Example Demonstrates
comprehensive_demo.py Full orchestration with versioning, child flows, DAG execution
external_signal_abstraction.py Document approval workflow with external signals
nested_flows.py Parent-child flow execution patterns
retryable_error_proof.py Sophisticated retry behavior
numpy_computation_pipeline.py Scientific computing pipeline with step caching

Run examples:

# From PyPI installation
python examples/comprehensive_demo.py

# From source
PYTHONPATH=src python examples/comprehensive_demo.py

Architecture

Core Components

  1. Core Types (pyergon.core)

    • Invocation: Single step execution record
    • InvocationStatus: Step state machine
    • ScheduledFlow: Distributed queue item
  2. Storage Layer (pyergon.storage)

    • ExecutionLog: Abstract protocol for persistence
    • SqliteExecutionLog: SQLite backend
    • InMemoryExecutionLog: In-memory backend for testing
  3. Executor (pyergon.executor)

    • Executor: Execute flows with durable context
    • Scheduler: Enqueue flows for distributed processing
    • Worker: Process flows from queue
    • schedule_timer: Durable timers
    • await_external_signal: External event coordination
  4. Decorators (pyergon.decorators)

    • @flow_type: Mark workflow class
    • @flow: Mark flow entry point method
    • @step: Mark durable step method

Testing

# Run all tests (60 tests, 48% coverage)
uv run pytest tests/

# Run specific test file
uv run pytest tests/test_durability.py -v

# Type checking
mypy src/pyergon/

# Linting
ruff check src/pyergon/

Development

Code Quality

  • Comprehensive docstrings
  • Type hints throughout
  • Protocol-based interfaces
  • Example usage in every module
  • Test coverage with property-based testing

License

MIT / Apache 2.0 (dual license)

Credits

Inspired by:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pyergon-0.8.1.tar.gz (118.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pyergon-0.8.1-py3-none-any.whl (90.9 kB view details)

Uploaded Python 3

File details

Details for the file pyergon-0.8.1.tar.gz.

File metadata

  • Download URL: pyergon-0.8.1.tar.gz
  • Upload date:
  • Size: 118.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pyergon-0.8.1.tar.gz
Algorithm Hash digest
SHA256 5e4f77202735d6b2ebb99190b1f11dd86e01b3a4ea40e4f8b13bd7ce6d71e5f4
MD5 1fa739392a05d72c0040398397e73da3
BLAKE2b-256 814046a41018c5fd0bfa9aa96c09c6da54cdcad71dda79c0ca28df1070d7eb92

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyergon-0.8.1.tar.gz:

Publisher: publish.yml on richinex/pyergon

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file pyergon-0.8.1-py3-none-any.whl.

File metadata

  • Download URL: pyergon-0.8.1-py3-none-any.whl
  • Upload date:
  • Size: 90.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for pyergon-0.8.1-py3-none-any.whl
Algorithm Hash digest
SHA256 dc3b833de8d36048df190464488c4b91afad5ae89f358c6e8bab180ddd2bdeff
MD5 0893c6141d7dfccfe8a79575579685f9
BLAKE2b-256 84d9323bc3961266a749e5fe0883e96e56f15dd13be67853ee745f845a98389c

See more details on using hashes here.

Provenance

The following attestation bundles were made for pyergon-0.8.1-py3-none-any.whl:

Publisher: publish.yml on richinex/pyergon

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page