A lightweight, high-performance, in-process micro-orchestrator for structured, declarative, and parallel asynchronous tasks in Python.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

owebeeone

These details have not been verified by PyPI

Framework
- AsyncIO
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3
- Python :: 3.11
Topic
- Software Development :: Libraries :: Python Modules

Project description

sdax - Structured Declarative Async eXecution

sdax is a lightweight, high-performance, in-process micro-orchestrator for Python's asyncio. It is designed to manage complex, tiered, parallel asynchronous tasks with a declarative API, guaranteeing a correct and predictable order of execution.

It is ideal for building the internal logic of a single, fast operation, such as a complex API endpoint, where multiple dependent I/O calls (to databases, feature flags, or other services) must be reliably initialized, executed, and torn down.

Links:

Key Features

Immutable Builder Pattern: Build processors using a fluent builder API that produces immutable, reusable processor instances.
Structured Lifecycle: Enforces a rigid pre-execute -> execute -> post-execute lifecycle for all tasks.
Tiered Parallel Execution: Tasks are grouped into integer "levels." All tasks at a given level are executed in parallel, and the framework ensures all tasks at level N complete successfully before level N+1 begins.
Guaranteed Cleanup: post-execute runs for any task whose pre-execute was started, regardless of whether it succeeded, failed, or was cancelled. This ensures resources are always released.
Concurrent Execution Safe: Multiple concurrent executions of the same processor instance are fully isolated, perfect for high-throughput API endpoints.
Declarative & Flexible: Define tasks and task functions as frozen dataclasses. Methods for each phase are optional, and each can have its own timeout and retry configuration.
Lightweight: Runs directly inside your application's event loop with minimal dependencies (datatrees, frozendict), with minimal overhead (see Performance section for details).

Installation

pip install sdax

Or for development:

git clone https://github.com/owebeeone/sdax.git
cd sdax
pip install -e .

Quick Start

import asyncio
from dataclasses import dataclass
from sdax import AsyncTaskProcessor, AsyncTask, TaskFunction

# 1. Define your context class with typed fields
@dataclass
class TaskContext:
    user_id: int | None = None
    feature_flags: dict | None = None
    db_connection = None

# 2. Define your task functions
async def check_auth(ctx: TaskContext):
    print("Level 1: Checking authentication...")
    await asyncio.sleep(0.1)
    ctx.user_id = 123
    print("Auth successful.")

async def load_feature_flags(ctx: TaskContext):
    print("Level 1: Loading feature flags...")
    await asyncio.sleep(0.2)
    ctx.feature_flags = {"new_api": True}
    print("Flags loaded.")

async def fetch_user_data(ctx: TaskContext):
    print("Level 2: Fetching user data...")
    if not ctx.user_id:
        raise ValueError("Auth failed, cannot fetch user data.")
    await asyncio.sleep(0.1)
    print("User data fetched.")

async def close_db_connection(ctx: TaskContext):
    print("Tearing down db connection...")
    await asyncio.sleep(0.05)
    print("Connection closed.")

async def main():
    # 3. Create your context
    ctx = TaskContext()

    # 4. Build an immutable processor with declarative workflow
    processor = (
        AsyncTaskProcessor.builder()
        .add_task(
            level=1,
            task=AsyncTask(
                name="Authentication",
                pre_execute=TaskFunction(function=check_auth),
                post_execute=TaskFunction(function=close_db_connection)
            )
        )
        .add_task(
            level=1,
            task=AsyncTask(
                name="FeatureFlags",
                pre_execute=TaskFunction(function=load_feature_flags)
            )
        )
        .add_task(
            level=2,
            task=AsyncTask(
                name="UserData",
                execute=TaskFunction(function=fetch_user_data)
            )
        )
        .build()
    )

    # 5. Run the processor (can be reused for multiple concurrent executions)
    try:
        await processor.process_tasks(ctx)
        print("\nWorkflow completed successfully!")
    except* Exception as e:
        print(f"\nWorkflow failed: {e.exceptions[0]}")

if __name__ == "__main__":
    asyncio.run(main())

Important: Cleanup Guarantees & Resource Management

⚠️ Critical Behavior: post_execute runs for any task whose pre_execute was started, even if:

pre_execute raised an exception
pre_execute was cancelled (due to a sibling task failure)
pre_execute timed out

This is by design for resource management. If your pre_execute acquires resources (opens files, database connections, locks), your post_execute must be idempotent and handle partial initialization.

Example: Safe Resource Management

@dataclass
class TaskContext:
    lock: asyncio.Lock | None = None
    lock_acquired: bool = False

async def acquire_lock(ctx: TaskContext):
    ctx.lock = await some_lock.acquire()
    # If cancelled here, lock is acquired but flag not set
    ctx.lock_acquired = True

async def release_lock(ctx: TaskContext):
    # ✅ GOOD: Check if we actually acquired the lock
    if ctx.lock_acquired and ctx.lock:
        await ctx.lock.release()
    # ✅ GOOD: Or use try/except for safety
    try:
        if ctx.lock:
            await ctx.lock.release()
    except Exception:
        pass  # Already released or never acquired

Why this matters: In parallel execution, if one task fails, all other tasks in that level are cancelled. Without guaranteed cleanup, you'd leak resources.

Execution Model

The "Elevator" Pattern

Tasks execute in a strict "elevator up, elevator down" pattern:

Level 1: [A-pre, B-pre, C-pre] ─┐
                                 ├─→ (parallel)
Level 2: [D-pre, E-pre] ────────┘
                                 ├─→ (parallel)  
Execute: [A-exec, B-exec, D-exec, E-exec] ─┘
                                 
Teardown: ┌─ [D-post, E-post] (parallel)
          └─ [A-post, B-post, C-post] (parallel)

Key Rules:

Within a level, tasks run in parallel
Levels execute sequentially (level N+1 waits for level N)
execute phase runs after all pre_execute phases complete
post_execute runs in reverse level order (LIFO)
If any task fails, remaining tasks are cancelled but cleanup still runs

Task Phases

Each task can define up to 3 optional phases:

Phase	When It Runs	Purpose	Cleanup Guarantee
`pre_execute`	First, by level	Initialize resources, setup	`post_execute` runs if started
`execute`	After all pre_execute	Do main work	`post_execute` runs if pre_execute started
`post_execute`	Last, reverse order	Cleanup, release resources	Always runs if pre_execute started

Performance

Benchmarks (single-threaded, zero-work tasks):

Python Version	Multi-level	Single Large Level	Framework Overhead
Python 3.13	~137,000 tasks/sec	~21,500 tasks/sec	~7µs per task
Python 3.11	~15,000 tasks/sec	~159 tasks/sec	~67µs per task

Python 3.13 has significantly improved asyncio performance compared to 3.11. Benchmarks show 9x better throughput in many scenarios.

Key Observations:

Multi-level execution: ~79% of raw asyncio performance (Python 3.13)
Scalability: Tested with 1,000+ tasks across 100 levels
Real-world performance: For typical I/O-bound tasks (10ms+), framework overhead is <0.1% and negligible

When to use:

✅ I/O-bound workflows (database, HTTP, file operations)
✅ Complex multi-step operations with dependencies
✅ Multiple levels with reasonable task counts (5-50 tasks/level)
✅ Tasks where guaranteed cleanup is critical

When NOT to use:

❌ CPU-bound work (use ProcessPoolExecutor instead)
❌ Single level with 100+ parallel tasks (use raw asyncio.TaskGroup)
❌ Simple linear async/await (unnecessary overhead)
❌ Ultra high-frequency operations (>100k ops/sec needed)

Use Cases

✅ Perfect For

Complex API Endpoints

Level 1: [Auth, RateLimit, FeatureFlags]  # Parallel
Level 2: [FetchUser, FetchPermissions]     # Depends on Level 1
Level 3: [LoadData, ProcessRequest]        # Depends on Level 2

Data Pipeline Steps

Level 1: [OpenDBConnection, OpenFileHandle]
Level 2: [ReadData, TransformData]
Level 3: [WriteResults]
Post: Always close connections/files

Build/Deploy Systems

Level 1: [CheckoutCode, ValidateConfig]
Level 2: [RunTests, BuildArtifacts]
Level 3: [Deploy, NotifySlack]

High-Throughput API Server (Concurrent Execution)

# Build immutable workflow once at startup
processor = (
    AsyncTaskProcessor.builder()
    .add_task(AsyncTask("Auth", ...), level=1)
    .add_task(AsyncTask("FetchData", ...), level=2)
    .build()
)

# Reuse processor for thousands of concurrent requests
@app.post("/api/endpoint")
async def handle_request(user_id: int):
    ctx = RequestContext(user_id=user_id)
    await processor.process_tasks(ctx)
    return ctx.results

❌ Not Suitable For

Simple sequential operations (just use await)
Fire-and-forget background tasks (use asyncio.create_task)
Distributed workflows (use Celery, Airflow)
Event-driven systems (use message queues)

Error Handling

Tasks can fail at any phase. The framework:

Cancels remaining tasks at the same level
Runs cleanup for all tasks that started pre_execute
Collects all exceptions into an ExceptionGroup
Raises the group after cleanup completes

try:
    await processor.process_tasks(ctx)
except* ValueError as eg:
    # Handle specific exception type
    for exc in eg.exceptions:
        print(f"Validation error: {exc}")
except* TimeoutError as eg:
    # Handle timeouts
    for exc in eg.exceptions:
        print(f"Task timed out: {exc}")
except ExceptionGroup as eg:
    # Handle all errors
    print(f"Multiple failures: {eg}")

Advanced Features

Per-Task Configuration

Each task function can have its own timeout and retry settings:

AsyncTask(
    name="FlakeyAPI",
    execute=TaskFunction(
        function=call_external_api,
        timeout=5.0,         # 5 second timeout (use None for no timeout)
        retries=3,           # Retry 3 times
        initial_delay=1.0,   # Start retries at 1 second (default)
        backoff_factor=2.0   # Exponential backoff: 1s, 2s, 4s
    )
)

Retry Timing Calculation:

Each retry delay: initial_delay * (backoff_factor ** attempt) * uniform(0.5, 1.0)
With initial_delay=1.0, backoff_factor=2.0:
- First retry: 0.5s to 1.0s (average 0.75s)
- Second retry: 1.0s to 2.0s (average 1.5s)
- Third retry: 2.0s to 4.0s (average 3.0s)
The uniform(0.5, 1.0) jitter prevents thundering herd

Note: AsyncTask and TaskFunction are frozen dataclasses, ensuring immutability and thread-safety. Once created, they cannot be modified.

Shared Context

You define your own context class with typed fields:

@dataclass
class TaskContext:
    user_id: int | None = None
    permissions: list[str] = field(default_factory=list)
    db_connection: Any = None

async def task_a(ctx: TaskContext):
    ctx.user_id = 123  # Set data

async def task_b(ctx: TaskContext):
    user_id = ctx.user_id  # Read data from task_a, with full type hints!

Note: The context is shared but not thread-safe. Since tasks run in a single asyncio event loop, no locking is needed.

Concurrent Execution

You can safely run multiple concurrent executions of the same immutable AsyncTaskProcessor instance:

# Build immutable processor once at startup
processor = (
    AsyncTaskProcessor.builder()
    .add_task(AsyncTask(...), level=1)
    .build()
)

# Reuse processor for multiple concurrent requests - each with its own context
await asyncio.gather(
    processor.process_tasks(RequestContext(user_id=123)),
    processor.process_tasks(RequestContext(user_id=456)),
    processor.process_tasks(RequestContext(user_id=789)),
)

⚠️ Critical Requirements for Concurrent Execution:

Context Must Be Self-Contained
- Your context must fully contain all request-specific state
- Do NOT rely on global variables, class attributes, or module-level state
- Each execution gets its own isolated context instance
Task Functions Must Be Pure (No External Side Effects)
- ❌ BAD: Writing to shared files, databases, or caches without coordination
- ❌ BAD: Modifying global state or class variables
- ❌ BAD: Using non-isolated external resources
- ✅ GOOD: Reading from the context
- ✅ GOOD: Writing to the context
- ✅ GOOD: Making HTTP requests (each execution independent)
- ✅ GOOD: Database operations with per-execution connections
Example - Safe Concurrent Execution:

@dataclass
class RequestContext:
    # All request state contained in context
    user_id: int
    db_connection: Any = None
    api_results: dict = field(default_factory=dict)

async def open_db(ctx: RequestContext):
    # Each execution gets its own connection
    ctx.db_connection = await db_pool.acquire()

async def fetch_user_data(ctx: RequestContext):
    # Uses this execution's connection
    ctx.api_results["user"] = await ctx.db_connection.fetch_user(ctx.user_id)

async def close_db(ctx: RequestContext):
    # Cleans up this execution's connection
    if ctx.db_connection:
        await ctx.db_connection.close()

# Safe - each execution isolated
processor.add_task(
    AsyncTask("DB", pre_execute=TaskFunction(open_db), post_execute=TaskFunction(close_db)),
    level=1
)

Example - UNSAFE Concurrent Execution:

# ❌ WRONG - shared state causes race conditions
SHARED_CACHE = {}

async def unsafe_task(ctx: RequestContext):
    # Race condition! Multiple executions writing to same dict
    SHARED_CACHE[ctx.user_id] = await fetch_data(ctx.user_id)  # BAD!

When NOT to use concurrent execution:

Your task functions have uncoordinated side effects (file writes, shared caches)
Your tasks rely on global or class-level state
Your tasks modify shared resources without proper locking

When concurrent execution is perfect:

Each request has its own isolated resources (DB connections, API clients)
All state is contained in the context
Tasks are functionally pure (output depends only on context input)
High-throughput API endpoints serving independent requests

Testing

Run the test suite:

pytest tests/ -v

Performance benchmarks:

python tests/test_performance.py -v

Monte Carlo stress testing (runs ~2,750 tasks with random failures):

python tests/test_monte_carlo.py -v

Comparison to Alternatives

Feature	sdax	Celery	Airflow	Raw asyncio
Setup complexity	Minimal	High	Very High	None
External dependencies	None	Redis/RabbitMQ	PostgreSQL/MySQL	None
Throughput	~137k tasks/sec	~500 tasks/sec	~50 tasks/sec	~174k ops/sec
Overhead	~7µs/task	Varies	High	Minimal
Use case	In-process workflows	Distributed tasks	Complex DAGs	Simple async
Guaranteed cleanup	✅ Yes	❌ No	❌ No	Manual
Level-based execution	✅ Yes	❌ No	✅ Yes	Manual

License

MIT License - see LICENSE file for details.

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

owebeeone

These details have not been verified by PyPI

Framework
- AsyncIO
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3
- Python :: 3.11
Topic
- Software Development :: Libraries :: Python Modules

Release history Release notifications | RSS feed

0.7.1

Oct 26, 2025

0.7.0

Oct 26, 2025

0.6.1

Oct 25, 2025

0.5.2

Oct 25, 2025

0.5.1

Oct 24, 2025

0.5.0

Oct 24, 2025

0.4.0

Oct 18, 2025

This version

0.3.2

Oct 12, 2025

0.3.1

Oct 12, 2025

0.2.1

Oct 12, 2025

0.2.0

Oct 11, 2025

0.1.1

Oct 11, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sdax-0.3.2.tar.gz (21.4 kB view details)

Uploaded Oct 12, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

sdax-0.3.2-py3-none-any.whl (11.7 kB view details)

Uploaded Oct 12, 2025 Python 3

File details

Details for the file sdax-0.3.2.tar.gz.

File metadata

Download URL: sdax-0.3.2.tar.gz
Upload date: Oct 12, 2025
Size: 21.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sdax-0.3.2.tar.gz
Algorithm	Hash digest
SHA256	`40cbed36503a073369a1d1c2f0d6023ddb454f5ce2a3a03658c2d2733c508e1e`
MD5	`895981dc82a7af387d590da452767b3c`
BLAKE2b-256	`5f6a508cc00b54273a1d3a042ba0f2113c5850922671b975ab025ca2cfb33b3b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for sdax-0.3.2.tar.gz:

Publisher: publish.yml on owebeeone/sdax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sdax-0.3.2.tar.gz
- Subject digest: 40cbed36503a073369a1d1c2f0d6023ddb454f5ce2a3a03658c2d2733c508e1e
- Sigstore transparency entry: 601208054
- Sigstore integration time: Oct 12, 2025
Source repository:
- Permalink: owebeeone/sdax@40a051566f936dc3923f0c7b9e030e77015f748a
- Branch / Tag: refs/tags/v0.3.2
- Owner: https://github.com/owebeeone
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@40a051566f936dc3923f0c7b9e030e77015f748a
- Trigger Event: release

File details

Details for the file sdax-0.3.2-py3-none-any.whl.

File metadata

Download URL: sdax-0.3.2-py3-none-any.whl
Upload date: Oct 12, 2025
Size: 11.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for sdax-0.3.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0c482ee4d1185794baa7f3c22dbd6938eddaa2bd90d8df847cba286852587abe`
MD5	`261db65d14ecf997cea5bdf2f2f54160`
BLAKE2b-256	`da9538441976cee9a60453b11d71008d50a63ac2976d7845523a016c6152a920`

See more details on using hashes here.

Provenance

The following attestation bundles were made for sdax-0.3.2-py3-none-any.whl:

Publisher: publish.yml on owebeeone/sdax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: sdax-0.3.2-py3-none-any.whl
- Subject digest: 0c482ee4d1185794baa7f3c22dbd6938eddaa2bd90d8df847cba286852587abe
- Sigstore transparency entry: 601208056
- Sigstore integration time: Oct 12, 2025
Source repository:
- Permalink: owebeeone/sdax@40a051566f936dc3923f0c7b9e030e77015f748a
- Branch / Tag: refs/tags/v0.3.2
- Owner: https://github.com/owebeeone
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@40a051566f936dc3923f0c7b9e030e77015f748a
- Trigger Event: release

sdax 0.3.2

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

sdax - Structured Declarative Async eXecution

Key Features

Installation

Quick Start

Important: Cleanup Guarantees & Resource Management

Example: Safe Resource Management

Execution Model

The "Elevator" Pattern

Task Phases

Performance

Use Cases

✅ Perfect For

❌ Not Suitable For

Error Handling

Advanced Features

Per-Task Configuration

Shared Context

Concurrent Execution

Testing

Comparison to Alternatives

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance