toller

Intelligent async flow controller for Python - making complex asyncio workflows manageable

Maintainers

nolantremelling

Project description

What is Toller?

Toller is a lightweight Python library designed to make your asynchronous calls to microservices, GenAI solutions, external APIs, etc., more robust and reliable. It provides a simple yet powerful decorator to add rate limiting, retries (with backoff & jitter), and circuit breaking to your async functions with minimal boilerplate.

Just as the Nova Scotia Duck Tolling Retriever lures and guides ducks, Toller "lures" unruly asynchronous tasks into well-managed, predictable flows, guiding the overall execution path and making concurrency easier to reason about.

Why Toller?

Modern applications that integrate with numerous LLMs, vector databases, and other microservices face a constant challenge: external services can be unreliable. They might be temporarily down, enforce rate limits, or return transient errors.

Building robust applications in this environment means every external call needs careful handling, but repeating this logic for every API call leads to boilerplate, inconsistency, and, often, poorly managed asynchronous processes. Toller was built to solve this. It provides a declarative way to add these resilience patterns.

Toller offers a single, consistent way to apply these patterns, both for client-side calls and, potentially, for protecting server-side resources.

Features

@toller.task Decorator: A single, easy-to-use decorator to apply all resilience patterns.
Rate Limiting:
- Async-safe Token Bucket-based CallRateLimiter.
- Configurable call rates and burst capacity.
- Automatic asynchronous waiting when limits are hit.
Retries:
- Strategies: Max attempts, fixed delay, exponential backoff with jitter.
- Conditional retries on specific exceptions (e.g., TransientError).
- Conditional stopping on specific exceptions (e.g., FatalError).
- Raises MaxRetriesExceeded wrapping the last encountered error.
Circuit Breaker:
- Standard states: CLOSED, OPEN, HALF_OPEN.
- Configurable failure thresholds and recovery timeouts.
- Trips on specified exceptions (e.g., MaxRetriesExceeded, or custom fatal errors).
- Prevents calls to a failing service, allowing it time to recover.
Custom Exception Hierarchy: Clear exceptions like OpenCircuitError, TransientError, FatalError for better error handling.
Async Native: Built for asyncio.
Lightweight: Minimal dependencies.
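
To make the circuit breaker states listed above concrete, here is a minimal, standalone sketch of a CLOSED / OPEN / HALF_OPEN state machine. It is not Toller's implementation: the class name `SketchCircuitBreaker`, its methods, and the injectable `clock` parameter are all hypothetical, chosen only to illustrate how a failure threshold and a recovery timeout usually interact.

```python
import time
from enum import Enum


class State(Enum):
    CLOSED = "closed"
    OPEN = "open"
    HALF_OPEN = "half_open"


class SketchCircuitBreaker:
    """Illustrative state machine only; hypothetical, not Toller's code."""

    def __init__(self, failure_threshold=2, recovery_timeout=20.0, clock=time.monotonic):
        self.failure_threshold = failure_threshold
        self.recovery_timeout = recovery_timeout
        self.clock = clock  # injectable so the demo does not need to sleep
        self.state = State.CLOSED
        self.failures = 0
        self.opened_at = 0.0

    def allow_call(self) -> bool:
        # Once the recovery timeout has elapsed, move to HALF_OPEN
        # so that a trial call can go through.
        if self.state is State.OPEN and self.clock() - self.opened_at >= self.recovery_timeout:
            self.state = State.HALF_OPEN
        return self.state is not State.OPEN

    def record_success(self) -> None:
        # Any success closes the circuit and clears the failure count.
        self.state = State.CLOSED
        self.failures = 0

    def record_failure(self) -> None:
        self.failures += 1
        # A failed trial call, or reaching the threshold, opens the circuit.
        if self.state is State.HALF_OPEN or self.failures >= self.failure_threshold:
            self.state = State.OPEN
            self.opened_at = self.clock()


# Demo with a fake clock: two failures open the circuit, and after the
# timeout a trial call is allowed again.
now = [0.0]
cb = SketchCircuitBreaker(failure_threshold=2, recovery_timeout=20.0, clock=lambda: now[0])
cb.record_failure()
cb.record_failure()
print(cb.state, cb.allow_call())
now[0] = 21.0
print(cb.allow_call(), cb.state)
```

A real breaker would also need to be safe under concurrent asyncio tasks and would typically limit how many trial calls run while HALF_OPEN; both are left out here for brevity.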

Installation

pip install toller

Usage and Examples

Example 1: Basic Resilience for Generative AI Calls

For a function that calls out to an LLM, we want to handle rate limits, retry on temporary server issues, and stop if the service is truly down.

import asyncio
import random

import toller
from toller import TransientError, FatalError, MaxRetriesExceeded, OpenCircuitError

# Define potential API errors
class LLMRateLimitError(TransientError): pass
class LLMServerError(TransientError): pass
class LLMInputError(FatalError): pass # e.g., prompt too long

# Simulate an LLM call
LLM_DOWN_FOR_DEMO = 0 # Counter for demoing circuit breaker
async def call_llm_api(prompt: str):
    global LLM_DOWN_FOR_DEMO
    print(f"LLM API: Processing '{prompt[:20]}...' (Attempt for this task)")
    await asyncio.sleep(random.uniform(0.1, 0.3)) # Network latency

    if LLM_DOWN_FOR_DEMO > 0:
        LLM_DOWN_FOR_DEMO -=1
        print("LLM API: Simulating 503 Service Unavailable")
        raise LLMServerError("LLM service is temporarily down")
    if random.random() < 0.2: # 20% chance of a transient rate limit error
        print("LLM API: Simulating 429 Rate Limit")
        raise LLMRateLimitError("Hit LLM rate limit")
    if len(prompt) < 5:
        print("LLM API: Simulating 400 Bad Request (prompt too short)")
        raise LLMInputError("Prompt is too short")
    
    return f"LLM Response for '{prompt[:20]}...': Generated text."

# Apply Toller
@toller.task(
    # Rate Limiter: 60 calls per minute (1 per sec), burst 5
    rl_calls_per_second=1.0,  # 60 RPM / 60s
    rl_max_burst_calls=5,
    
    # Retries: 3 attempts on transient LLM errors
    retry_max_attempts=3,
    retry_delay=1.0, # Start with 1s delay for LLM errors
    retry_backoff=2.0,
    retry_on_exception=(LLMRateLimitError, LLMServerError),
    retry_stop_on_exception=(LLMInputError,), # Don't retry bad input

    # Circuit Breaker: Opens if retries fail 2 times consecutively
    cb_failure_threshold=2, # Low threshold for demo
    cb_recovery_timeout=20.0, # Wait 20s before one test call
    cb_expected_exception=MaxRetriesExceeded # CB trips when all retries are exhausted
)
async def get_llm_completion(prompt: str):
    return await call_llm_api(prompt)

async def run_example1():
    print("Example 1: Basic Resilience for Generative AI Calls")
    prompts = [
        "Tell me a story about a brave duck.",
        "Explain async programming.",
        "Hi", # Shorter than 5 characters, so this raises a FatalError (LLMInputError)
        "Another valid prompt after fatal.",
        "Prompt to trigger server errors 1", # Will hit retry then MaxRetriesExceeded
        "Prompt to trigger server errors 2", # Will hit retry then MaxRetriesExceeded, tripping CB
        "Prompt after CB should open", # Should hit OpenCircuitError
    ]

    global LLM_DOWN_FOR_DEMO

    for i, p in enumerate(prompts):
        # Simulate LLM service downtime only for the prompts meant to exhaust the retries
        if p.startswith("Prompt to trigger server errors"):
            LLM_DOWN_FOR_DEMO = 4
        print(f"\nSending request: '{p}'")
        try:
            result = await get_llm_completion(p)
            print(f"Success: {result}")
        except MaxRetriesExceeded as e:
            print(f"Toller: Max retries exceeded. Last error: {type(e.last_exception).__name__}: {e.last_exception}")
        except OpenCircuitError as e:
            print(f"Toller: Circuit is open! {e}. Further calls blocked temporarily.")
            if i == len(prompts) - 2: # If this is the call just before the last one
                print("Waiting for circuit breaker recovery timeout for demo...")
                await asyncio.sleep(21) # Wait for CB to go HALF_OPEN
        except FatalError as e:
            print(f"Toller: Fatal error, no retries. Error: {type(e).__name__}: {e}")
        except Exception as e:
            print(f"Toller: Unexpected error. Type: {type(e).__name__}, Error: {e}")
        
        await asyncio.sleep(0.3) # Small pause between top-level requests to see rate limiter too

if __name__ == "__main__":
    asyncio.run(run_example1())
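
As a rough worked example of what `retry_delay=1.0` and `retry_backoff=2.0` imply, the sketch below computes an exponential backoff schedule. It rests on assumptions that are not confirmed by the text above: that `retry_max_attempts` counts the first call as an attempt, that the delay before retry n is `retry_delay * retry_backoff**n`, and that jitter is an additive random fraction of each delay. The helper `backoff_delays` and its `jitter` parameter are hypothetical and are not part of Toller.

```python
import random


def backoff_delays(max_attempts, base_delay, backoff, jitter=0.0, rng=random):
    """Return the wait before each retry (hypothetical helper, not Toller's API).

    Assumes max_attempts includes the initial call, so there are
    max_attempts - 1 waits in total.
    """
    delays = []
    for n in range(max_attempts - 1):
        delay = base_delay * backoff ** n
        # Optional jitter: add up to `jitter` (as a fraction) of the delay.
        delay += rng.uniform(0, jitter * delay)
        delays.append(delay)
    return delays


# Without jitter, three attempts wait 1s and then 2s between calls.
print(backoff_delays(3, 1.0, 2.0))  # [1.0, 2.0]

# With up to 50% jitter the waits are spread out, which helps avoid
# many clients retrying at the same instant.
print(backoff_delays(3, 1.0, 2.0, jitter=0.5))
```

Under these assumptions, a call that fails all three attempts in Example 1 spends about 3 seconds waiting before `MaxRetriesExceeded` is raised, not counting the simulated network latency.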

Example 2: Shared Rate Limiter for Multiple Related API Calls

Often, different API endpoints for the same service share an overall rate limit.

import asyncio
import time

import toller
from toller import CallRateLimiter # For creating a shared instance

# Assume these two functions call endpoints that share a single rate limit pool
shared_api_rl = CallRateLimiter(calls_per_second=2, max_burst_calls=2, name="MyServiceSharedRL")

@toller.task(
    rate_limiter_instance=shared_api_rl,
    # Disable retry/CB for this simple RL demo
    enable_retry=False, enable_circuit_breaker=False 
)
async def call_endpoint_a(item_id: int):
    print(f"Calling A for {item_id}...")
    await asyncio.sleep(0.1)
    return f"A {item_id} done"

@toller.task(
    rate_limiter_instance=shared_api_rl,
    enable_retry=False, enable_circuit_breaker=False
)
async def call_endpoint_b(item_id: int):
    print(f"Calling B for {item_id}...")
    await asyncio.sleep(0.1)
    return f"B {item_id} done"

async def run_example2():
    print("\nExample 2: Shared Rate Limiter")
    tasks = []
    # These 4 calls exceed the shared limiter's burst of 2 (rate 2/sec), so some will be delayed.
    tasks.append(call_endpoint_a(1))
    tasks.append(call_endpoint_b(1))
    tasks.append(call_endpoint_a(2))
    tasks.append(call_endpoint_b(2))

    start_time = time.time()
    results = await asyncio.gather(*tasks)
    duration = time.time() - start_time

    for res in results:
        print(f"Shared RL Result: {res}")
    print(f"Total time for 4 calls with shared RL (2/sec, burst 2): {duration:.2f}s (expected > ~1.0s)")

if __name__ == "__main__":
    asyncio.run(run_example2())
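
The "expected > ~1.0s" figure in Example 2 can be checked by hand with a generic token bucket model. The sketch below is not Toller's `CallRateLimiter`; it is a simplified simulation, assuming the bucket starts full, refills continuously at `calls_per_second`, and serves simultaneous requests in order. The function name `token_bucket_start_times` is hypothetical.

```python
def token_bucket_start_times(n_calls, calls_per_second, max_burst_calls):
    """Simulate when each of n_calls simultaneous requests obtains a token.

    Generic token bucket model (an assumption, not Toller's code):
    the bucket starts full and refills at calls_per_second tokens per second.
    """
    tokens = float(max_burst_calls)
    now = 0.0
    start_times = []
    for _ in range(n_calls):
        if tokens < 1.0:
            # Wait just long enough for the bucket to hold one full token.
            now += (1.0 - tokens) / calls_per_second
            tokens = 1.0
        tokens -= 1.0
        start_times.append(now)
    return start_times


# Four calls against a limiter with rate 2/sec and burst 2.
print(token_bucket_start_times(4, 2, 2))  # [0.0, 0.0, 0.5, 1.0]
```

In this model the first two calls start immediately, the third after 0.5s, and the fourth after 1.0s. Adding the 0.1s sleep inside each endpoint gives a total of roughly 1.1s, consistent with the expectation printed by the example.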

Project details

Release history

0.0.3 (this version): May 13, 2025
0.0.2: May 13, 2025
0.0.1: Apr 5, 2025

Download files

Source Distribution

toller-0.0.3.tar.gz (1.6 MB), uploaded May 13, 2025, Source

Built Distribution

toller-0.0.3-py3-none-any.whl (14.2 kB), uploaded May 13, 2025, Python 3

File details

Details for the file toller-0.0.3.tar.gz.

File metadata

Download URL: toller-0.0.3.tar.gz
Upload date: May 13, 2025
Size: 1.6 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for toller-0.0.3.tar.gz
Algorithm	Hash digest
SHA256	`399713c1518e219bdd6115a3306967c2c80a1c7e92df3301c11e368dd391be97`
MD5	`467afd48edd39a7330ebfbccda393a82`
BLAKE2b-256	`e3dafb2b9a726047616d54b5670d3cb0911deb3b7500b457292c8cfb68bdaa5c`

See more details on using hashes here.
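
One way to use the SHA256 digest above is to compare it with the digest of the file you actually downloaded. The sketch below uses only Python's standard `hashlib`; the helper names `sha256_of_file` and `matches_published_hash` are illustrative, and the local path passed in is whatever location you saved the archive to.

```python
import hashlib

# SHA256 digest published above for toller-0.0.3.tar.gz.
EXPECTED_SHA256 = "399713c1518e219bdd6115a3306967c2c80a1c7e92df3301c11e368dd391be97"


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected=EXPECTED_SHA256):
    """True if the file at `path` has the expected SHA256 digest."""
    return sha256_of_file(path) == expected.lower()
```

For example, `matches_published_hash("toller-0.0.3.tar.gz")` should return `True` for an unmodified copy of the source distribution in the current directory. pip can perform the same check itself when a requirements file pins hashes and `--require-hashes` is used.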

Provenance

The following attestation bundles were made for toller-0.0.3.tar.gz:

Publisher: publish-to-pypi.yml on NolanTrem/toller

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: toller-0.0.3.tar.gz
- Subject digest: 399713c1518e219bdd6115a3306967c2c80a1c7e92df3301c11e368dd391be97
- Sigstore transparency entry: 211956665
- Sigstore integration time: May 13, 2025
Source repository:
- Permalink: NolanTrem/toller@da0da6a25379fd9557a8ce7759914f103733fbcb
- Branch / Tag: refs/tags/v0.0.3
- Owner: https://github.com/NolanTrem
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yml@da0da6a25379fd9557a8ce7759914f103733fbcb
- Trigger Event: release

File details

Details for the file toller-0.0.3-py3-none-any.whl.

File metadata

Download URL: toller-0.0.3-py3-none-any.whl
Upload date: May 13, 2025
Size: 14.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.12.9

File hashes

Hashes for toller-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b7fa0b16eea7d0cfd45277ad9cd48b1d730ed937366a248560b9ead390087e7b`
MD5	`1dfa615758c6720cdcfb673353e573ab`
BLAKE2b-256	`94f00ea071e148d4a99bcd07521b9f61f586774f9ec3ca34cf26a5b7ee8d9755`

See more details on using hashes here.

Provenance

The following attestation bundles were made for toller-0.0.3-py3-none-any.whl:

Publisher: publish-to-pypi.yml on NolanTrem/toller

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: toller-0.0.3-py3-none-any.whl
- Subject digest: b7fa0b16eea7d0cfd45277ad9cd48b1d730ed937366a248560b9ead390087e7b
- Sigstore transparency entry: 211956667
- Sigstore integration time: May 13, 2025
Source repository:
- Permalink: NolanTrem/toller@da0da6a25379fd9557a8ce7759914f103733fbcb
- Branch / Tag: refs/tags/v0.0.3
- Owner: https://github.com/NolanTrem
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish-to-pypi.yml@da0da6a25379fd9557a8ce7759914f103733fbcb
- Trigger Event: release
