Last released Apr 2, 2025
A flexible Python library for managing batches of requests to LLM inference providers.