
Async OpenAI client for fast and efficient API requests, built on the aiohttp module.



Speedy OpenAI


A high-performance, asynchronous Python client for the OpenAI API with built-in rate limiting and concurrency control.

Features

  • ⚡ Asynchronous request handling for optimal performance
  • 🔄 Built-in rate limiting for both requests and tokens
  • 🎛️ Configurable concurrency control
  • 🔁 Automatic retry mechanism with backoff
  • 📊 Progress tracking for batch requests
  • 🎯 Token counting and management
  • 📝 Comprehensive logging

Installation

pip install speedy-openai

Quick Start

import asyncio
from speedy_openai import OpenAIClient

async def main():
    # Initialize the client
    client = OpenAIClient(
        api_key="your-api-key",
        max_requests_per_min=5000,  # Optional: default 5000
        max_tokens_per_min=15000000,  # Optional: default 15M
        max_concurrent_requests=250  # Optional: default 250
    )

    # Single request
    request = {
        "custom_id": "req1",
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": "gpt-3.5-turbo",
            "messages": [{"role": "user", "content": "Hello!"}]
        }
    }
    
    response = await client.process_request(request)

    # Batch requests
    requests = [request, request]  # List of requests
    responses = await client.process_batch(requests)

if __name__ == "__main__":
    asyncio.run(main())

Configuration Options

Parameter                 Default    Description
api_key                   Required   Your OpenAI API key
max_requests_per_min      5000       Maximum API requests per minute
max_tokens_per_min        15000000   Maximum tokens per minute
max_concurrent_requests   250        Maximum concurrent requests
max_retries               5          Maximum retry attempts
max_sleep_time            60         Maximum sleep time between retries (seconds)
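
Assuming every row in the table maps to a keyword argument of the OpenAIClient constructor (the Quick Start shows the first three; max_retries and max_sleep_time are included here on that assumption), a fully specified client might look like this:

from speedy_openai import OpenAIClient

client = OpenAIClient(
    api_key="your-api-key",
    max_requests_per_min=5000,        # API requests allowed per minute
    max_tokens_per_min=15_000_000,    # tokens allowed per minute
    max_concurrent_requests=250,      # in-flight requests at any moment
    max_retries=5,                    # retry attempts per failed request
    max_sleep_time=60,                # seconds to wait between retries
)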

Features in Detail

Rate Limiting

The client includes a sophisticated rate limiter that manages both request frequency and token usage (a simplified sketch follows the list):

  • Automatically tracks remaining requests and tokens
  • Updates limits from API response headers
  • Implements waiting periods when limits are reached
  • Supports dynamic limit adjustments
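
The real limiter also syncs its budgets with OpenAI's response headers; as a rough illustration of the core idea only (not the package's actual code), a minimal per-minute budget could look like this:

import asyncio
import time

class MinuteBudget:
    """Toy limiter: at most `max_per_min` units per rolling one-minute window."""

    def __init__(self, max_per_min: int):
        self.max_per_min = max_per_min
        self.window_start = time.monotonic()
        self.used = 0

    async def acquire(self, amount: int = 1) -> None:
        now = time.monotonic()
        if now - self.window_start >= 60:             # fresh minute: reset the budget
            self.window_start, self.used = now, 0
        if self.used + amount > self.max_per_min:     # budget exhausted: wait out the window
            await asyncio.sleep(60 - (now - self.window_start))
            self.window_start, self.used = time.monotonic(), 0
        self.used += amount

Two such budgets, one counting requests and one counting tokens, approximate what the client tracks for you automatically.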

Concurrency Control

  • Manages concurrent requests using asyncio semaphores (see the sketch after this list)
  • Prevents overwhelming the API with too many simultaneous requests
  • Configurable maximum concurrent requests
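
The semaphore pattern itself is standard asyncio; a minimal sketch of the idea (not the library's internals):

import asyncio

semaphore = asyncio.Semaphore(250)   # mirrors max_concurrent_requests

async def limited(coro):
    # Only 250 coroutines may hold the semaphore at once; the rest wait here.
    async with semaphore:
        return await coro

Wrapping each request coroutine in limited() before handing the batch to asyncio.gather caps how many are in flight at any moment.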

Retry Mechanism

Built-in retry logic for handling common API errors (a sketch of the technique follows the list):

  • Automatic retries with fixed wait times
  • Configurable maximum retry attempts
  • Specific exception handling for API-related errors
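
tenacity is among the requirements, so a comparable wrapper can be built with it. This is a sketch of the general technique rather than the client's own code; the 5 attempts and 60-second wait simply mirror the defaults in the configuration table:

import aiohttp
from tenacity import retry, stop_after_attempt, wait_fixed

@retry(stop=stop_after_attempt(5), wait=wait_fixed(60), reraise=True)
async def post_with_retry(session: aiohttp.ClientSession, url: str, payload: dict) -> dict:
    async with session.post(url, json=payload) as resp:
        resp.raise_for_status()       # HTTP errors raise here and trigger another attempt
        return await resp.json()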

Progress Tracking

Batch requests include:

  • Progress bar visualization using tqdm (the underlying pattern is sketched after this list)
  • Processing time logging
  • Detailed success/failure reporting
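
The progress bar is handled inside process_batch, so no extra code is needed; for reference, the general tqdm-plus-asyncio pattern looks roughly like this (illustration only, not the library's implementation):

import asyncio
from tqdm.asyncio import tqdm_asyncio

async def work(i: int) -> int:
    await asyncio.sleep(0.1)          # stand-in for an API call
    return i

async def run_all():
    # tqdm_asyncio.gather awaits the tasks while rendering a progress bar
    return await tqdm_asyncio.gather(*(work(i) for i in range(20)))

asyncio.run(run_all())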

Error Handling

The client includes comprehensive error handling (a usage sketch follows the list):

  • API response validation
  • Rate limit handling
  • Network error recovery
  • Invalid request detection
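
The exact exception types raised by the client are not documented here, so the handlers below use aiohttp's standard exceptions as an assumption; adjust them to whatever process_request actually surfaces:

import aiohttp

async def safe_call(client, request: dict):
    try:
        return await client.process_request(request)
    except aiohttp.ClientResponseError as exc:   # non-2xx API responses
        print(f"API error {exc.status}: {exc.message}")
    except aiohttp.ClientError as exc:           # connection-level failures
        print(f"Network error: {exc}")
    return None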

Requirements

  • Python 3.7+
  • aiohttp
  • tiktoken
  • tenacity
  • tqdm
  • loguru
  • pydantic

Common Use Cases

1. Chat Completion with GPT-4

import asyncio
from speedy_openai import OpenAIClient

async def chat_with_gpt4():
    client = OpenAIClient(api_key="your-api-key")
    
    request = {
        "custom_id": "chat-1",
        "method": "POST",
        "url": "/v1/chat/completions",
        "body": {
            "model": "gpt-4",
            "messages": [
                {"role": "system", "content": "You are a helpful assistant."},
                {"role": "user", "content": "Explain quantum computing in simple terms."}
            ],
            "temperature": 0.7
        }
    }
    
    response = await client.process_request(request)
    print(response["response"]["choices"][0]["message"]["content"])

asyncio.run(chat_with_gpt4())

2. Batch Processing Multiple Conversations

async def process_multiple_conversations():
    client = OpenAIClient(api_key="your-api-key")
    
    conversations = [
        {"role": "user", "content": "What is AI?"},
        {"role": "user", "content": "Explain machine learning."},
        {"role": "user", "content": "What is deep learning?"}
    ]
    
    requests = [
        {
            "custom_id": f"batch-{i}",
            "method": "POST",
            "url": "/v1/chat/completions",
            "body": {
                "model": "gpt-3.5-turbo",
                "messages": [conv],
                "temperature": 0.7
            }
        }
        for i, conv in enumerate(conversations)
    ]
    
    responses = await client.process_batch(requests)
    return responses

asyncio.run(process_multiple_conversations())

Testing

The project uses pytest for testing. To run the tests:

  1. Clone the repository:
     git clone https://github.com/yourusername/speedy-openai.git
     cd speedy-openai
  2. Install development dependencies:
     poetry install
  3. Run tests:
     poetry run pytest

Test Structure

The test suite includes:

  • Unit tests for core functionality
  • Integration tests for API interactions
  • Rate limiting tests
  • Concurrency tests
  • Error handling tests

Writing Tests

When contributing new features, please ensure the following (a minimal example test is sketched after the list):

  • All new features have corresponding tests
  • Test coverage remains above 80%
  • Tests are properly documented
  • Both success and failure cases are covered
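
As a shape reference only: the test below stubs the network layer rather than exercising the real client, and pytest-asyncio is assumed as a dev dependency. It shows the success-case structure; a matching failure-case test would assert on the raised exception instead.

import pytest
from speedy_openai import OpenAIClient

@pytest.mark.asyncio
async def test_process_request_echoes_custom_id(monkeypatch):
    client = OpenAIClient(api_key="test-key")

    async def fake_request(request):
        # Stub the HTTP layer so the test needs no network access or real API key.
        return {"custom_id": request["custom_id"], "response": {"choices": []}}

    monkeypatch.setattr(client, "process_request", fake_request)

    result = await client.process_request({"custom_id": "req1"})
    assert result["custom_id"] == "req1"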

License

This project is licensed under the MIT License - see the LICENSE file for details.

Contributing

Contributions are welcome! Please feel free to submit a Pull Request.

Support

For issues, questions, or contributions, please create an issue in the GitHub repository.

Credits

Blog post by Villoro

