A Python module that provides rate limiting for the OpenAI API, using Redis as the caching backend. It helps manage API usage so that OpenAI's rate limits are not exceeded.
openai-ratelimiter
openai-ratelimiter is a simple and efficient rate limiter for the OpenAI API. It is designed to help prevent the API rate limit from being reached when using the OpenAI library. Currently, it supports only Redis as the caching service.
Installation
To install the openai-ratelimiter library, use pip:
pip install openai-ratelimiter
Redis Setup
This library uses Redis for caching. If you don't have a Redis server set up, you can pull the Redis Docker image and run a container as follows:
# Pull the Redis image
docker pull redis
# Run the Redis container
docker run --name some-redis -p 6379:6379 -d redis
This will set up a Redis server accessible at localhost on port 6379.
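If you want to confirm the server is reachable before configuring the limiter, a quick ping works. The snippet below is an optional sketch using the redis-py client, which is not otherwise required by the examples that follow.
import redis  # pip install redis

# Optional sanity check: ping the server the limiter will use.
r = redis.Redis(host="localhost", port=6379)
print(r.ping())  # prints True if Redis is reachable at localhost:6379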
Usage
The library provides two classes, ChatCompletionLimiter and TextCompletionLimiter, for rate-limiting API calls. Both take RPM (requests per minute) and TPM (tokens per minute) limits and track usage in Redis.
ChatCompletionLimiter
from openai_ratelimiter import ChatCompletionLimiter
import openai

openai.api_key = "{your API key}"
model_name = "gpt-3.5-turbo-16k"
messages = [
    {"role": "system", "content": "You are a helpful assistant."},
    {"role": "user", "content": "What is the capital of Morocco?"},
]
max_tokens = 200

# Limiter backed by the local Redis instance, with per-minute request and token budgets.
chatlimiter = ChatCompletionLimiter(
    model_name=model_name,
    RPM=3_000,
    TPM=250_000,
    redis_host="localhost",
    redis_port=6379,
)

# The context manager accounts for this request against the RPM/TPM budget
# before the API call is made.
with chatlimiter.limit(messages=messages, max_tokens=max_tokens):
    response = openai.ChatCompletion.create(
        model=model_name, messages=messages, max_tokens=max_tokens
    )
    ...
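Since limit() only needs the messages and max_tokens for a given request, the same limiter instance can sit in front of every chat call. The helper below is an illustrative sketch (the ask function and the question list are not part of the library) that reuses the chatlimiter, model_name, and max_tokens defined above.
# Hypothetical helper: every chat request goes through the same limiter.
def ask(question: str) -> str:
    msgs = [
        {"role": "system", "content": "You are a helpful assistant."},
        {"role": "user", "content": question},
    ]
    with chatlimiter.limit(messages=msgs, max_tokens=max_tokens):
        resp = openai.ChatCompletion.create(
            model=model_name, messages=msgs, max_tokens=max_tokens
        )
    return resp["choices"][0]["message"]["content"]

for q in ["What is the capital of Morocco?", "What is the capital of Spain?"]:
    print(ask(q))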
TextCompletionLimiter
from openai_ratelimiter import TextCompletionLimiter
import openai

openai.api_key = "{your API key}"
model_name = "text-davinci-003"
prompt = "What is the capital of Morocco?"
max_tokens = 200

textlimiter = TextCompletionLimiter(
    model_name=model_name,
    RPM=3_000,
    TPM=250_000,
    redis_host="localhost",
    redis_port=6379,
)

# Same pattern as above: reserve capacity for the prompt, then call the API.
with textlimiter.limit(prompt=prompt, max_tokens=max_tokens):
    response = openai.Completion.create(
        model=model_name, prompt=prompt, max_tokens=max_tokens
    )
    ...
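The pattern extends naturally to a batch of prompts. The loop below is a sketch that reuses the textlimiter, model_name, and max_tokens from the example above; the prompt list is illustrative only.
# Hypothetical batch: each prompt is accounted for before its API call.
prompts = ["What is the capital of Morocco?", "What is the capital of Spain?"]
for p in prompts:
    with textlimiter.limit(prompt=p, max_tokens=max_tokens):
        completion = openai.Completion.create(
            model=model_name, prompt=p, max_tokens=max_tokens
        )
    print(completion["choices"][0]["text"].strip())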
Future Plans
- In-memory caching
- Limiting for embeddings
- Limiting for DALL·E image model
- More functions that report the current state of the limiter