OpenAI API proxy for fine-grained cost tracking & control and caching of responses

Project description

OpenAI Proxy [`openai-wrapi`]

A drop-in wrapper to the openai package that tracks costs per user, project, model and staging account.

Problem statement

OpenAI does not currently provide any way to monitor or limit API usage costs by user*, project or model. In fact, there is no concept of "project", only users (which correspond to email addresses), organizations (which correspond to OpenAI accounts and must be individually funded) and API keys (which can be used interchangeably across any organizations to which a user belongs).

This leads to a proliferation of API keys and users, which widens the attack surface from a security point of view. Furthermore, users cannot be forced to use MFA and may continue to use the API and create API keys even if their email address no longer exists.

Lastly, it is easy to make redundant calls to the API, incurring unnecessary costs, especially when developing in an interactive environment such as a Jupyter notebook.

* The latest version of the OpenAI usage dashboard shows number of calls per user, but not cost.

Solution

This repo provides a wrapper that checks usage limits before passing the request on to the OpenAI API and records the usage costs per user, project, model and staging account. It leverages the AWS IAM permission framework to control access to the OpenAI API without exposing the unique API keys per staging account. Responses from the OpenAI API are cached by default. Infrastructure as code (IaC) is provided to deploy the solution using a serverless architecture in AWS at minimal extra cost and latency.
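The flow described above (check the limit, consult the cache, forward the request, record the cost) can be illustrated with a small in-memory sketch. This is not the code deployed in the Lambda functions: `ProxySketch`, `LimitExceeded` and `fake_backend` are hypothetical names, the flat cost of 0.5 USD is made up, and the order of the limit check and cache lookup is an assumption.

```python
from dataclasses import dataclass, field
from typing import Callable, Dict, Tuple

Key = Tuple[str, str, str, str]  # (staging, user, project, model)


class LimitExceeded(Exception):
    """Raised when recorded spend has reached the configured limit."""


@dataclass
class ProxySketch:
    # Hypothetical stand-in for the OpenAI call: (model, prompt) -> (text, cost in USD).
    backend: Callable[[str, str], Tuple[str, float]]
    limits: Dict[Key, float] = field(default_factory=dict)
    usage: Dict[Key, float] = field(default_factory=dict)
    cache: Dict[Tuple[str, str], str] = field(default_factory=dict)

    def request(self, staging, user, project, model, prompt, caching=True):
        key = (staging, user, project, model)
        # 1. Refuse the request if the recorded spend has reached the limit.
        limit = self.limits.get(key)
        if limit is not None and self.usage.get(key, 0.0) >= limit:
            raise LimitExceeded(f"{key} has reached its limit of {limit} USD")
        # 2. Serve a cached response if one exists (no cost is recorded).
        cache_key = (model, prompt)
        if caching and cache_key in self.cache:
            return self.cache[cache_key]
        # 3. Forward the request to the backend.
        response, cost = self.backend(model, prompt)
        # 4. Record the cost and, if enabled, cache the response.
        self.usage[key] = self.usage.get(key, 0.0) + cost
        if caching:
            self.cache[cache_key] = response
        return response


def fake_backend(model, prompt):
    """Stand-in for the real API call; charges a flat, made-up 0.5 USD."""
    return f"echo: {prompt}", 0.5


proxy = ProxySketch(backend=fake_backend)
key = ("dev", "me", "my-project", "gpt-4")
proxy.limits[key] = 1.0
print(proxy.request(*key, "hi"))
```

In the deployed solution the usage and limit data live in DynamoDB and the cache may be ElastiCache; plain dictionaries are used here only to keep the sketch self-contained.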

Deploy

Ideally, you should have one OpenAI account per staging account (dev, prod). Create a terraform.tfvars file in the iac directory with the following variables:

profile                  = "default"   # AWS profile to use
region                   = "eu-west-2" # AWS region to deploy to
openai_api_key_dev       = "sk-XXX"    # OpenAI API key for dev account
openai_organization_dev  = "org-XXX"   # OpenAI organization ID for dev account
openai_api_key_prod      = "sk-YYY"    # OpenAI API key for prod account
openai_organization_prod = "org-YYY"   # OpenAI organization ID for prod account
num_azs                  = 3           # Number of availability zones to deploy to (limited by available Elastic IP addresses)
use_elasticache          = true        # Whether to use ElastiCache for Memcached
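Before deploying, it can be useful to confirm that no variable has been left out of terraform.tfvars. The following is a minimal sketch, assuming all eight variables listed above are required (some may in fact have defaults in the Terraform configuration) and that the file contains only simple `name = value` lines. `missing_tfvars` is a hypothetical helper, not part of this package.

```python
import re

# Assumption: every variable shown in the example file is required.
REQUIRED = {
    "profile",
    "region",
    "openai_api_key_dev",
    "openai_organization_dev",
    "openai_api_key_prod",
    "openai_organization_prod",
    "num_azs",
    "use_elasticache",
}


def missing_tfvars(text: str) -> set:
    """Return the required variable names that are not assigned in the tfvars text."""
    assigned = set()
    for line in text.splitlines():
        # Naive comment stripping: a '#' inside a quoted value would be cut too.
        line = line.split("#", 1)[0].strip()
        match = re.match(r"^([A-Za-z_][A-Za-z0-9_]*)\s*=", line)
        if match:
            assigned.add(match.group(1))
    return REQUIRED - assigned


example = 'profile = "default"  # AWS profile\nregion = "eu-west-2"\n'
print(sorted(missing_tfvars(example)))
```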

To deploy, run:

cd iac
terraform init
terraform apply -auto-approve

This will create:

A Lambda function to proxy calls to the OpenAI API per staging account (dev, prod).
A Lambda function to set usage limits and flush the cache per staging account (dev, prod).
A DynamoDB table to store usage and limit data per staging account (dev, prod).
An optional ElastiCache for Memcached cluster to cache OpenAI API responses.

Install

From PyPI

pip install openai-wrapi

From source

git clone
cd openai-proxy
pip install .

Usage

To use the proxy in your Python code, provided you have the appropriate IAM permissions, import it in place of the openai package:

import openai_proxy as openai

You no longer need to set the OpenAI API key or organization ID, as these are securely stored in the corresponding Lambda functions. If you plan to use packages such as langchain, which use the openai package internally, you only need to ensure that you have imported openai_proxy beforehand.

By default, the project associated with any API call will be N/A. To set the project name:

openai.set_project("my-project")

If you want to disable caching (enabled by default):

openai.set_caching(False)
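Response caching relies on identical requests mapping to the same cache entry. How this package builds its cache keys is not documented here; the sketch below shows one common approach, hashing a canonical JSON form of the request. `cache_key` and its `endpoint` and `params` arguments are hypothetical names used for illustration only.

```python
import hashlib
import json


def cache_key(endpoint: str, params: dict) -> str:
    """Return a deterministic key for a request (hypothetical helper)."""
    # sort_keys makes the key independent of the order in which arguments were given.
    payload = json.dumps({"endpoint": endpoint, "params": params}, sort_keys=True)
    return hashlib.sha256(payload.encode("utf-8")).hexdigest()


a = cache_key("chat/completions", {"model": "gpt-4", "temperature": 0})
b = cache_key("chat/completions", {"temperature": 0, "model": "gpt-4"})
print(a == b)
```

With a scheme like this, re-running the same notebook cell produces the same key and can be answered from the cache, whereas changing any parameter yields a new key and a fresh API call.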

Admin

Again, supposing you have the IAM permissions to invoke the openai-admin-{staging} Lambda function, you can set the usage limits per user, project and model:

openai.set_limits(
    limit=10,              # 10 USD
    staging="dev",         # Dev account
    project="my-project",  # Project name
    user="me",             # Optional
    model="gpt-4",         # Optional
)

You can also flush the cache:

openai.flush_cache()

Note that this wrapper currently works for major versions 0 and 1 of the openai package.
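Since user and model are optional when setting a limit, one plausible interpretation is that an omitted field applies to every user or every model. The sketch below illustrates that interpretation only; it is an assumption, not the documented behaviour of the admin Lambda function, and `Limit` and `applicable_limits` are hypothetical names.

```python
from dataclasses import dataclass
from typing import List, Optional


@dataclass(frozen=True)
class Limit:
    limit: float                 # USD
    staging: str
    project: str
    user: Optional[str] = None   # Assumption: None applies to every user
    model: Optional[str] = None  # Assumption: None applies to every model

    def matches(self, staging: str, project: str, user: str, model: str) -> bool:
        return (
            self.staging == staging
            and self.project == project
            and self.user in (None, user)
            and self.model in (None, model)
        )


def applicable_limits(limits: List[Limit], staging, project, user, model) -> List[Limit]:
    """Return every limit that would apply to the given request."""
    return [lim for lim in limits if lim.matches(staging, project, user, model)]


limits = [
    Limit(10, "dev", "my-project"),
    Limit(5, "dev", "my-project", user="me", model="gpt-4"),
]
print(applicable_limits(limits, "dev", "my-project", "me", "gpt-4"))
```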

TODO

Pass in timeout.
Dashboard to view usage.
Rate limits.
Handle streaming / async requests.

Release history

0.1.8 (Nov 17, 2023)
0.1.7 (Nov 17, 2023)
0.1.6 (Nov 15, 2023)
0.1.5 (Nov 14, 2023)
0.1.4 (Nov 14, 2023)
0.1.3 (Nov 13, 2023)
0.1.2 (Nov 13, 2023)
0.1.1 (Nov 13, 2023)
0.1.0 (Nov 13, 2023)
0.0.5 (Nov 8, 2023)
0.0.4 (Nov 8, 2023)
0.0.3 (Nov 8, 2023)
0.0.2 (Nov 8, 2023, this version)
0.0.1 (Nov 8, 2023)

Download files

Source Distribution

openai_wrapi-0.0.2.tar.gz (6.9 kB)

Uploaded Nov 8, 2023 Source

File details

Details for the file openai_wrapi-0.0.2.tar.gz.

File metadata

Download URL: openai_wrapi-0.0.2.tar.gz
Upload date: Nov 8, 2023
Size: 6.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.8.0 colorama/0.4.6 importlib-metadata/5.2.0 keyring/23.13.1 pkginfo/1.9.6 readme-renderer/34.0 requests-toolbelt/0.10.1 requests/2.31.0 rfc3986/1.5.0 tqdm/4.65.0 urllib3/1.26.16 CPython/3.10.12

File hashes

Hashes for openai_wrapi-0.0.2.tar.gz
Algorithm	Hash digest
SHA256	`4700f0a03c2d0cc8e6fee5c41036cb62cd8c1bfde7a013f4b4897425023d7070`
MD5	`7c58a21a816c4063104f66e61026ca1f`
BLAKE2b-256	`21ba9997a9197a3cbe5edc9279fdc14bf083af90131fe34ac483dca5aba035cd`
