Drop-in AsyncOpenAI replacement that transparently batches requests using the batch API

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

These details have not been verified by PyPI

Project description

autobatcher

Drop-in replacement for AsyncOpenAI that transparently batches requests. This library is designed or use with the Doubleword Batch API. Support for OpenAI's batch API or other compatible APIs is best effort. If you experience any issues, please open an issue.

Why?

Batch LLM APIs offers 50% cost savings (and specialist inference providers like Doubleword offer 80%+ savings), but these APIs you to restructure your code around file uploads and polling. autobatcher lets you keep your existing async code while getting batch pricing automatically.

# Before: regular async calls (full price)
from openai import AsyncOpenAI
client = AsyncOpenAI()

# After: batched calls (50% off)
from autobatcher import BatchOpenAI
client = BatchOpenAI()

# Same interface, same code
response = await client.chat.completions.create(
    model="gpt-4o",
    messages=[{"role": "user", "content": "Hello!"}]
)

How it works

Requests are collected over a configurable time window (default: 10 seconds)
When the window closes or batch size is reached, requests are submitted as a batch
Results are polled and returned to waiting callers as they complete
Your code sees normal response objects (ChatCompletion, CreateEmbeddingResponse, Response)

Different request types (chat completions, embeddings, responses) can be mixed in a single batch — each result is parsed with the correct type automatically.

Installation

pip install autobatcher

Usage

Chat completions

import asyncio
from autobatcher import BatchOpenAI

async def main():
    client = BatchOpenAI(
        api_key="sk-...",  # or set OPENAI_API_KEY env var
    )

    response = await client.chat.completions.create(
        model="gpt-4o",
        messages=[{"role": "user", "content": "What is 2+2?"}],
    )
    print(response.choices[0].message.content)

    await client.close()

asyncio.run(main())

Embeddings

async def embed(client: BatchOpenAI):
    response = await client.embeddings.create(
        model="text-embedding-3-small",
        input="Hello, world!",
    )
    print(response.data[0].embedding[:5])

Responses API

async def respond(client: BatchOpenAI):
    response = await client.responses.create(
        model="gpt-4o",
        input="Explain quantum computing in one sentence.",
    )
    print(response.output[0].content[0].text)

Parallel requests

The real power comes when you have many requests:

async def process_many(prompts: list[str]) -> list[str]:
    client = BatchOpenAI(batch_size=500, batch_window_seconds=5.0)

    async def get_response(prompt: str) -> str:
        response = await client.chat.completions.create(
            model="gpt-4o-mini",
            messages=[{"role": "user", "content": prompt}],
        )
        return response.choices[0].message.content

    # All requests are batched together automatically
    results = await asyncio.gather(*[get_response(p) for p in prompts])

    await client.close()
    return results

Mixed batching

Different request types are automatically mixed into the same batch:

async def mixed(client: BatchOpenAI):
    chat, embedding = await asyncio.gather(
        client.chat.completions.create(
            model="gpt-4o",
            messages=[{"role": "user", "content": "Hello!"}],
        ),
        client.embeddings.create(
            model="text-embedding-3-small",
            input="Hello!",
        ),
    )

Context manager

async with BatchOpenAI() as client:
    response = await client.chat.completions.create(...)

Configuration

Parameter	Default	Description
`api_key`	`None`	OpenAI API key (falls back to `OPENAI_API_KEY` env var)
`base_url`	`None`	API base URL (for proxies or compatible APIs)
`batch_size`	`1000`	Submit batch when this many requests are queued
`batch_window_seconds`	`10.0`	Submit batch after this many seconds
`poll_interval_seconds`	`5.0`	How often to poll for batch completion
`completion_window`	`"24h"`	Batch completion window (`"24h"` or `"1h"`)

Supported endpoints

Endpoint	Method	Return type
`client.chat.completions.create()`	Chat completions	`ChatCompletion`
`client.embeddings.create()`	Embeddings	`CreateEmbeddingResponse`
`client.responses.create()`	Responses API	`Response`

Limitations

Batch API has a 24-hour completion window by default. 1hr SLAs is also offered with Doubleword.
No escalations when the completion window elapses
Not suitable for real-time/interactive use cases

License

MIT

Project details

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

fergusfinn

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.10.0

Apr 23, 2026

0.9.0

Apr 23, 2026

0.8.0

Apr 22, 2026

0.7.0

Apr 16, 2026

0.6.1

Apr 14, 2026

0.6.0

Apr 14, 2026

This version

0.5.0

Apr 14, 2026

0.4.1

Apr 13, 2026

0.4.0

Apr 13, 2026

0.3.1

Mar 16, 2026

0.3.0

Mar 16, 2026

0.2.0

Mar 3, 2026

0.1.1

Jan 5, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

autobatcher-0.5.0.tar.gz (25.0 kB view details)

Uploaded Apr 14, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

autobatcher-0.5.0-py3-none-any.whl (14.0 kB view details)

Uploaded Apr 14, 2026 Python 3

File details

Details for the file autobatcher-0.5.0.tar.gz.

File metadata

Download URL: autobatcher-0.5.0.tar.gz
Upload date: Apr 14, 2026
Size: 25.0 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for autobatcher-0.5.0.tar.gz
Algorithm	Hash digest
SHA256	`875cc1863f2f9eae625d5373635328492d96615260ebb5b6898e514bdef89397`
MD5	`6320ba8d9a6f98593ccd2e0f27670951`
BLAKE2b-256	`ac5bfce9a4c7c43d374100b59b498ed25970f9f6c31f48064bbe4c9577431868`

See more details on using hashes here.

Provenance

The following attestation bundles were made for autobatcher-0.5.0.tar.gz:

Publisher: publish.yml on doublewordai/autobatcher

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: autobatcher-0.5.0.tar.gz
- Subject digest: 875cc1863f2f9eae625d5373635328492d96615260ebb5b6898e514bdef89397
- Sigstore transparency entry: 1293689402
- Sigstore integration time: Apr 14, 2026
Source repository:
- Permalink: doublewordai/autobatcher@6af0c3c5e016e11ebce30655fecf7e9a67da7ff2
- Branch / Tag: refs/tags/autobatcher-v0.5.0
- Owner: https://github.com/doublewordai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@6af0c3c5e016e11ebce30655fecf7e9a67da7ff2
- Trigger Event: release

File details

Details for the file autobatcher-0.5.0-py3-none-any.whl.

File metadata

Download URL: autobatcher-0.5.0-py3-none-any.whl
Upload date: Apr 14, 2026
Size: 14.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for autobatcher-0.5.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`df3959cdf6215864662118b40958acb05a9c1d579fe00cc170c0d46ee0191a87`
MD5	`8255ff557baff76deed4ffb835b3e093`
BLAKE2b-256	`8584548e328ca7f2373f118e9419cb894835e29b3c7951a27cd87f13e722b354`

See more details on using hashes here.

Provenance

The following attestation bundles were made for autobatcher-0.5.0-py3-none-any.whl:

Publisher: publish.yml on doublewordai/autobatcher

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: autobatcher-0.5.0-py3-none-any.whl
- Subject digest: df3959cdf6215864662118b40958acb05a9c1d579fe00cc170c0d46ee0191a87
- Sigstore transparency entry: 1293689405
- Sigstore integration time: Apr 14, 2026
Source repository:
- Permalink: doublewordai/autobatcher@6af0c3c5e016e11ebce30655fecf7e9a67da7ff2
- Branch / Tag: refs/tags/autobatcher-v0.5.0
- Owner: https://github.com/doublewordai
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@6af0c3c5e016e11ebce30655fecf7e9a67da7ff2
- Trigger Event: release

autobatcher 0.5.0

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

autobatcher

Why?

How it works

Installation

Usage

Chat completions

Embeddings

Responses API

Parallel requests

Mixed batching

Context manager

Configuration

Supported endpoints

Limitations

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance