Reference keel-llm-protocol adapter for any OpenAI-compatible chat endpoint (OpenAI, Groq, OpenRouter, Mistral, vLLM, Ollama, ...).
Project description
keel-llm-adapter-openai
A reference
keel-llm-protocoladapter for any OpenAI-compatible chat endpoint — OpenAI, Groq, OpenRouter, Mistral, Together, Fireworks, vLLM, Ollama (OpenAI mode), and more.
This package grounds the protocol standard in a working implementation. The status-code → error-taxonomy mapping, response/usage/finish-reason parsing, SSE streaming, and tool-call parsing here are real and tested against the actual OpenAI-compatible wire format — so keel-llm-protocol is proven against a real provider API, not asserted in the abstract.
Is this for you?
Adopt when — you're consuming via keel-llm-protocol / keel-llm-reliability and want one adapter that maps OpenAI / Groq / OpenRouter / Mistral / Together / Fireworks / vLLM / Ollama failures to the same typed taxonomy.
Skip when — you're calling one provider directly in a simple app — the official openai SDK has a wider surface (batch, fine-tune, files, embeddings…); this adapter covers chat / streaming / tools.
Install
pip install keel-llm-adapter-openai # pulls in keel-llm-protocol + httpx
Use
from keel_llm_adapter_openai import OpenAIAdapter
from keel_llm_protocol import user
from keel_llm_protocol.errors import RateLimitError, AdapterError
# Point base_url at any OpenAI-compatible endpoint:
adapter = OpenAIAdapter(
model="llama-3.3-70b-versatile",
api_key="...",
base_url="https://api.groq.com/openai/v1",
provider="groq", # -> model_key = "groq:llama-3.3-70b-versatile"
)
try:
resp = await adapter.generate([user("Explain circuit breakers in one sentence.")])
print(resp.text, resp.usage.total_tokens, resp.finish_reason)
except RateLimitError as e:
... # healthy but throttled — back off; don't trip a breaker
except AdapterError as e:
if e.retryable:
... # transient / timeout — retry or fail over
It implements all three capability protocols, so it type-checks anywhere any of them is required:
from keel_llm_protocol import ModelAdapter, StreamingModelAdapter, ToolCallingModelAdapter
assert isinstance(adapter, ModelAdapter)
assert isinstance(adapter, StreamingModelAdapter)
assert isinstance(adapter, ToolCallingModelAdapter)
Streaming
async for chunk in adapter.stream([user("Write a haiku.")]):
print(chunk.delta, end="", flush=True)
if chunk.usage:
print(f"\n[{chunk.usage.total_tokens} tokens]")
Tools
from keel_llm_protocol import ToolSpec
weather = ToolSpec(name="get_weather", description="Get weather",
parameters={"type": "object", "properties": {"city": {"type": "string"}}})
resp = await adapter.generate_with_tools([user("Weather in SF?")], [weather])
for call in resp.tool_calls:
print(call.name, call.arguments) # YOU dispatch — the adapter only reports the request
Error mapping
Provider HTTP responses are mapped to the standard taxonomy so reliability logic is provider-agnostic:
| Provider response | Raised |
|---|---|
429 (+ Retry-After) |
RateLimitError(retry_after=...) — retryable |
| 401 / 403 | AuthenticationError |
400 + context_length_exceeded |
ContextLengthError |
| 400 + content-filter | ContentFilterError |
| 400 (other) | BadRequestError |
| 5xx | TransientError — retryable |
| timeout | AdapterTimeoutError — retryable |
| connection error | TransientError — retryable |
A 200 response whose finish_reason is content_filter is returned (not raised) with finish_reason="content_filter".
Status
0.1.2 — 0.x while the API stabilizes through year one (breaking changes possible at minor bumps, documented in the CHANGELOG; pin exact versions).
The Keel toolkit
Composable, vendor-neutral LLM reliability libraries on PyPI:
keel-llm-reliability · keel-llm-protocol · keel-llm-adapter-openai · keel-llm-adapter-anthropic · keel-llm-adapter-google · keel-circuit-breaker
MIT licensed.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file keel_llm_adapter_openai-0.1.2.tar.gz.
File metadata
- Download URL: keel_llm_adapter_openai-0.1.2.tar.gz
- Upload date:
- Size: 9.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1dfc21846f20a6f54caee8fae5ff08153e420e3559854c1123a8d42662bdc637
|
|
| MD5 |
99078ae2c7c9844da0afe42bd42a70c7
|
|
| BLAKE2b-256 |
c44926f78343ae82ec9f943a4c444a6f94235571444624189ecb15f945b8dc69
|
Provenance
The following attestation bundles were made for keel_llm_adapter_openai-0.1.2.tar.gz:
Publisher:
publish-py.yml on keelplatform/keel
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
keel_llm_adapter_openai-0.1.2.tar.gz -
Subject digest:
1dfc21846f20a6f54caee8fae5ff08153e420e3559854c1123a8d42662bdc637 - Sigstore transparency entry: 1610072826
- Sigstore integration time:
-
Permalink:
keelplatform/keel@2d54c903ad991fb68652520570a6571c69c477fb -
Branch / Tag:
refs/tags/py-llm-adapter-openai-v0.1.2 - Owner: https://github.com/keelplatform
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-py.yml@2d54c903ad991fb68652520570a6571c69c477fb -
Trigger Event:
push
-
Statement type:
File details
Details for the file keel_llm_adapter_openai-0.1.2-py3-none-any.whl.
File metadata
- Download URL: keel_llm_adapter_openai-0.1.2-py3-none-any.whl
- Upload date:
- Size: 7.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
13644d1b541e27495a1a23cc38dc94974db8bff59322baf9c6b7ab23501f51eb
|
|
| MD5 |
6f2de7df2de76fab0b562fc2687a5ef3
|
|
| BLAKE2b-256 |
85d916e64b28013d97f59735dbc8642dd9e283271b552112871bc3578ea55254
|
Provenance
The following attestation bundles were made for keel_llm_adapter_openai-0.1.2-py3-none-any.whl:
Publisher:
publish-py.yml on keelplatform/keel
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
keel_llm_adapter_openai-0.1.2-py3-none-any.whl -
Subject digest:
13644d1b541e27495a1a23cc38dc94974db8bff59322baf9c6b7ab23501f51eb - Sigstore transparency entry: 1610072900
- Sigstore integration time:
-
Permalink:
keelplatform/keel@2d54c903ad991fb68652520570a6571c69c477fb -
Branch / Tag:
refs/tags/py-llm-adapter-openai-v0.1.2 - Owner: https://github.com/keelplatform
-
Access:
private
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish-py.yml@2d54c903ad991fb68652520570a6571c69c477fb -
Trigger Event:
push
-
Statement type: