Auriko API client - Intelligent LLM routing
Project description
Auriko Python SDK
Type-safe Python SDK for the Auriko intelligent LLM routing API.
Overview
Auriko provides a unified API for accessing multiple LLM providers with intelligent routing based on cost, latency, throughput, and availability. The API is fully compatible with OpenAI's Chat Completions API.
Two ways to use Auriko:
- OpenAI SDK (drop-in) — change
base_urlandapi_key, everything else works as-is - Auriko SDK (native) — typed access to Auriko-specific features: multi-model routing, routing metadata, cost tracking, and more
Installation
pip install auriko
Quick Start with OpenAI SDK
import openai
client = openai.OpenAI(
base_url="https://api.auriko.ai/v1",
api_key="your-api-key", # or set AURIKO_API_KEY env var
)
response = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "What is inference arbitrage?"}],
)
print(response.choices[0].message.content)
Quick Start with Auriko SDK
from auriko import Client
client = Client(api_key="your-api-key") # or set AURIKO_API_KEY env var
# Non-streaming
response = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
# Streaming
stream = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Hello!"}],
stream=True,
)
for chunk in stream:
if chunk.choices and chunk.choices[0].delta.content:
print(chunk.choices[0].delta.content, end="", flush=True)
Async
import asyncio
from auriko import AsyncClient
async def main():
client = AsyncClient(api_key="your-api-key")
response = await client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Hello!"}],
)
print(response.choices[0].message.content)
asyncio.run(main())
Authentication
Set AURIKO_API_KEY as an environment variable, or pass it directly:
client = Client(api_key="sk-auriko-...")
The SDK looks for the API key in this order:
api_keyargumentAURIKO_API_KEYenvironment variable- Raises
AuthenticationError
Auriko Extensions
Beyond OpenAI compatibility, the Auriko SDK provides:
Multi-model routing
response = client.chat.completions.create(
messages=[{"role": "user", "content": "Hello!"}],
gateway={
"models": ["gpt-4o", "claude-sonnet-4-6", "deepseek-chat"],
"routing": {"optimize": "cost"},
},
)
Routing metadata
stream = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Hello!"}],
stream=True,
)
for chunk in stream:
pass # consume stream
print(stream.routing_metadata) # provider, model, latency, cost
print(stream.usage) # token counts
Reasoning effort
response = client.chat.completions.create(
model="claude-sonnet-4-6",
messages=[{"role": "user", "content": "Think step by step."}],
reasoning_effort="high",
)
Provider extensions
response = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Hello"}],
extensions={"openai": {"logit_bias": {"1234": -100}}},
)
Strict parameter routing
Only route to providers that support the optional parameters you sent:
response = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Hello!"}],
seed=42,
gateway={"routing": {"require_parameters": True}},
)
See Filter by parameter support for the full list of parameters this applies to.
Available Resources
| Resource | Methods | Auth |
|---|---|---|
client.chat.completions |
create() |
API key |
client.models |
list_directory(), list_registry(), list_providers() |
Public |
client.me |
get() |
API key |
Configuration
client = Client(
api_key="sk-auriko-...", # default: AURIKO_API_KEY env var
base_url="https://api.auriko.ai/v1", # default
timeout=60.0, # seconds, default: 60
max_retries=2, # default: 2
)
Retries
The SDK retries automatically when both conditions hold:
error.typeisrate_limit_errororapi_errorerror.codeis NOTbudget_exhausted,insufficient_quota, orinternal_error
Network failures (DNS, connection refused, timeout) retry under a separate rule and surface as APIConnectionError after the last attempt.
Retry uses exponential backoff (0.5s base, 1.5x multiplier, 30s cap) with jitter and respects Retry-After headers.
Error Handling
from auriko import Client
from auriko.errors import (
AurikoAPIError,
AuthenticationError,
BadRequestError,
RateLimitError,
)
client = Client()
try:
response = client.chat.completions.create(
model="gpt-4o",
messages=[{"role": "user", "content": "Hello!"}],
)
except AuthenticationError:
print("Invalid API key")
except BadRequestError as e:
print(f"Bad request ({e.code}): {e.message}")
except RateLimitError as e:
if e.code == "budget_exhausted":
print("Budget limit reached")
else:
print("Rate limited — retries exhausted")
except AurikoAPIError as e:
print(f"API error {e.status_code} ({e.code}): {e.message}")
Error classes
All errors inherit from AurikoAPIError. Dispatch is HTTP-status-driven; use .code for fine-grained branching.
| Error | HTTP status | Description |
|---|---|---|
BadRequestError |
400, 413 | Invalid request parameters |
AuthenticationError |
401 | Invalid or missing API key |
PermissionDeniedError |
403 | Feature disabled or insufficient permissions |
NotFoundError |
404 | Resource or model not found |
ConflictError |
409 | Idempotency-key conflict |
RateLimitError |
429 | Rate limit, quota, or budget exhausted |
InternalServerError |
500 | Internal gateway error |
APIStatusError |
502, 503, 504 | Upstream/gateway unavailable |
APIConnectionError |
— | No response received (network/DNS/timeout) |
Resource Management
Use the client as a context manager to ensure connections are properly closed:
with Client() as client:
response = client.chat.completions.create(...)
# Async
async with AsyncClient() as client:
response = await client.chat.completions.create(...)
Development
Status
This SDK is in beta. There may be breaking changes between minor versions. We recommend pinning to a specific version.
Requirements
Python 3.10+
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file auriko-0.1.1.tar.gz.
File metadata
- Download URL: auriko-0.1.1.tar.gz
- Upload date:
- Size: 40.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5aec048b2b44d8aa746532a997269398a40dbcc16f999f9f91c1638a8b3a5881
|
|
| MD5 |
9c674d62fa555000bb5b6c6f733395b9
|
|
| BLAKE2b-256 |
761c883ffc3186e227f48877b72d344c8a662af8ec9e3dac632b08a46989e2b9
|
File details
Details for the file auriko-0.1.1-py3-none-any.whl.
File metadata
- Download URL: auriko-0.1.1-py3-none-any.whl
- Upload date:
- Size: 49.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.8.15
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
42832701f48d67c45b122f8bb8779d1c9a81b7e638b6d3df96112bdb477e4213
|
|
| MD5 |
fda2ca1bfabdd20d2678d8a6ff9d7b5b
|
|
| BLAKE2b-256 |
1f9220710aad92d6624d2fbca1bae1eb94fd6c9cbaac00f85b0787751a7f41bd
|