Python SDK for TokenRouter - Intelligent LLM Routing API

These details have not been verified by PyPI

Project links

Project description

TokenRouter Python SDK

Official Python SDK for TokenRouter — an intelligent LLM router that provides OpenAI‑compatible endpoints and a native routing endpoint.

This README focuses on the routing interfaces you’ll use today:

client.create(...) → Native routing endpoint (/route)
client.chat.completions.create(...) → OpenAI chat completions (/v1/chat/completions)
client.completions.create(...) → OpenAI legacy text completions (/v1/completions)

All calls are BYOK. Provide your TokenRouter API key, and configure provider keys in TokenRouter.

Installation

pip install tokenrouter

Quick Start (Native Route)

from tokenrouter import TokenRouter

client = TokenRouter(
    api_key="tr_...",
    base_url="http://localhost:8000"  # or https://api.tokenrouter.io
)

response = client.create(
  model="auto",
  mode="balanced",
  model_preferences=["gpt-4o", "gpt-4o-mini"],
  messages=[
    {"role": "developer", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"}
  ],
  # Optional (native route only): select key behavior
  # inline|stored|mixed|auto (default)
  key_mode="auto",
)

print(response.choices[0].message.content)

Endpoints

Native Route (/route)

OpenAI‑like request/response shape plus TokenRouter metadata: cost_usd, latency_ms, routed_model, routed_provider, service_tier, etc.

Non‑streaming

response = client.create(
  model="auto",
  mode="balanced",
  model_preferences=["gpt-4o", "gpt-4o-mini"],
  messages=[
    {"role": "developer", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"}
  ],
)
print(response.choices[0].message.content)

Streaming

for chunk in client.create(
  model="auto",
  stream=True,
  messages=[
    {"role": "developer", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Stream a short greeting."}
  ],
):
  delta = (chunk.choices[0].get("delta", {}) if chunk.choices else {})
  if delta.get("content"):
    print(delta["content"], end="")

Chat Completions (/v1/chat/completions)

OpenAI‑compatible chat completions.

Non‑streaming

response = client.chat.completions.create(
  model="auto",
  mode="balanced",
  model_preferences=["gpt-4o", "gpt-4o-mini"],
  messages=[
    {"role": "developer", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"}
  ],
)
print(response.choices[0].message.content)

Streaming

for chunk in client.chat.completions.create(
  model="auto",
  stream=True,
  messages=[
    {"role": "developer", "content": "You are a helpful assistant."},
    {"role": "user", "content": "Hello!"}
  ],
):
  delta = (chunk.choices[0].get("delta", {}) if chunk.choices else {})
  if delta.get("content"):
    print(delta["content"], end="")

Legacy Completions (/v1/completions)

OpenAI legacy text completion format. The SDK returns the raw OpenAI‑style dict.

Non‑streaming

resp = client.completions.create(
  model="auto",
  prompt="Say this is a test",
  mode="balanced",
)
print(resp["choices"][0]["text"])  # text completion shape

Streaming

for chunk in client.completions.create(
  model="auto",
  prompt="Stream this as text",
  stream=True,
):
  if chunk.get("choices"):
    print(chunk["choices"][0].get("text", ""), end="")

Errors

from tokenrouter import AuthenticationError, RateLimitError, InvalidRequestError, APIConnectionError

try:
  response = client.chat.completions.create(
    messages=[{"role": "user", "content": "Hello"}],
    model="auto"
  )
  print(response.choices[0].message.content)
except RateLimitError as e:
  print(f"Rate limited, retry after: {e.retry_after}s")
except AuthenticationError:
  print("Invalid API key")
except InvalidRequestError as e:
  print(f"Invalid request: {e}")
except APIConnectionError as e:
  print(f"Connection error: {e}")

Environment

export TOKENROUTER_API_KEY=tr_your-api-key
# Optional
export TOKENROUTER_BASE_URL=https://api.tokenrouter.io

# Optional provider keys (auto-detected for inline encryption on native /route only)
export OPENAI_API_KEY=sk-...
export ANTHROPIC_API_KEY=sk-ant-...
export GEMINI_API_KEY=...
export MISTRAL_API_KEY=...
export DEEPSEEK_API_KEY=...
export META_API_KEY=...

When `key_mode` is `inline`, `mixed`, or `auto` (native `/route` only), the SDK:
- Auto-loads provider keys from your environment or local `.env` (dev/CI) with the names above
- Encrypts keys client-side using the API's published public key (fetched from `/.well-known/tr-public-key`)
- Sends the encrypted bundle in the `X-TR-Provider-Keys` header (not in JSON)
- Never persists or logs provider secrets

Note: `key_mode` is not used on the OpenAI-compatible endpoints (`/v1/chat/completions`, `/v1/completions`).

Using OpenAI SDK against TokenRouter

from openai import OpenAI
client = OpenAI(api_key="sk_...", base_url="https://api.tokenrouter.io/v1")
response = client.chat.completions.create(
  model="auto",
  messages=[{"role": "user", "content": "Hello"}],
)

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.2.1

Nov 16, 2025

1.2.0

Nov 7, 2025

1.1.0

Nov 3, 2025

1.0.16

Nov 3, 2025

1.0.15

Sep 17, 2025

1.0.14

Sep 17, 2025

1.0.13

Sep 16, 2025

1.0.12

Sep 16, 2025

1.0.11

Sep 16, 2025

This version

1.0.8

Sep 4, 2025

1.0.7

Sep 3, 2025

1.0.5

Sep 3, 2025

1.0.4

Sep 2, 2025

1.0.2

Sep 2, 2025

1.0.1

Sep 2, 2025

1.0.0

Aug 10, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tokenrouter-1.0.8.tar.gz (12.4 kB view details)

Uploaded Sep 4, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

tokenrouter-1.0.8-py3-none-any.whl (11.1 kB view details)

Uploaded Sep 4, 2025 Python 3

File details

Details for the file tokenrouter-1.0.8.tar.gz.

File metadata

Download URL: tokenrouter-1.0.8.tar.gz
Upload date: Sep 4, 2025
Size: 12.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.13

File hashes

Hashes for tokenrouter-1.0.8.tar.gz
Algorithm	Hash digest
SHA256	`86321e5b24b028ee38c53301fc0a85480fd43341dbf81dab55e3cc8750f7207c`
MD5	`fc69e37b0fcc08e6ce23f9f00eb8e6a7`
BLAKE2b-256	`53363056a30c1b4d366dc8941951edfddc7b5dfba266e4e8cb8c321656c3a29c`

See more details on using hashes here.

File details

Details for the file tokenrouter-1.0.8-py3-none-any.whl.

File metadata

Download URL: tokenrouter-1.0.8-py3-none-any.whl
Upload date: Sep 4, 2025
Size: 11.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.10.13

File hashes

Hashes for tokenrouter-1.0.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8b1bfd26bdfc040d690827de378452c1929b7845ea327000ed889dc719a01517`
MD5	`5a4572f6464d09d9132c920756fb897d`
BLAKE2b-256	`c18e9aa06c05ef6a9279b6aff0dee498a246e8f84d3a29c3b447e4411a684f3c`

See more details on using hashes here.

tokenrouter 1.0.8

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

TokenRouter Python SDK

Installation

Quick Start (Native Route)

Endpoints

Native Route (/route)

Chat Completions (/v1/chat/completions)

Legacy Completions (/v1/completions)

Errors

Environment

Using OpenAI SDK against TokenRouter

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes