
Python SDK for TokenRouter - Intelligent LLM Routing API


TokenRouter Python SDK

An OpenAI Responses API-compatible client for TokenRouter, an intelligent LLM routing service.

Installation

pip install tokenrouter

Quick Start

import os

import tokenrouter

client = tokenrouter.TokenRouter(
    api_key=os.environ["TOKENROUTER_API_KEY"],  # This is the default and can be omitted
    base_url=os.environ.get("TOKENROUTER_BASE_URL", "https://api.tokenrouter.io/api"),  # Default
)

response = client.responses.create(
    model="gpt-4.1",
    input="Tell me a three sentence bedtime story about a unicorn.",
)

print(response.output_text)

OpenAI Compatibility

This SDK is designed to be a drop-in replacement for OpenAI's SDK when using the Responses API. Simply change your import and API key:

import os

# Before (OpenAI)
from openai import OpenAI
client = OpenAI(api_key=os.environ["OPENAI_API_KEY"])

# After (TokenRouter)
import tokenrouter
client = tokenrouter.TokenRouter(api_key=os.environ["TOKENROUTER_API_KEY"])

Configuration

Environment Variables

export TOKENROUTER_API_KEY=tr_your-api-key
export TOKENROUTER_BASE_URL=https://api.tokenrouter.io/api  # Optional

Type Support

The SDK provides type hints for better IDE support, including a TypedDict for request parameters:

from tokenrouter import Response
from tokenrouter.types import ResponsesCreateParams

params: ResponsesCreateParams = {
    "input": "Hello",
    "model": "gpt-4.1"
}

response: Response = client.responses.create(**params)


Client Options

from tokenrouter import TokenRouter

client = TokenRouter(
    api_key='tr_...',  # Your API key
    base_url='https://api.tokenrouter.io/api',  # API base URL
    timeout=60.0,  # Request timeout in seconds (default: 60)
    max_retries=3,  # Max retry attempts (default: 3)
    headers={  # Additional headers
        'X-Custom-Header': 'value'
    }
)

API Reference

Create Response

response = client.responses.create(
    # Required
    input="Your prompt here",  # or list of input items

    # Optional
    model="gpt-4.1",  # Model to use
    instructions="System instructions",
    max_output_tokens=1000,
    temperature=0.7,
    top_p=0.9,
    stream=False,  # Set to True for streaming
    tools=[],  # Function calling tools
    tool_choice="auto",
    text={"format": {"type": "text"}},  # Response format
    # ... other OpenAI-compatible parameters
)

# Access the response text directly
print(response.output_text)

Streaming Responses

stream = client.responses.create(
    input="Write a poem",
    stream=True
)

for event in stream:
    if event.type == "response.delta" and event.delta and event.delta.output:
        for item in event.delta.output:
            if item.get("content"):
                for content in item["content"]:
                    if content.get("text"):
                        print(content["text"], end="", flush=True)

Function Calling

response = client.responses.create(
    input="What's the weather in San Francisco?",
    tools=[
        {
            "type": "function",
            "function": {
                "name": "get_weather",
                "description": "Get the current weather",
                "parameters": {
                    "type": "object",
                    "properties": {
                        "location": {"type": "string"}
                    },
                    "required": ["location"]
                }
            }
        }
    ]
)

# Check for function calls in the response
for item in response.output:
    if item.type == "tool_call" and item.tool_calls:
        for tool_call in item.tool_calls:
            if tool_call.function:
                print(f"Function: {tool_call.function.get('name')}")
                print(f"Arguments: {tool_call.function.get('arguments')}")
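Once a function call has been extracted, its arguments arrive as a JSON string that must be parsed before invoking your own implementation. A minimal dispatch sketch (the `get_weather` implementation and `registry` here are hypothetical, for illustration only):

```python
import json

def dispatch_tool_call(name: str, arguments_json: str, registry: dict):
    """Parse the JSON-encoded arguments and invoke the registered tool."""
    args = json.loads(arguments_json)
    return registry[name](**args)

# Hypothetical local implementation matching the schema above.
def get_weather(location: str) -> str:
    return f"Sunny in {location}"

registry = {"get_weather": get_weather}
result = dispatch_tool_call("get_weather", '{"location": "San Francisco"}', registry)
print(result)  # Sunny in San Francisco
```

The result would then be sent back to the model in a follow-up request so it can produce its final answer.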

Multi-turn Conversations

# First message
response1 = client.responses.create(
    input="My name is Alice",
    store=True  # Store for later retrieval
)

# Continue conversation
response2 = client.responses.create(
    input="What's my name?",
    previous_response_id=response1.id
)

Other Methods

# Get response by ID
response = client.responses.get("resp_123")

# Delete response
result = client.responses.delete("resp_123")

# Cancel background response
response = client.responses.cancel("resp_123")

# List input items
items = client.responses.list_input_items("resp_123")

Async Support

The SDK provides a fully async client for asynchronous applications:

import asyncio
from tokenrouter import AsyncTokenRouter

async def main():
    async with AsyncTokenRouter(api_key="tr_...") as client:
        response = await client.responses.create(
            input="Hello, world!"
        )
        print(response.output_text)

asyncio.run(main())

Async Streaming

async with AsyncTokenRouter(api_key="tr_...") as client:
    stream = await client.responses.create(
        input="Count to 5",
        stream=True
    )

    async for event in stream:
        if event.type == "response.delta" and event.delta:
            # Process streaming chunks
            pass

Response Format

The SDK adds a convenience property output_text to responses that aggregates all text output:

response = client.responses.create(input="Hello")

# Access aggregated text directly
print(response.output_text)

# Or access the full response structure
print(response.output)  # List of output items
print(response.usage)  # Token usage
print(response.model)  # Model used
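If you need the same kind of aggregation over raw output items (for example after reassembling streamed chunks), it can be sketched as follows, assuming an OpenAI-style output shape of message items containing `output_text` content parts:

```python
def aggregate_output_text(output: list) -> str:
    """Concatenate the text of all output_text content parts, in order."""
    parts = []
    for item in output:
        for content in item.get("content") or []:
            if content.get("type") == "output_text":
                parts.append(content.get("text", ""))
    return "".join(parts)

sample_output = [
    {"type": "message", "content": [
        {"type": "output_text", "text": "Hello, "},
        {"type": "output_text", "text": "world!"},
    ]}
]
print(aggregate_output_text(sample_output))  # Hello, world!
```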

Error Handling

from tokenrouter import (
    TokenRouterError,
    AuthenticationError,
    RateLimitError,
    InvalidRequestError
)

try:
    response = client.responses.create(input="Hello")
except AuthenticationError:
    print("Invalid API key")
except RateLimitError as e:
    print(f"Rate limit exceeded, retry after: {e.retry_after}")
except InvalidRequestError as e:
    print(f"Invalid request: {e.message}")
except TokenRouterError as e:
    print(f"Unexpected error: {e}")
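The retry_after value surfaced by RateLimitError lends itself to a simple retry loop. A generic sketch (the helper below is illustrative, not part of the SDK) that honors a retry_after attribute when the exception carries one and otherwise falls back to exponential backoff:

```python
import time

def with_retries(fn, retriable, max_retries=3, base_delay=1.0):
    """Call fn(), retrying on `retriable` exceptions.

    Sleeps for the exception's retry_after attribute when present,
    otherwise for an exponentially growing delay.
    """
    for attempt in range(max_retries + 1):
        try:
            return fn()
        except retriable as e:
            if attempt == max_retries:
                raise
            delay = getattr(e, "retry_after", None) or base_delay * (2 ** attempt)
            time.sleep(delay)
```

Usage might look like `with_retries(lambda: client.responses.create(input="Hello"), RateLimitError)`, though the client's built-in max_retries already covers many transient failures.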

Type Hints

The SDK provides comprehensive type hints for all models:

from tokenrouter import TokenRouter, Response, ResponseStreamEvent
from typing import Iterator

def process_response(response: Response) -> str:
    return response.output_text or ""

def handle_stream(stream: Iterator[ResponseStreamEvent]) -> None:
    for event in stream:
        # Process events with full type support
        pass

Examples

See the examples directory for more detailed usage.

Requirements

  • Python 3.7+
  • httpx>=0.24.0
  • typing-extensions>=4.0.0

License

MIT

Download files

Download the file for your platform.

Source Distribution

tokenrouter-1.0.15.tar.gz (12.1 kB)

Built Distribution

tokenrouter-1.0.15-py3-none-any.whl (9.7 kB)

File details

Details for the file tokenrouter-1.0.15.tar.gz.

File metadata

  • Download URL: tokenrouter-1.0.15.tar.gz
  • Size: 12.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.13

File hashes

Hashes for tokenrouter-1.0.15.tar.gz:

  • SHA256: 2137b3c13d5f7eb86248001f7934c4de60785c38a48b13f159d094d104f4ad31
  • MD5: 82226fd7bd4a53f4ce9984954522703a
  • BLAKE2b-256: 898b4ea2e1c33453f1b30f64b050c4a6eeffd7648984f0a60b30ab2c3d187b0d

File details

Details for the file tokenrouter-1.0.15-py3-none-any.whl.

File metadata

  • Download URL: tokenrouter-1.0.15-py3-none-any.whl
  • Size: 9.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.13

File hashes

Hashes for tokenrouter-1.0.15-py3-none-any.whl:

  • SHA256: 981ae75b52a76453828253c91a56a211833a5375e2058cca3757a6471e5248fd
  • MD5: c083d52f1416c27a606d3f576fa6f7a0
  • BLAKE2b-256: 5993ed1716ab3b2afdf143efa48c1abc60521d4def7b797abb43ea9667300577
