Multi-provider LLM client with unified message format and tool support

These details have not been verified by PyPI

Project description

LocalRouter

A unified multi-provider LLM client with consistent message formats and tool support across OpenAI, Anthropic, and Google GenAI.

Quick Start

Install the package:

pip install localrouter

Set your API keys as environment variables:

export OPENAI_API_KEY="your-openai-key"
export ANTHROPIC_API_KEY="your-anthropic-key" 
export GEMINI_API_KEY="your-gemini-key"  # or GOOGLE_API_KEY

Basic usage:

import asyncio
from localrouter import get_response, ChatMessage, MessageRole, TextBlock

async def main():
    messages = [
        ChatMessage(
            role=MessageRole.user, 
            content=[TextBlock(text="Hello, how are you?")]
        )
    ]
    
    response = await get_response(
        model="gpt-4.1",  # or "o3", "claude-sonnet-4-20250514", "gemini-2.5-pro", etc
        messages=messages
    )
    
    print(response.content[0].text)

asyncio.run(main())

Alternative Response Functions

LocalRouter provides several variants of get_response for different use cases:

Caching

To use disk caching, import get_response_cached as get_response:

# Import as get_response for consistent usage
from localrouter import get_response_cached as get_response

response = await get_response(
    model="gpt-4o-mini",
    messages=messages,
    cache_seed=12345  # Required for caching
)

This will return cached results whenever get_response is called with identical inputs and cache_seed is provided. If no cache_seed is provided, it will behave exactly like localrouter.get_response.

Retry with Backoff

Automatically retry failed requests with exponential backoff:

from localrouter import get_response_with_backoff as get_response

response = await get_response(
    model="gpt-4o-mini", 
    messages=messages
)

Caching + Backoff

Combine caching with retry logic:

from localrouter import get_response_cached_with_backoff as get_response

response = await get_response(
    model="gpt-4o-mini",
    messages=messages,
    cache_seed=12345  # Required for caching
)

Note: When using cached functions without cache_seed, they behave like non-cached versions (no caching occurs).

Images

from localrouter import ChatMessage, MessageRole, TextBlock, ImageBlock

# Text message
text_msg = ChatMessage(
    role=MessageRole.user,
    content=[TextBlock(text="Hello world")]
)
# Image message  
image_msg = ChatMessage(
    role=MessageRole.user,
    content=[
        ImageBlock.from_base64(base64_data, media_type="image/png"), # or: ImageBlock.from_file("image.png")
        TextBlock(text="What's in this image?")
    ]
)

Tool Calling

Define tools and get structured function calls:

from localrouter import ToolDefinition, get_response

# Define a tool
weather_tool = ToolDefinition(
    name="get_weather",
    description="Get current weather for a location",
    input_schema={
        "type": "object",
        "properties": {
            "location": {"type": "string", "description": "City name"}
        },
        "required": ["location"]
    }
)

# Use the tool
response = await get_response(
    model="gpt-4.1-nano",
    messages=[ChatMessage(
        role=MessageRole.user,
        content=[TextBlock(text="What's the weather in Paris?")]
    )],
    tools=[weather_tool]
)

# Check for tool calls
for block in response.content:
    if isinstance(block, ToolUseBlock):
        print(f"Tool: {block.name}, Args: {block.input}")

Structured Output

Get validated Pydantic models as responses:

from pydantic import BaseModel
from typing import List

class Event(BaseModel):
    name: str
    date: str
    participants: List[str]

response = await get_response(
    model="gpt-4.1-mini",
    messages=[ChatMessage(
        role=MessageRole.user,
        content=[TextBlock(text="Alice and Bob meet for lunch Friday")]
    )],
    response_format=Event
)

event = response.parsed  # Validated Event instance
print(f"Event: {event.name} on {event.date}")

Conversation Flow

Handle multi-turn conversations with tool results:

from localrouter import ToolResultBlock

# Initial request
messages = [ChatMessage(
    role=MessageRole.user,
    content=[TextBlock(text="Get weather for Tokyo")]
)]

# Get response with tool call
response = await get_response(model="gpt-4o-mini", messages=messages, tools=[weather_tool])
messages.append(response)

# Execute tool and add result
tool_call = response.content[0]  # ToolUseBlock
tool_result = ToolResultBlock(
    tool_use_id=tool_call.id,
    content=[TextBlock(text="Tokyo: 22°C, sunny")] # Tool result may also contain ImageBlock parts
)
messages.append(ChatMessage(role=MessageRole.user, content=[tool_result]))

# Continue conversation
final_response = await get_response(model="gpt-4o-mini", messages=messages, tools=[weather_tool])

Tool Definition

ToolDefinition(name, description, input_schema) - Define available tools
SubagentToolDefinition() - Predefined tool for sub-agents

Reasoning/Thinking Support

Configure reasoning budgets for models that support explicit thinking (GPT-5, Claude Sonnet 4+, Gemini 2.5):

from localrouter import ReasoningConfig

# Using effort levels (OpenAI-style)
response = await get_response(
    model="gpt-5",  # When available
    messages=messages,
    reasoning=ReasoningConfig(effort="high")  # "minimal", "low", "medium", "high"
)

# Using explicit token budget (Anthropic/Gemini-style)
response = await get_response(
    model="gemini-2.5-pro",
    messages=messages,
    reasoning=ReasoningConfig(budget_tokens=8000)
)

# Let model decide (Gemini dynamic thinking)
response = await get_response(
    model="gemini-2.5-flash",
    messages=messages,
    reasoning=ReasoningConfig(dynamic=True)
)

# Backward compatible dict config
response = await get_response(
    model="claude-sonnet-4-20250514",  # When available
    messages=messages,
    reasoning={"effort": "medium"}
)

The reasoning configuration automatically converts between provider formats:

OpenAI (GPT-5): Uses effort levels
Anthropic (Claude 4+): Uses budget_tokens
Google (Gemini 2.5): Uses thinking_budget with dynamic option

Models that don't support reasoning will ignore the configuration.

Custom Providers and Model Routing

LocalRouter supports regex patterns for model matching and prioritized provider selection. OpenRouter serves as a fallback for any model containing "/" (e.g., "meta-llama/llama-3.3-70b") with lowest priority.

from localrouter import add_provider, re

# Add a custom provider with regex pattern support
async def custom_get_response(model, messages, **kwargs):
    # Your custom implementation
    pass

add_provider(
    custom_get_response,
    models=["custom-model-1", re.compile(r"custom-.*")],  # Exact match or regex
    priority=50  # Lower = higher priority (default: 100, OpenRouter: 1000)
)

Request-Level Routing

LocalRouter allows you to register router functions that can dynamically modify model selection based on request parameters. This is useful for:

Creating model aliases
Routing requests with images to vision models
Selecting models based on temperature, tools, or other parameters
Implementing fallback strategies

from localrouter import register_router

# Example 1: Simple alias
def alias_router(req):
    if req['model'] == 'default':
        return 'gpt-5'
    return None  # Keep original model

register_router(alias_router)

# Now you can use the alias
response = await get_response(
    model="default",  # Will be routed to gpt-5
    messages=messages
)

# Example 2: Route based on message content
def vision_router(req):
    """Route requests with images to vision-capable models"""
    messages = req.get('messages', [])
    for msg in messages:
        for block in msg.content:
            if hasattr(block, '__class__') and 'ImageBlock' in block.__class__.__name__:
                return 'qwen/qwen3-vl-30b-a3b-instruct'
    return None  # Use original model for text-only requests

register_router(vision_router)

# Example 3: Route based on parameters
def temperature_router(req):
    """Use different models based on temperature"""
    temperature = req.get('temperature', 0)
    if temperature > 0.8:
        return 'gpt-5'  # Creative tasks
    return 'gpt-4.1-mini'  # Deterministic tasks

register_router(temperature_router)

Router Function Interface:

Input: Dictionary with keys: model, messages, tools, response_format, reasoning, and any other kwargs
Output: String (new model name) or None (keep original model)
Execution: Routers are applied in registration order, and each router sees the model name from the previous router

Logging

LocalRouter provides a flexible logging system to capture LLM requests and responses for debugging, monitoring, and analysis.

Basic Logging

from localrouter import register_logger

def my_logger(request, response, error):
    """
    request: Dict with model, messages, tools, etc.
    response: ChatMessage object (None if error occurred)
    error: Exception object (None if successful)
    """
    if error:
        print(f"Error calling {request['model']}: {error}")
    else:
        print(f"Success: {request['model']} returned {len(response.content)} blocks")

register_logger(my_logger)

File-Based Logging

Use the built-in log_to_dir() helper to automatically save requests and responses as JSON files:

from localrouter import register_logger, log_to_dir

# Log all requests to .llm/logs directory
register_logger(log_to_dir('.llm/logs'))

# Now all LLM calls will be logged
response = await get_response(
    model="gpt-4.1",
    messages=messages
)

Each log file contains:

Complete request parameters (model, messages, tools, etc.)
Full response with all content blocks
Error information if the request failed
Timestamp

Log files are named: {model-slug}_{timestamp}.json

Multiple Loggers

You can register multiple loggers that will all be called:

# Log to disk
register_logger(log_to_dir('.llm/logs'))

# Also send to monitoring service
def monitoring_logger(request, response, error):
    send_to_datadog(request, response, error)

register_logger(monitoring_logger)

Note: Logger errors are silently caught to prevent them from breaking your LLM calls.

Project details

These details have not been verified by PyPI

Development Status
- 4 - Beta
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language

Release history Release notifications | RSS feed

0.2.23

Apr 23, 2026

This version

0.2.22

Mar 15, 2026

0.2.21

Mar 4, 2026

0.2.20

Feb 20, 2026

0.2.19

Feb 20, 2026

0.2.18

Feb 20, 2026

0.2.17

Feb 19, 2026

0.2.16

Feb 19, 2026

0.2.15

Jan 23, 2026

0.2.14

Dec 16, 2025

0.2.13

Dec 13, 2025

0.2.12

Dec 13, 2025

0.2.11

Dec 13, 2025

0.2.9

Nov 20, 2025

0.2.7

Nov 19, 2025

0.2.6

Nov 10, 2025

0.2.5

Aug 25, 2025

0.2.4

Aug 10, 2025

0.2.3

Aug 8, 2025

0.2.2

Aug 8, 2025

0.2.1

Aug 8, 2025

0.2.0

Aug 8, 2025

0.1.1

Aug 5, 2025

0.1.0

Aug 5, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

localrouter-0.2.22.tar.gz (60.4 kB view details)

Uploaded Mar 15, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

localrouter-0.2.22-py3-none-any.whl (27.0 kB view details)

Uploaded Mar 15, 2026 Python 3

File details

Details for the file localrouter-0.2.22.tar.gz.

File metadata

Download URL: localrouter-0.2.22.tar.gz
Upload date: Mar 15, 2026
Size: 60.4 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for localrouter-0.2.22.tar.gz
Algorithm	Hash digest
SHA256	`d6b5532481c65fbc4bbc3a575a2dd7ee5c97f8234640e6a948fcf09aca5aa5cc`
MD5	`b8f183aad62d92d690e8a539f7f01f34`
BLAKE2b-256	`625372572928a0b144ac15f1f53fd36ece346f83a64fd41efc1b75db3f9e0444`

See more details on using hashes here.

Provenance

The following attestation bundles were made for localrouter-0.2.22.tar.gz:

Publisher: publish.yaml on longtermrisk/localrouter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: localrouter-0.2.22.tar.gz
- Subject digest: d6b5532481c65fbc4bbc3a575a2dd7ee5c97f8234640e6a948fcf09aca5aa5cc
- Sigstore transparency entry: 1108549479
- Sigstore integration time: Mar 15, 2026
Source repository:
- Permalink: longtermrisk/localrouter@c8feb0653475e9e199c4bde251b1e508e715c9c3
- Branch / Tag: refs/heads/main
- Owner: https://github.com/longtermrisk
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yaml@c8feb0653475e9e199c4bde251b1e508e715c9c3
- Trigger Event: push

File details

Details for the file localrouter-0.2.22-py3-none-any.whl.

File metadata

Download URL: localrouter-0.2.22-py3-none-any.whl
Upload date: Mar 15, 2026
Size: 27.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for localrouter-0.2.22-py3-none-any.whl
Algorithm	Hash digest
SHA256	`056acf1ff10a7c0026f34947474dd7927864a06d3c9eab18ab4e77a6ba70c20e`
MD5	`6e93d7cee4a7db0ffd96d8685e7196db`
BLAKE2b-256	`d5c014ea9d5aa7e374ff8e84e7c1bd59b467d8f280078ca5fdfecb7e5082ed74`

See more details on using hashes here.

Provenance

The following attestation bundles were made for localrouter-0.2.22-py3-none-any.whl:

Publisher: publish.yaml on longtermrisk/localrouter

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: localrouter-0.2.22-py3-none-any.whl
- Subject digest: 056acf1ff10a7c0026f34947474dd7927864a06d3c9eab18ab4e77a6ba70c20e
- Sigstore transparency entry: 1108549483
- Sigstore integration time: Mar 15, 2026
Source repository:
- Permalink: longtermrisk/localrouter@c8feb0653475e9e199c4bde251b1e508e715c9c3
- Branch / Tag: refs/heads/main
- Owner: https://github.com/longtermrisk
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yaml@c8feb0653475e9e199c4bde251b1e508e715c9c3
- Trigger Event: push

localrouter 0.2.22

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

LocalRouter

Quick Start

Alternative Response Functions

Caching

Retry with Backoff

Caching + Backoff

Images

Tool Calling

Structured Output

Conversation Flow

Tool Definition

Reasoning/Thinking Support

Custom Providers and Model Routing

Request-Level Routing

Logging

Basic Logging

File-Based Logging

Multiple Loggers

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance