A unified interface for querying Large Language Models (LLMs) across multiple providers using LiteLLM and OpenRouter

LLM Interface

A unified interface for querying Large Language Models (LLMs) across multiple providers using LiteLLM and OpenRouter. This package provides intelligent model routing that automatically selects the best provider for each model request.

What is LiteLLM?

LiteLLM is a Python library that provides a unified interface to call multiple LLM APIs with a consistent OpenAI-like API. It supports 100+ LLM providers including:

OpenAI (GPT-4, GPT-3.5, etc.)
Anthropic (Claude models)
Azure OpenAI
Google (Gemini, PaLM)
OpenRouter (aggregator for multiple models)
And many more...

LiteLLM handles provider-specific differences, retries, rate limiting, and error handling, allowing you to switch between providers with minimal code changes.

What is OpenRouter?

OpenRouter is a unified API that provides access to 100+ LLM models from various providers through a single interface. It's particularly useful when:

You don't have direct API keys for specific providers
You want to access models not available through your direct provider accounts
You need a fallback option when your primary provider is unavailable
You want to compare models across different providers

OpenRouter requires credits (free tier available) and routes requests to the appropriate provider on your behalf.

Architecture

This package uses a three-tier routing system:

Azure (highest priority): Direct Azure OpenAI deployments
Provider (medium priority): Direct API access (OpenAI, Anthropic, etc.)
OpenRouter (fallback): Unified API for models not available through other routes

The system prioritizes routes in this order: Azure → Provider → OpenRouter.

When a requested name has no exact match in any catalog, routing is handled by a "routing judge": an LLM that selects the best route based on:

Model name matching (semantic and exact)
Available API keys
Model availability in each catalog

Setup

1. Install the Package

Option A: Install from Local Directory (Development)

cd all-the-llms
pip install -e .

Option B: Install from Source

cd all-the-llms
pip install .

Option C: Build and Install Distribution

cd all-the-llms
pip install build
python -m build
pip install dist/all_the_llms-*.whl

The package will automatically install all required dependencies (litellm, openrouter, python-dotenv, pydantic).

2. Environment Variables

Create a .env file in your project root. At minimum, you need OPENROUTER_API_KEY:

Required

OPENROUTER_API_KEY: Get your key from OpenRouter

Optional (Direct Provider Access)

Set these only if you have provider credits you would rather spend than OpenRouter credits. Otherwise, prefer OpenRouter.

OPENAI_API_KEY, ANTHROPIC_API_KEY, GOOGLE_API_KEY, or any {PROVIDER}_API_KEY
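
The README does not say how the package discovers which direct-provider keys are present. As a rough illustration of the {PROVIDER}_API_KEY naming pattern, the sketch below scans an environment mapping for such variables. The function name detect_provider_keys and the decision to skip the OpenRouter and Azure keys are assumptions made for this example, not part of the package.

```python
import os

def detect_provider_keys(env=None):
    """Return provider names that have a non-empty {PROVIDER}_API_KEY set.

    Hypothetical helper: the package's real discovery logic is not documented here.
    """
    env = os.environ if env is None else env
    suffix = "_API_KEY"
    # Assumption: these two keys feed the OpenRouter and Azure routes instead.
    skip = {"OPENROUTER", "AZURE"}
    providers = []
    for name, value in env.items():
        if name.endswith(suffix) and value.strip():
            provider = name[: -len(suffix)]
            if provider and provider not in skip:
                providers.append(provider.lower())
    return sorted(providers)

if __name__ == "__main__":
    print(detect_provider_keys())
```

Running this before constructing an LLM can help confirm that the provider route has the keys you expect.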

Optional (Azure)

AZURE_API_KEY: Azure OpenAI API key
AZURE_API_BASE: Endpoint URL
AZURE_API_VERSION: API version
AZURE_API_MODELS: Comma-separated list of deployed models (e.g., "gpt-5,gpt-4.1,gpt-4.1-mini")
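
Since AZURE_API_MODELS is a comma-separated string, it has to be split into individual deployment names before it can serve as a catalog. The following is a minimal sketch of that step. The helper name parse_azure_models is hypothetical, and the handling of surrounding quotes and stray spaces is an assumption about what a tolerant parser might do, not a description of the package's behavior.

```python
import os

def parse_azure_models(raw):
    """Split a comma-separated deployment list, dropping blanks and outer quotes."""
    if not raw:
        return []
    # Some .env loaders keep the surrounding quotes; strip them defensively.
    cleaned = raw.strip().strip("\"'")
    return [name.strip() for name in cleaned.split(",") if name.strip()]

if __name__ == "__main__":
    print(parse_azure_models(os.environ.get("AZURE_API_MODELS")))
```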

3. Import and Use

After installation, you can import the package:

from all_the_llms import LLM

4. Example `.env` File

# OpenRouter (required for routing judge and fallback models)
OPENROUTER_API_KEY=...

# Direct provider access (optional - only use if you have free credits available 
# that you prefer over OpenRouter. Prefer OpenRouter otherwise.)
OPENAI_API_KEY=...
ANTHROPIC_API_KEY=...

# Azure (specified for Harvard Medical School)
AZURE_API_KEY=...
AZURE_API_BASE="https://azure-ai.hms.edu"
AZURE_API_VERSION="2024-10-21"
AZURE_API_MODELS="gpt-5,gpt-5-mini,gpt-4.1,gpt-4.1-mini,gpt-4.1-nano"

Usage

Basic Example

from all_the_llms import LLM

# Initialize an LLM - routing happens automatically
llm = LLM("gpt-4o")

# Make a completion request
response = llm.completion(
    messages=[{"role": "user", "content": "Hello, how are you?"}],
    temperature=0.7,
    max_tokens=100
)

print(response.choices[0].message.content)

Note: All LiteLLM features (streaming, tools, structured output, etc.) are supported through the completion() method's **kwargs. See the LiteLLM documentation for a complete list of available parameters.
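
As one illustration of passing a LiteLLM feature through **kwargs, the sketch below streams a reply. It assumes that completion() forwards stream=True to LiteLLM unchanged and that the result is an iterator of OpenAI-style chunks whose text lives at choices[0].delta.content, which is how LiteLLM streaming responses are shaped. The helper name stream_text is made up for this example.

```python
def stream_text(llm, prompt):
    """Print streamed text as it arrives and return the full reply.

    `llm` is any object with a completion(**kwargs) method, such as an LLM
    instance from this package (assuming stream=True is passed through).
    """
    pieces = []
    for chunk in llm.completion(
        messages=[{"role": "user", "content": prompt}],
        stream=True,
    ):
        delta = chunk.choices[0].delta.content
        if delta:  # some chunks (for example the last one) carry no text
            print(delta, end="", flush=True)
            pieces.append(delta)
    print()
    return "".join(pieces)
```

With the package installed and keys configured, this would be called as stream_text(LLM("gpt-4o"), "Hello, how are you?").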

Advanced Example with Pydantic Structured Output

from all_the_llms import LLM
from pydantic import BaseModel
from enum import Enum

# Define a Pydantic model for structured output
class CoffeeQuality(str, Enum):
    excellent = "excellent"
    terrible = "terrible"
    meh = "meh"

class CoffeeReview(BaseModel):
    quality: CoffeeQuality
    caffeine_level: int  # 1-10 scale
    complaints: list[str]
    verdict: str

llm = LLM("claude-sonnet-4.5")

# Make a completion request with structured output using response_format
response = llm.completion(
    messages=[{"role": "user", "content": "Review this coffee: 'It tastes like someone dissolved a tire in hot water and called it a day.'"}],
    response_format=CoffeeReview,
    temperature=0.3,
)

# Extract and validate the structured response
content = response.choices[0].message.content
review = CoffeeReview.model_validate_json(content)
print(f"Quality: {review.quality}")
print(f"Caffeine Level: {review.caffeine_level}/10")
print(f"Complaints: {', '.join(review.complaints)}")
print(f"Verdict: {review.verdict}")

Custom Routing Judge

By default, the system uses openrouter/openai/gpt-4o-mini as the routing judge (free as long as you have an OpenRouter API key). This default can be customized by passing a different model to the routing_judge parameter:

llm = LLM("gpt-5-2025-11-16", routing_judge="azure/gpt-4.1-mini")

What the Code Does

`LLM` Class

The LLM class is a thin wrapper that:

Resolves the model: Takes a user-friendly model name and resolves it to a concrete provider-specific model ID (e.g., "gpt-5-2025-11-16" to "azure/gpt-5", or "gpt-3.5-turbo" to "openrouter/openai/gpt-3.5-turbo")
Tests the connection: On initialization, sends a test request to verify the model is accessible and working
Exposes a simple API: Provides a completion() method that wraps litellm.completion() with the resolved model
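
The three responsibilities above can be pictured with a small stand-in class. This is a sketch under stated assumptions, not the package's source: SketchLLM, its resolve and complete parameters, and the "ping" test message are all hypothetical. In the real package the completion backend is litellm.completion and resolution is done by ModelRouter.

```python
from typing import Any, Callable

class SketchLLM:
    """Hypothetical stand-in for the three steps above; not the package's code."""

    def __init__(self, model_name: str,
                 resolve: Callable[[str], str],
                 complete: Callable[..., Any]):
        self._complete = complete
        # Step 1: map the friendly name to a concrete model ID.
        self.model_id = resolve(model_name)
        # Step 2: fail fast if the resolved model cannot answer a tiny request.
        try:
            self.completion(
                messages=[{"role": "user", "content": "ping"}],
                max_tokens=5,
            )
        except Exception as exc:
            raise RuntimeError(
                f"Could not get a valid response from {self.model_id}."
            ) from exc

    def completion(self, **kwargs: Any) -> Any:
        # Step 3: forward every keyword argument, adding the resolved model.
        return self._complete(model=self.model_id, **kwargs)
```

Injecting resolve and complete keeps the sketch runnable without network access. It also shows why a bad key or an empty credit balance surfaces at construction time rather than on the first real request.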

`ModelRouter` Class

The ModelRouter class handles intelligent routing:

Loads model catalogs:
- Azure models from AZURE_API_MODELS environment variable
- Provider models from LiteLLM's catalog (based on available API keys)
- OpenRouter models by querying the OpenRouter API
Exact matching: First tries to find exact matches in the catalogs (prioritizing Azure → Provider → OpenRouter). Model names are normalized (lowercase, whitespace removed) for matching.
LLM-based routing: If no exact match, uses a "routing judge" LLM to decide which route to use
Model resolution: Uses the routing judge again to map the requested model name to a specific model in the selected route
Fallback handling: If the selected route has no available models, falls back to other routes
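
The control flow described above (normalize, try exact matches in priority order, then consult the judge, then fall back) can be sketched as follows. The two judge calls are represented by plain callables, pick_route and pick_model, which are hypothetical names. In the package they are LLM requests, and the actual prompts and return types are not shown in this README.

```python
from typing import Callable, Dict, List, Optional, Tuple

ROUTES = ("azure", "provider", "openrouter")  # priority order from the text

def normalize(name: str) -> str:
    """Lowercase and remove all whitespace, as described for exact matching."""
    return "".join(name.lower().split())

def resolve(
    requested: str,
    catalogs: Dict[str, List[str]],
    pick_route: Callable[[str, Dict[str, List[str]]], str],
    pick_model: Callable[[str, List[str]], str],
) -> Optional[Tuple[str, str]]:
    wanted = normalize(requested)
    # 1. Exact match, highest-priority route first; the judge is not consulted.
    for route in ROUTES:
        for candidate in catalogs.get(route, []):
            if normalize(candidate) == wanted:
                return route, candidate
    # 2. No exact match: ask the judge for a route.
    preferred = pick_route(requested, catalogs)
    # 3. Try the judge's route, then the remaining routes if it has no models.
    ordered = [preferred] + [r for r in ROUTES if r != preferred]
    for route in ordered:
        models = catalogs.get(route, [])
        if models:
            # 4. Second judge call: map the request to a model on this route.
            return route, pick_model(requested, models)
    return None
```

Trying exact matches first means the common case costs no judge request. Only unfamiliar or versioned names such as "gpt-5-2025-11-16" reach the LLM-based step.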

Example Behavior

When you initialize an LLM, you'll see output like this:

Routing model gpt-5-2025-11-16 to valid LLM...
Selected route azure because the requested model 'gpt-5-2025-11-16' semantically matches the azure model 'gpt-5'.
Resolved gpt-5-2025-11-16 to azure/gpt-5
Testing LLM at azure/gpt-5
Successfully recieved response from gpt-5-2025-08-07

Routing Examples

Azure Route (when model matches Azure deployment):

Routing model gpt-5-2025-11-16 to valid LLM...
Selected route azure because the requested model 'gpt-5-2025-11-16' semantically matches the azure model 'gpt-5'.
Resolved gpt-5-2025-11-16 to azure/gpt-5

Provider Route (when direct API key is available):

Routing model claude-sonnet-4.5 to valid LLM...
Selected route provider because the requested model 'claude-sonnet-4.5' matches a model available from the provider with a direct api key.
Resolved claude-sonnet-4.5 to claude-sonnet-4-5

OpenRouter Route (fallback when no direct access):

Routing model gpt-3.5-turbo to valid LLM...
Selected route openrouter because requested model 'gpt-3.5-turbo' does not match any available azure models and there are no applicable provider options.
Resolved gpt-3.5-turbo to openrouter/openai/gpt-3.5-turbo
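
Judging only from the three "Resolved ... to ..." lines above, the final model ID appears to be the route's prefix followed by the catalog name: "azure/" for Azure, no prefix for a direct provider, and "openrouter/" for OpenRouter. The sketch below encodes that observation. The names ROUTE_PREFIX and to_model_id are hypothetical, and the package may build IDs differently in cases these logs do not cover.

```python
# Prefixes inferred from the log lines above; an assumption, not documented API.
ROUTE_PREFIX = {"azure": "azure/", "provider": "", "openrouter": "openrouter/"}

def to_model_id(route: str, model: str) -> str:
    """Build the resolved ID that appears after 'Resolved ... to'."""
    if route not in ROUTE_PREFIX:
        raise ValueError(f"unknown route: {route!r}")
    return ROUTE_PREFIX[route] + model

if __name__ == "__main__":
    for route, model in [
        ("azure", "gpt-5"),
        ("provider", "claude-sonnet-4-5"),
        ("openrouter", "openai/gpt-3.5-turbo"),
    ]:
        print(route, "->", to_model_id(route, model))
```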

Error Handling

The system validates each model on initialization. If a model fails, you'll see an error:

RuntimeError: Could not get a valid response from openrouter/deepseek/deepseek-r1-0528. 
litellm.APIError: APIError: OpenrouterException - {"error":{"message":"This request requires more credits, 
or fewer max_tokens. You requested up to 7168 tokens, but can only afford 4706..."}}

Common issues:

Insufficient credits: OpenRouter account needs more credits
Invalid API key: Check your environment variables
Model unavailable: The requested model may not be available on the selected route
Rate limiting: Provider may be rate limiting requests
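
Because a failed validation raises RuntimeError during initialization, a caller can catch it and try another model. The sketch below shows one way to do that. The helper first_working and its make_llm parameter are hypothetical. In practice make_llm would be the LLM class itself, for example first_working(["deepseek-r1-0528", "gpt-4o"], LLM).

```python
def first_working(model_names, make_llm):
    """Return (name, llm) for the first model that initializes cleanly.

    make_llm is any callable that builds a client from a model name and
    raises RuntimeError when the startup test request fails.
    """
    errors = {}
    for name in model_names:
        try:
            return name, make_llm(name)
        except RuntimeError as exc:
            errors[name] = str(exc)  # remember why this candidate was skipped
    details = "; ".join(f"{n}: {msg}" for n, msg in errors.items())
    raise RuntimeError(f"No model could be initialized ({details})")
```

Collecting the per-model messages keeps the original cause (credits, key, availability, rate limit) visible when every candidate fails.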

Release history

0.1.10: Dec 14, 2025
0.1.9: Dec 14, 2025
0.1.8: Dec 14, 2025
0.1.7: Dec 14, 2025
0.1.6: Dec 14, 2025
0.1.5: Dec 14, 2025
0.1.3: Nov 16, 2025
0.1.2: Nov 16, 2025
0.1.1: Nov 16, 2025 (this version)

Download files

Source Distribution

all_the_llms-0.1.1.tar.gz (13.9 kB view details)

Uploaded Nov 16, 2025 Source

Built Distribution

all_the_llms-0.1.1-py3-none-any.whl (11.6 kB view details)

Uploaded Nov 16, 2025 Python 3

File details

Details for the file all_the_llms-0.1.1.tar.gz.

File metadata

Download URL: all_the_llms-0.1.1.tar.gz
Upload date: Nov 16, 2025
Size: 13.9 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for all_the_llms-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`c7167f35bd2e040af5b93d7b0f2132a01c63032d0daca84462db30f5354288c8`
MD5	`37cac4996a8effb6cf62bed61efab908`
BLAKE2b-256	`51743055981425ee6228ee8bb4a627cb237e5dd329c36cb6adf7dd7720036845`

File details

Details for the file all_the_llms-0.1.1-py3-none-any.whl.

File metadata

Download URL: all_the_llms-0.1.1-py3-none-any.whl
Upload date: Nov 16, 2025
Size: 11.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.14.0

File hashes

Hashes for all_the_llms-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`0f0b5db15c3032955663c73c5b3932ac37dee130172cb226a809594ff51f74c5`
MD5	`e67ae1318b78e87c8f06d834d47d6af2`
BLAKE2b-256	`d1fab53c0d2af63bb815cc778f1569543f46f3ab828b33690ddd3d97b2abcc67`
