A lightweight AI model router for seamlessly switching between multiple AI providers (OpenAI, Anthropic, Google AI) through a unified API interface.
Project description
v-router
A unified LLM interface that provides automatic fallback between different LLM providers. Route your AI requests seamlessly across Anthropic, OpenAI, Google, and Azure with intelligent failover strategies and a consistent API.
Features
- Automatic Fallback: Seamless switching between models and providers when failures occur
- Unified API: Same interface works across all major LLM providers
- Smart Routing: Intelligent model selection based on availability and configuration
- Function Calling: Unified tool calling interface across all providers
- Multimodal Support: Send images and PDFs with automatic format conversion
- Consistent Responses: Standardized response format regardless of provider
- Flexible Configuration: Fine-tune parameters, backup models, and provider priorities
Installation
pip install v-router
Or with development dependencies:
uv sync --all-extras
Quick Start
Basic Conversation
from v_router import Client, LLM
from v_router import HumanMessage, AIMessage, SystemMessage
# Initialize client
client = Client(
    llm_config=LLM(
        model_name="claude-3-sonnet-20240229",
        provider="anthropic",
        max_tokens=500
    )
)
# Basic conversation
response = await client.messages.create(
    messages=[
        SystemMessage("You are a helpful assistant"),
        HumanMessage("What's the capital of France?")
    ]
)
print(response.content) # "The capital of France is Paris."
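Note that messages.create is a coroutine, so it must be awaited inside an event loop. A minimal runnable wrapper (using nothing beyond the documented API above):

import asyncio

async def main():
    response = await client.messages.create(
        messages=[
            SystemMessage("You are a helpful assistant"),
            HumanMessage("What's the capital of France?")
        ]
    )
    print(response.content)

asyncio.run(main())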
Response Format
All providers return a consistent AIMessage format:
# The response object
print(response.content) # "The capital of France is Paris." (or ["Part 1", "Part 2"] for multi-part)
print(response.model) # "claude-3-sonnet-20240229"
print(response.provider) # "anthropic"
print(response.usage.input_tokens) # 25
print(response.usage.output_tokens) # 8
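Because content can be a plain string or a list of parts, a small helper (hypothetical, not part of v-router) makes display code uniform:

def as_text(content) -> str:
    # response.content is a str, or a list of parts for multi-part responses
    if isinstance(content, str):
        return content
    return "\n".join(str(part) for part in content)

print(as_text(response.content))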
Multi-turn Conversations
# Start a conversation
messages = [
    SystemMessage("You are a helpful math tutor"),
    HumanMessage("What's 15% of 80?")
]
response = await client.messages.create(messages=messages)
print(response.content) # "To find 15% of 80..."
# Continue the conversation - just append the response
messages.append(response) # AIMessage is automatically handled
messages.append(HumanMessage("Can you show me another way to calculate it?"))
response2 = await client.messages.create(messages=messages)
print(response2.content) # "Sure! Another way to calculate 15% of 80..."
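The append-and-resend pattern scales to a full interactive chat. A minimal sketch using only the documented API (the model choice here is arbitrary):

import asyncio
from v_router import Client, LLM, HumanMessage, SystemMessage

async def chat():
    client = Client(llm_config=LLM(model_name="gpt-4", provider="openai"))
    messages = [SystemMessage("You are a helpful assistant")]
    while True:
        user_input = input("You: ")
        if user_input.lower() in {"quit", "exit"}:
            break
        messages.append(HumanMessage(user_input))
        response = await client.messages.create(messages=messages)
        messages.append(response)  # keep the AIMessage in the history
        print(f"Assistant: {response.content}")

asyncio.run(chat())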
Function Calling (Tool Use)
from v_router import Client, LLM, HumanMessage, ToolMessage
from v_router.classes.tools import ToolCall, Tools
from pydantic import BaseModel, Field
# Define your tool
class Calculator(BaseModel):
    expression: str = Field(..., description="Mathematical expression to evaluate")
calculator_tool = ToolCall(
    name="calculator",
    description="Evaluate mathematical expressions",
    input_schema=Calculator.model_json_schema()
)
# Create client with tools
client = Client(
    llm_config=LLM(
        model_name="gpt-4",
        provider="openai",
        tools=Tools(tools=[calculator_tool])
    )
)
# Ask a question that requires the tool
messages = [HumanMessage("What's 392 * 47?")]
response = await client.messages.create(messages=messages)
# Check if the model wants to use a tool
if response.tool_calls:
    tool_call = response.tool_calls[0]
    print(f"Tool: {tool_call.name}")       # "calculator"
    print(f"Arguments: {tool_call.args}")  # {"expression": "392 * 47"}

    # Execute the tool (your implementation; never eval untrusted input in production)
    result = eval(tool_call.args["expression"])  # 18424

    # Send tool result back
    messages.append(response)  # Add the AIMessage with tool_calls
    messages.append(
        ToolMessage(
            content=str(result),
            tool_call_id=tool_call.id
        )
    )

    # Get final response
    final_response = await client.messages.create(messages=messages)
    print(final_response.content)  # "392 * 47 = 18,424"
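In the example above, eval is convenient but unsafe on model-generated input. A safer drop-in (an illustrative helper, not part of v-router) walks the expression's AST and only permits basic arithmetic:

import ast
import operator

_OPS = {
    ast.Add: operator.add,
    ast.Sub: operator.sub,
    ast.Mult: operator.mul,
    ast.Div: operator.truediv,
}

def safe_eval(expression: str) -> float:
    # Evaluate +, -, *, / over numeric literals; reject everything else
    def _eval(node):
        if isinstance(node, ast.Constant) and isinstance(node.value, (int, float)):
            return node.value
        if isinstance(node, ast.BinOp) and type(node.op) in _OPS:
            return _OPS[type(node.op)](_eval(node.left), _eval(node.right))
        if isinstance(node, ast.UnaryOp) and isinstance(node.op, ast.USub):
            return -_eval(node.operand)
        raise ValueError(f"Unsupported expression: {expression!r}")
    return _eval(ast.parse(expression, mode="eval").body)

result = safe_eval(tool_call.args["expression"])  # 18424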
Multimodal Messages
Sending Images
from v_router import Client, LLM, HumanMessage
from v_router.classes.messages import TextContent, ImageContent
import base64
client = Client(
    llm_config=LLM(
        model_name="gpt-4o",
        provider="openai"
    )
)
# Method 1: Direct file path (auto-converted to base64)
response = await client.messages.create(
    messages=[
        HumanMessage("image.jpg"),  # Just pass the file path
        HumanMessage("What's in this image?")
    ]
)
# Method 2: Explicit multimodal content
with open("chart.png", "rb") as f:
image_data = base64.b64encode(f.read()).decode("utf-8")
response = await client.messages.create(
    messages=[
        HumanMessage([
            TextContent(text="Analyze this chart and tell me the trend:"),
            ImageContent(data=image_data, media_type="image/png")
        ])
    ]
)
print(response.content) # "The chart shows an upward trend..."
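If you build ImageContent blocks in several places, a small helper (the name load_image is hypothetical, not part of v-router) keeps the base64 plumbing in one spot:

import base64
import mimetypes
from v_router.classes.messages import ImageContent

def load_image(path: str) -> ImageContent:
    # Guess the media type from the file extension; default to PNG
    media_type = mimetypes.guess_type(path)[0] or "image/png"
    with open(path, "rb") as f:
        data = base64.b64encode(f.read()).decode("utf-8")
    return ImageContent(data=data, media_type=media_type)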
Working with Documents (PDFs)
from v_router.classes.messages import DocumentContent
# PDFs work similarly to images
with open("report.pdf", "rb") as f:
pdf_data = base64.b64encode(f.read()).decode("utf-8")
response = await client.messages.create(
    messages=[
        HumanMessage([
            TextContent(text="Summarize this report:"),
            DocumentContent(
                data=pdf_data,
                media_type="application/pdf"
            )
        ])
    ]
)
# Or just use the file path
response = await client.messages.create(
    messages=[
        HumanMessage("report.pdf"),  # Auto-converted
        HumanMessage("What are the key findings?")
    ]
)
Automatic Fallback
Configure backup models to ensure reliability:
from v_router import Client, LLM, BackupModel
llm_config = LLM(
    model_name="claude-3-opus-20240229",  # Primary model
    provider="anthropic",
    backup_models=[
        BackupModel(
            model=LLM(model_name="gpt-4", provider="openai"),
            priority=1  # First fallback
        ),
        BackupModel(
            model=LLM(model_name="gemini-1.5-pro", provider="google"),
            priority=2  # Second fallback
        )
    ]
)
client = Client(llm_config)
# If Claude fails, automatically tries GPT-4, then Gemini
response = await client.messages.create(
    messages=[HumanMessage("Hello!")]
)
print(f"Response from: {response.provider}") # Shows which provider succeeded
Cross-Provider Switching
Enable automatic cross-provider fallback for the same model:
llm_config = LLM(
    model_name="claude-3-sonnet-20240229",
    provider="vertexai",  # Try Vertex AI first
    try_other_providers=True  # Fall back to direct Anthropic if needed
)
client = Client(llm_config)
# If Vertex AI fails, automatically tries the same model on Anthropic
response = await client.messages.create(
    messages=[HumanMessage("Hello!")]
)
Provider-Specific Parameters
v-router supports provider-specific features through flexible parameters:
# Anthropic - Thinking mode
response = await client.messages.create(
    messages=[HumanMessage("Solve this complex problem: ...")],
    timeout=600,
    thinking={
        "type": "enabled",
        "budget_tokens": 10000
    }
)
# OpenAI - JSON mode
response = await client.messages.create(
    messages=[HumanMessage("Return a JSON object with name and age")],
    response_format={"type": "json_object"},
    seed=12345
)
# Google - Safety settings
response = await client.messages.create(
    messages=[HumanMessage("Tell me about...")],
    safety_settings=[{
        "category": "HARM_CATEGORY_DANGEROUS_CONTENT",
        "threshold": "BLOCK_ONLY_HIGH"
    }]
)
Supported Providers
| Provider | Models | Features |
|---|---|---|
| Anthropic | Claude 3 (Opus, Sonnet, Haiku), Claude 4 (Opus, Sonnet) | Function calling, Images, PDFs |
| OpenAI | GPT-4, GPT-4 Turbo, GPT-4o, GPT-3.5 | Function calling, Images |
| Google | Gemini Pro, Gemini 1.5 (Pro, Flash), Gemini 2.0 Flash | Function calling, Images, PDFs |
| Azure OpenAI | GPT-4, GPT-4 Turbo, GPT-4o, GPT-3.5 | Function calling, Images |
| Vertex AI | Claude 3/4 & Gemini models via Google Cloud | Function calling, Images, PDFs |
Configuration
Environment Variables
Set up authentication for your providers:
# Anthropic
export ANTHROPIC_API_KEY="your-key-here"
# OpenAI
export OPENAI_API_KEY="your-key-here"
# Google AI Studio
export GOOGLE_API_KEY="your-key-here"
# Google Cloud (for Vertex AI)
export GOOGLE_APPLICATION_CREDENTIALS="path/to/service-account.json"
export GCP_PROJECT_ID="your-project-id"
export GCP_LOCATION="us-central1"
# Azure OpenAI
export AZURE_OPENAI_API_KEY="your-key-here"
export AZURE_OPENAI_ENDPOINT="your-endpoint"
# LangFuse (optional - for tracing)
export LANGFUSE_HOST="your-langfuse-host"
export LANGFUSE_PUBLIC_KEY="your-public-key"
export LANGFUSE_SECRET_KEY="your-secret-key"
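A quick startup check catches missing credentials before the first request; adjust the list to the providers you actually use (variable names as documented above):

import os

required = ["ANTHROPIC_API_KEY", "OPENAI_API_KEY"]  # only the providers you rely on
missing = [name for name in required if not os.environ.get(name)]
if missing:
    raise RuntimeError(f"Missing credentials: {', '.join(missing)}")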
LangFuse Tracing Support
v-router includes built-in support for LangFuse tracing:
LLM Request
├── Primary Model (claude-3-opus on anthropic) - FAILED
│   ├── Error: Rate limit exceeded
│   └── Duration: 150ms
└── Backup Model (gpt-4 on openai) - SUCCESS
    ├── Input tokens: 45
    ├── Output tokens: 128
    ├── Duration: 2.3s
    └── Cost: $0.0043
Enable by setting the LANGFUSE_HOST environment variable. No code changes required.
Model Configuration
v-router uses a models.yml file to map model names across providers:
# These all work automatically:
LLM(model_name="claude-3-sonnet", provider="anthropic") # โ claude-3-sonnet-20240229
LLM(model_name="claude-3-sonnet", provider="vertexai") # โ claude-3-sonnet@20240229
LLM(model_name="gpt-4", provider="openai") # โ gpt-4
LLM(model_name="gemini-1.5-pro", provider="google") # โ gemini-1.5-pro-latest
Documentation
Complete documentation is available online:
Full Documentation
Jupyter Notebooks
Explore the examples/ directory for interactive examples:
- quickstart_models.ipynb: Basic usage, fallbacks, and cross-provider switching
- quickstart_tool_calling.ipynb: Function calling across providers
- multimodal_content.ipynb: Working with images and PDFs
Development
Setup
# Install with development dependencies
uv sync --all-extras
# Install pre-commit hooks
uv run pre-commit install
Testing
# Run all tests
uv run pytest
# Run specific test file
uv run pytest tests/models/test_llm.py
# Run with verbose output
uv run pytest -v
Code Quality
# Check code style
uv run ruff check .
# Auto-fix issues
uv run ruff check --fix .
# Format code
uv run ruff format .
Architecture
v-router follows a clean provider pattern:
- Client: Main entry point with unified API
- Router: Handles request routing and fallback logic
- Providers: Individual provider implementations
- Models: Unified request/response models
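For intuition, here is a highly simplified sketch of that pattern; the class and method names are illustrative only and are not v-router's actual internals:

from abc import ABC, abstractmethod

class BaseProvider(ABC):
    """One concrete subclass per LLM vendor."""

    @abstractmethod
    async def create_message(self, messages, **kwargs):
        """Call the vendor API and return a unified response."""

class Router:
    """Tries providers in priority order, falling back on failure."""

    def __init__(self, providers):
        self.providers = providers  # ordered: primary first, then backups

    async def route(self, messages, **kwargs):
        last_error = None
        for provider in self.providers:
            try:
                return await provider.create_message(messages, **kwargs)
            except Exception as exc:  # any failure: try the next provider
                last_error = exc
        raise RuntimeError("All providers failed") from last_error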
License
This project is licensed under the MIT License.
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Support
- Issues: GitHub Issues
- Documentation: See the examples in the examples/ directory
- Email: ben@vectrix.ai
v-router - Making LLM integration simple, reliable, and unified across all providers.
Download files
File details
Details for the file v_router-0.0.27.tar.gz.
File metadata
- Download URL: v_router-0.0.27.tar.gz
- Upload date:
- Size: 1.2 MB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.7.13
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 5998a5c3134c30ce2d683a2a99a3138bad9afc106e98bc87118c7439f9151466 |
| MD5 | 47ae977543b68cc492b01e6057e6e238 |
| BLAKE2b-256 | c6c9e305ca8a5952b1a9e7fe7cde54f272bc9ff1c72121c491e505593a547e36 |
File details
Details for the file v_router-0.0.27-py3-none-any.whl.
File metadata
- Download URL: v_router-0.0.27-py3-none-any.whl
- Upload date:
- Size: 30.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: uv/0.7.13
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 157102d160e79cc415105847381677499fc3f308f629cb260668e2f3144021c4 |
| MD5 | bc9746ab318766bca9d9175ead65eba7 |
| BLAKE2b-256 | 81069f5f22a9f6e0a5e6cbf2bab47edc7194abad020f8342a95bced74aa0022f |