Skip to main content

Unified interface for Russian LLMs with intelligent routing and fallback

Project description

Multi-LLM Orchestrator

Python License PyPI Coverage Tests

A unified interface for orchestrating multiple Large Language Model providers with intelligent routing and fallback mechanisms.

Overview

The Multi-LLM Orchestrator provides a seamless way to integrate and manage multiple LLM providers through a single, consistent interface. It supports intelligent routing strategies, automatic fallbacks, and provider-specific optimizations. Currently focused on Russian LLM providers (GigaChat, YandexGPT) with a flexible architecture that supports any LLM provider implementation.

Quickstart

Get started with Multi-LLM Orchestrator in minutes:

Using MockProvider (Testing)

import asyncio
from orchestrator import Router
from orchestrator.providers import ProviderConfig, MockProvider

async def main():
    # Initialize router with round-robin strategy
    router = Router(strategy="round-robin")
    
    # Add providers
    for i in range(3):
        config = ProviderConfig(name=f"provider-{i+1}", model="mock-normal")
        router.add_provider(MockProvider(config))
    
    # Make a request
    response = await router.route("What is Python?")
    print(response)
    # Output: Mock response to: What is Python?

if __name__ == "__main__":
    asyncio.run(main())

Using GigaChatProvider (Production)

import asyncio
from orchestrator import Router
from orchestrator.providers import ProviderConfig, GigaChatProvider

async def main():
    # Create GigaChat provider
    config = ProviderConfig(
        name="gigachat",
        api_key="your_authorization_key_here",  # OAuth2 authorization key
        model="GigaChat",  # or "GigaChat-Pro", "GigaChat-Plus"
        scope="GIGACHAT_API_PERS"  # or "GIGACHAT_API_CORP" for corporate
    )
    provider = GigaChatProvider(config)
    
    # Use with router
    router = Router(strategy="round-robin")
    router.add_provider(provider)
    
    # Generate response
    response = await router.route("What is Python?")
    print(response)

if __name__ == "__main__":
    asyncio.run(main())

Using YandexGPTProvider (Production)

import asyncio
from orchestrator import Router
from orchestrator.providers import ProviderConfig, YandexGPTProvider

async def main():
    # Create YandexGPT provider
    config = ProviderConfig(
        name="yandexgpt",
        api_key="your_iam_token_here",  # IAM token (valid for 12 hours)
        folder_id="your_folder_id_here",  # Yandex Cloud folder ID
        model="yandexgpt/latest"  # or "yandexgpt-lite/latest"
    )
    provider = YandexGPTProvider(config)
    
    # Use with router
    router = Router(strategy="round-robin")
    router.add_provider(provider)
    
    # Generate response
    response = await router.route("What is Python?")
    print(response)

if __name__ == "__main__":
    asyncio.run(main())

The MockProvider simulates LLM behavior without requiring API credentials, while GigaChatProvider and YandexGPTProvider provide full integration with their respective APIs.

Installation

Requirements:

  • Python 3.11+
  • Poetry (recommended) or pip

Using Poetry

# Clone the repository
git clone https://github.com/MikhailMalorod/Multi-LLM-Orchestrator.git
cd Multi-LLM-Orchestrator

# Install dependencies
poetry install

Using pip

# Clone the repository
git clone https://github.com/MikhailMalorod/Multi-LLM-Orchestrator.git
cd Multi-LLM-Orchestrator

# Install in development mode
pip install -e .

Architecture

The Multi-LLM Orchestrator follows a modular architecture with clear separation of concerns:

┌──────────────────────────────────────────────┐
│              User Application                │
└─────────────────┬────────────────────────────┘
                  │
                  ▼
         ┌────────────────┐
         │     Router      │ ◄── Strategy: round-robin/random/first-available
         └────────┬───────┘
                  │
      ┌───────────┼───────────┐
      ▼           ▼           ▼
┌──────────┐ ┌──────────┐ ┌──────────┐
│Provider 1│ │Provider 2│ │Provider 3│
│(Base)    │ │(Base)    │ │(Base)    │
└────┬─────┘ └────┬─────┘ └────┬─────┘
     │            │            │
     ▼            ▼            ▼
   (API)        (API)        (API)

Components

  • Router (src/orchestrator/router.py): Manages provider selection based on routing strategy and handles automatic fallback when providers fail.

  • BaseProvider (src/orchestrator/providers/base.py): Abstract base class defining the interface that all provider implementations must follow. Includes configuration models (ProviderConfig, GenerationParams) and exception hierarchy.

  • MockProvider (src/orchestrator/providers/mock.py): Test implementation that simulates LLM behavior without making actual API calls. Supports various simulation modes for testing different scenarios.

  • Config (src/orchestrator/config.py): Future component for loading configuration from environment variables. Currently used for planned real provider integrations (GigaChat, YandexGPT).

Routing Strategies

The Router supports three routing strategies, each suitable for different use cases:

Strategy Description Use Case
round-robin Cycles through providers in a fixed order Equal load distribution (recommended for production)
random Selects a random provider from available providers Simple random selection for load balancing
first-available Selects the first healthy provider based on health checks High availability scenarios with automatic unhealthy provider skipping

The strategy is selected when initializing the Router:

router = Router(strategy="round-robin")  # or "random" or "first-available"

Run the Demo

See the routing strategies and fallback mechanisms in action:

python examples/routing_demo.py

No API keys required — uses MockProvider for demonstration.

The demo showcases:

  • All three routing strategies (round-robin, random, first-available)
  • Automatic fallback mechanism when providers fail
  • Error handling when all providers are unavailable

See routing_demo.py for the complete interactive demonstration.

MockProvider Modes

MockProvider simulates various LLM behaviors for testing without requiring API credentials:

  • mock-normal — Returns successful responses with a small delay
  • mock-timeout — Simulates timeout errors
  • mock-unhealthy — Health check returns False (useful for testing first-available strategy)
  • mock-ratelimit — Simulates rate limit errors
  • mock-auth-error — Simulates authentication failures

See mock.py for all available modes and detailed documentation.

Roadmap

See STRATEGY.md for the detailed roadmap and development plan.

Current Status

  • ✅ Core architecture with Router and BaseProvider
  • ✅ MockProvider for testing
  • ✅ GigaChatProvider with OAuth2 authentication
  • ✅ Three routing strategies (round-robin, random, first-available)
  • ✅ Automatic fallback mechanism
  • ✅ Example demonstrations

Supported Providers

  • MockProvider — For testing and development
  • GigaChatProvider — Full integration with GigaChat (Sber) API
    • OAuth2 authentication with automatic token refresh
    • Support for all generation parameters
    • Comprehensive error handling
  • YandexGPTProvider — Full integration with YandexGPT (Yandex Cloud) API
    • IAM token authentication (user-managed, 12-hour validity)
    • Support for temperature and maxTokens parameters
    • Support for yandexgpt/latest and yandexgpt-lite/latest models
    • Comprehensive error handling

Planned Providers

  • Ollama (local models)

Documentation

  • STRATEGY.md — Project roadmap and development plan
  • routing_demo.py — Interactive demonstration of routing strategies and fallback mechanisms

Contributing

Contributions are welcome! Please feel free to submit a Pull Request. For major changes, please open an issue first to discuss what you would like to change.

  1. Fork the repository
  2. Create your feature branch (git checkout -b feature/amazing-feature)
  3. Commit your changes (git commit -m 'Add some amazing feature')
  4. Push to the branch (git push origin feature/amazing-feature)
  5. Open a Pull Request

License

This project is licensed under the MIT License - see the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

multi_llm_orchestrator-0.2.0.tar.gz (20.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

multi_llm_orchestrator-0.2.0-py3-none-any.whl (25.7 kB view details)

Uploaded Python 3

File details

Details for the file multi_llm_orchestrator-0.2.0.tar.gz.

File metadata

  • Download URL: multi_llm_orchestrator-0.2.0.tar.gz
  • Upload date:
  • Size: 20.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/2.2.1 CPython/3.13.7 Windows/11

File hashes

Hashes for multi_llm_orchestrator-0.2.0.tar.gz
Algorithm Hash digest
SHA256 5193b9129f614776d9ff8098313dfcdf2753c8b9f094f3625f63746382968d83
MD5 b269ba17e7b17a917e40ba31aa3a306f
BLAKE2b-256 e124dddb900190c6009056d2bff8f830dd2237ed9678cd782eb67151bd93b4b0

See more details on using hashes here.

File details

Details for the file multi_llm_orchestrator-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for multi_llm_orchestrator-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e6c6da9605febf735f8a88226634b6e71b62ef08c298724586902bead9f2b3ac
MD5 049576eccab663c12fcd14bb58b1bdc2
BLAKE2b-256 fc1f2263e2fc2a5d087af87fee7a7cacd6d7474714847ce5b99f88850fb9b36f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page