Multi-model routing and load balancing system with OpenAI-compatible API
Project description
Router-Maestro
A multi-model routing and load-balancing proxy with OpenAI-compatible and Anthropic-compatible APIs. Route LLM requests across GitHub Copilot, OpenAI, Anthropic, and custom providers with intelligent fallback and priority-based selection.
TL;DR
Use GitHub Copilot's models (Claude, GPT-4o, o3-mini) with Claude Code or any OpenAI/Anthropic-compatible client.
Router-Maestro acts as a proxy that gives you access to models from multiple providers through a unified API. Authenticate once with GitHub Copilot, and use its models anywhere that supports OpenAI or Anthropic APIs.
Features
- 1M context support: Activate Opus 4.6 with a 1M context window via GitHub Copilot by selecting `claude-opus-4-6[1m]` during `config claude-code` setup
- Fuzzy model matching: No need to type exact model IDs. Subagents, agent teams, and tools that hardcode model names (e.g. `opus-4-6`, `claude-sonnet-4.5`) are resolved automatically to the correct provider model
- Multi-provider support: GitHub Copilot (OAuth), OpenAI, Anthropic, and custom OpenAI-compatible endpoints
- Intelligent routing: Priority-based model selection with automatic fallback on failure
- Dual API compatibility: Both OpenAI (`/v1/...`) and Anthropic (`/v1/messages`) API formats
- Gemini API compatibility: Gemini REST API format (`/api/gemini/v1beta/...`) for the Gemini CLI/SDK
- Cross-provider translation: Seamlessly route OpenAI requests to Anthropic providers and vice versa
- Configuration hot-reload: Auto-reload config files every 5 minutes without server restart
- CLI management: Full command-line interface for configuration and server control
- Docker ready: Production-ready Docker images with Traefik integration
Quick Start
Get up and running in 4 steps:
1. Start the Server
Docker (recommended)
docker run -d -p 8080:8080 \
-v ~/.local/share/router-maestro:/home/maestro/.local/share/router-maestro \
-v ~/.config/router-maestro:/home/maestro/.config/router-maestro \
likanwen/router-maestro:latest
Install locally
pip install router-maestro
router-maestro server start --port 8080
2. Set Context (for Docker or Remote)
When the server runs in Docker or on a remote VPS, set up a context so the CLI can communicate with the containerized server:
pip install router-maestro # Install CLI locally
router-maestro context add docker --endpoint http://localhost:8080
router-maestro context set docker
What's a context? A context is a named connection profile (endpoint + API key) that lets you manage local or remote Router-Maestro servers. See Contexts for details.
3. Authenticate with GitHub Copilot
router-maestro auth login github-copilot
# Follow the prompts:
# 1. Visit https://github.com/login/device
# 2. Enter the displayed code
# 3. Authorize "GitHub Copilot Chat"
4. Configure Your CLI Tool
Claude Code
router-maestro config claude-code
# Follow the wizard to select models
OpenAI Codex (CLI, Extension, App)
router-maestro config codex
# Follow the wizard to select models
Gemini CLI
router-maestro config gemini
# Follow the wizard to select models
After configuration, set the API key environment variable:
# Get your API key
router-maestro server show-key
# Set the environment variable (add to your shell profile)
export ROUTER_MAESTRO_API_KEY="your-api-key-here"
Done! Now run `claude` or `codex` and your requests will route through Router-Maestro.
For production deployment, see the Deployment section.
Core Concepts
Model Identification
Models are identified using the format {provider}/{model-id}:
| Example | Description |
|---|---|
| `github-copilot/gpt-4o` | GPT-4o via GitHub Copilot |
| `github-copilot/claude-sonnet-4` | Claude Sonnet 4 via GitHub Copilot |
| `openai/gpt-4-turbo` | GPT-4 Turbo via OpenAI |
| `anthropic/claude-3-5-sonnet` | Claude 3.5 Sonnet via Anthropic |
Fuzzy matching: You don't need to type exact model IDs. Router-Maestro will fuzzy-match common variations:
| You type | Resolves to |
|---|---|
| `Opus 4.6` | `claude-opus-4-6-20250617` |
| `opus-4-6` | `claude-opus-4-6-20250617` |
| `claude-sonnet-4.5` | `claude-sonnet-4-5-20250929` |
| `anthropic/sonnet-4-5` | Sonnet 4.5 via Anthropic only |
When multiple versions match, the newest (by date suffix) is selected automatically.
Auto-Routing
Use the special model name router-maestro for automatic provider selection:
{"model": "router-maestro", "messages": [...]}
The router will try models in priority order and fall back to the next on failure.
Priority & Fallback
Priority determines which model is tried first when using auto-routing.
# Set priorities
router-maestro model priority github-copilot/claude-sonnet-4 --position 1
router-maestro model priority github-copilot/gpt-4o --position 2
# View priorities
router-maestro model priority list
Fallback triggers when a request fails with a retryable error (429, 5xx):
| Strategy | Behavior |
|---|---|
| `priority` | Try the next model in the priorities list |
| `same-model` | Try the same model on a different provider |
| `none` | Fail immediately |
Configure in ~/.config/router-maestro/priorities.json:
{
"priorities": ["github-copilot/claude-sonnet-4", "github-copilot/gpt-4o"],
"fallback": {"strategy": "priority", "maxRetries": 2}
}
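The retry behavior this config implies can be sketched as a function that computes the sequence of models to attempt. This is a hypothetical illustration; the real router may count `maxRetries` or order fallbacks differently:

```python
def attempt_order(requested: str, priorities: list[str],
                  strategy: str, max_retries: int) -> list[str]:
    """Sequence of models to try, assuming every attempt fails
    with a retryable error (429/5xx)."""
    if strategy == "none":
        return [requested]  # fail immediately, no fallback
    if strategy == "priority":
        # Continue down the priorities list after the requested model,
        # capped at max_retries additional attempts.
        rest = [m for m in priorities if m != requested]
        return ([requested] + rest)[: max_retries + 1]
    if strategy == "same-model":
        # Needs a catalog of which providers serve the same model;
        # omitted in this sketch.
        raise NotImplementedError
    raise ValueError(f"unknown strategy: {strategy}")

order = attempt_order(
    "github-copilot/claude-sonnet-4",
    ["github-copilot/claude-sonnet-4", "github-copilot/gpt-4o"],
    strategy="priority", max_retries=2,
)
print(order)  # ['github-copilot/claude-sonnet-4', 'github-copilot/gpt-4o']
```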
Cross-Provider Translation
Router-Maestro automatically translates between OpenAI and Anthropic formats:
# Use Anthropic API with OpenAI provider
POST /v1/messages {"model": "openai/gpt-4o", ...}
# Use OpenAI API with Anthropic provider
POST /v1/chat/completions {"model": "anthropic/claude-3-5-sonnet", ...}
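Translation means mapping between the two schemas: Anthropic puts the system prompt in a top-level `system` field and requires `max_tokens`, while OpenAI carries system prompts as ordinary messages. A simplified one-way sketch of that mapping (text content only, not the router's actual translator):

```python
def openai_to_anthropic(body: dict, default_max_tokens: int = 1024) -> dict:
    """Translate an OpenAI chat-completions body into the Anthropic
    /v1/messages shape (simplified: plain text content only)."""
    system_parts = [m["content"] for m in body["messages"]
                    if m["role"] == "system"]
    out = {
        "model": body["model"],
        # Anthropic requires max_tokens; fall back to a default
        "max_tokens": body.get("max_tokens", default_max_tokens),
        "messages": [m for m in body["messages"] if m["role"] != "system"],
    }
    if system_parts:
        # System prompts move to the top-level `system` field
        out["system"] = "\n".join(system_parts)
    return out

req = openai_to_anthropic({
    "model": "anthropic/claude-3-5-sonnet",
    "messages": [{"role": "system", "content": "Be brief."},
                 {"role": "user", "content": "Hello"}],
})
```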
Contexts
A context is a named connection profile that stores an endpoint URL and API key. Contexts let you manage multiple Router-Maestro deployments from a single CLI.
| Context | Use Case |
|---|---|
| `local` | Default context for `router-maestro server start` |
| `docker` | Connect to a local Docker container |
| `my-vps` | Connect to a remote VPS deployment |
# Add a context
router-maestro context add my-vps --endpoint https://api.example.com --api-key xxx
# Switch contexts
router-maestro context set my-vps
# All CLI commands now target the remote server
router-maestro model list
CLI Reference
Server
| Command | Description |
|---|---|
| `server start --port 8080` | Start the server |
| `server stop` | Stop the server |
| `server info` | Show server status |
Authentication
| Command | Description |
|---|---|
| `auth login [provider]` | Authenticate with a provider |
| `auth logout <provider>` | Remove authentication |
| `auth list` | List authenticated providers |
Models
| Command | Description |
|---|---|
| `model list` | List available models |
| `model refresh` | Refresh the models cache |
| `model priority list` | Show priorities |
| `model priority <model> --position <n>` | Set priority |
| `model fallback show` | Show fallback config |
Contexts (Remote Management)
| Command | Description |
|---|---|
| `context show` | Show current context |
| `context list` | List all contexts |
| `context set <name>` | Switch context |
| `context add <name> --endpoint <url> --api-key <key>` | Add remote context |
| `context test` | Test connection |
Other
| Command | Description |
|---|---|
| `config claude-code` | Generate Claude Code settings |
| `config codex` | Generate Codex config (CLI/Extension/App) |
| `config gemini` | Generate Gemini CLI `.env` |
API Reference
OpenAI-Compatible
# Chat completions
POST /v1/chat/completions
{
"model": "github-copilot/gpt-4o",
"messages": [{"role": "user", "content": "Hello"}],
"stream": false
}
# List models
GET /v1/models
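Any OpenAI-compatible client can call these endpoints. As an illustration, here is a dependency-free call using only Python's standard library; the base URL, API key, and model below are placeholders for your own values:

```python
import json
import urllib.request

def chat(base_url: str, api_key: str, model: str, messages: list) -> str:
    """POST /v1/chat/completions on an OpenAI-compatible server
    and return the assistant's reply text."""
    req = urllib.request.Request(
        f"{base_url}/v1/chat/completions",
        data=json.dumps(
            {"model": model, "messages": messages, "stream": False}
        ).encode(),
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
    )
    with urllib.request.urlopen(req) as resp:
        body = json.load(resp)
    # Standard OpenAI response shape: first choice's message content
    return body["choices"][0]["message"]["content"]

# Usage (assumes a Router-Maestro server on localhost:8080):
# chat("http://localhost:8080", "your-api-key-here",
#      "github-copilot/gpt-4o", [{"role": "user", "content": "Hello"}])
```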
Anthropic-Compatible
# Messages
POST /v1/messages
POST /api/anthropic/v1/messages
{
"model": "github-copilot/claude-sonnet-4",
"max_tokens": 1024,
"messages": [{"role": "user", "content": "Hello"}]
}
# Count tokens
POST /v1/messages/count_tokens
Admin
POST /api/admin/models/refresh # Refresh model cache
Gemini-Compatible
# Generate content (non-streaming)
POST /api/gemini/v1beta/models/{model}:generateContent
{
"contents": [{"role": "user", "parts": [{"text": "Hello"}]}]
}
# Stream generate content (SSE)
POST /api/gemini/v1beta/models/{model}:streamGenerateContent?alt=sse
{
"contents": [{"role": "user", "parts": [{"text": "Hello"}]}]
}
# Count tokens
POST /api/gemini/v1beta/models/{model}:countTokens
{
"contents": [{"role": "user", "parts": [{"text": "Hello"}]}]
}
Configuration
File Locations
Following XDG Base Directory specification:
| Type | File | Contents |
|---|---|---|
| Config | `~/.config/router-maestro/providers.json` | Custom provider definitions |
| Config | `~/.config/router-maestro/priorities.json` | Model priorities and fallback |
| Config | `~/.config/router-maestro/contexts.json` | Deployment contexts |
| Data | `~/.local/share/router-maestro/auth.json` | OAuth tokens |
| Data | `~/.local/share/router-maestro/server.json` | Server state |
Custom Providers
Add OpenAI-compatible providers in ~/.config/router-maestro/providers.json:
{
"providers": {
"ollama": {
"type": "openai-compatible",
"baseURL": "http://localhost:11434/v1",
"models": {
"llama3": {"name": "Llama 3"},
"mistral": {"name": "Mistral 7B"}
}
}
}
}
Set API keys via environment variables (uppercase, hyphens → underscores):
export OLLAMA_API_KEY="sk-..."
Hot-Reload
Configuration files are automatically reloaded every 5 minutes:
| File | Auto-Reload |
|---|---|
| `priorities.json` | ✓ (5 min) |
| `providers.json` | ✓ (5 min) |
| `auth.json` | Requires restart |
Force immediate reload:
router-maestro model refresh
Deployment
Architecture
```mermaid
graph TD
    Internet["🌐 Internet (HTTPS)"]
    subgraph VPS
        Traefik["Traefik (ports 80/443)\nAutomatic HTTPS · Let's Encrypt\nHTTP → HTTPS redirect"]
        RM["Router-Maestro (port 8080)\nOpenAI / Anthropic-compatible API\nMulti-provider routing"]
    end
    Providers["LLM Providers\nGitHub Copilot · OpenAI · Anthropic"]
    Internet -->|443| Traefik
    Traefik -->|8080| RM
    RM --> Providers
```
- Traefik — reverse proxy that handles TLS termination and auto-renews HTTPS certificates via Let's Encrypt. Only needed for public-facing deployments.
- Router-Maestro — the API server. Listens on port 8080, requires an API key for all requests, and routes them to configured LLM providers.
Option A: Simple Docker (No HTTPS)
Use when: local testing, running behind an existing reverse proxy (Nginx, Caddy, etc.), or on an internal network.
Prerequisites: Docker installed.
Step 1 — Generate an API key
export ROUTER_MAESTRO_API_KEY=$(openssl rand -hex 32)
echo "Save this key: $ROUTER_MAESTRO_API_KEY"
Step 2 — Start the container
docker run -d --name router-maestro \
-p 8080:8080 \
-e ROUTER_MAESTRO_API_KEY="$ROUTER_MAESTRO_API_KEY" \
-v ~/.local/share/router-maestro:/home/maestro/.local/share/router-maestro \
-v ~/.config/router-maestro:/home/maestro/.config/router-maestro \
likanwen/router-maestro:latest
Step 3 — Authenticate with GitHub Copilot
docker exec -it router-maestro router-maestro auth login github-copilot
# 1. Visit the URL shown
# 2. Enter the code
# 3. Authorize "GitHub Copilot Chat"
Step 4 — Verify
curl http://localhost:8080/health
# Expected: {"status":"ok"}
curl http://localhost:8080/api/openai/v1/models \
-H "Authorization: Bearer $ROUTER_MAESTRO_API_KEY"
# Expected: JSON list of available models
Option B: Production (Docker Compose + Traefik + HTTPS)
Use when: deploying to a public-facing VPS with a domain name. Provides automatic HTTPS via Let's Encrypt with Cloudflare DNS challenge.
Prerequisites:
- A VPS with Docker and Docker Compose installed
- A domain name (e.g., `api.example.com`) with DNS pointing to your VPS
- A Cloudflare account managing your domain's DNS (for automatic HTTPS)
Step 1 — Clone the repository
git clone https://github.com/MadSkittles/Router-Maestro.git
cd Router-Maestro
Step 2 — Configure environment variables
cp .env.example .env
Edit .env with your values:
| Variable | Description | Example |
|---|---|---|
| `DOMAIN` | Your domain pointing to this VPS | `api.example.com` |
| `CF_DNS_API_TOKEN` | Cloudflare API token with Zone:DNS:Edit permission | `abc123...` |
| `ACME_EMAIL` | Email for Let's Encrypt certificate expiry notifications | `you@example.com` |
| `ROUTER_MAESTRO_API_KEY` | API key clients use to authenticate. Generate with `openssl rand -hex 32` | `a1b2c3...` |
| `ROUTER_MAESTRO_LOG_LEVEL` | Log verbosity (`DEBUG`, `INFO`, `WARNING`, `ERROR`) | `INFO` |
| `TRAEFIK_DASHBOARD_AUTH` | (Optional) Basic auth for the Traefik dashboard. Generate with `htpasswd -nB admin`, then escape `$` as `$$` | `admin:$$2y$$05$$...` |
Step 3 — Start the services
docker compose up -d
This starts both Traefik (reverse proxy) and Router-Maestro. Traefik will automatically obtain an HTTPS certificate for your domain.
Step 4 — Authenticate with GitHub Copilot
docker compose exec router-maestro router-maestro auth login github-copilot
# 1. Visit the URL shown
# 2. Enter the code
# 3. Authorize "GitHub Copilot Chat"
Step 5 — Set up remote management (on your local machine)
pip install router-maestro # install CLI locally if not already installed
router-maestro context add my-vps \
--endpoint https://api.example.com \
--api-key YOUR_API_KEY
router-maestro context set my-vps
Now all CLI commands run against your VPS:
router-maestro model list # list models on VPS
router-maestro auth list # check auth status on VPS
router-maestro config claude-code # configure Claude Code to use VPS
Step 6 — Verify
curl https://api.example.com/health
# Expected: {"status":"ok"}
curl https://api.example.com/api/openai/v1/models \
-H "Authorization: Bearer YOUR_API_KEY"
# Expected: JSON list of available models
Remote Management
Contexts let you manage any Router-Maestro server (local or remote) from your local CLI:
# Add a remote server
router-maestro context add my-vps --endpoint https://api.example.com --api-key YOUR_KEY
# Switch between servers
router-maestro context set my-vps # target remote VPS
router-maestro context set local # target local server
# Test the connection
router-maestro context test
# All commands now target the active context
router-maestro model list
router-maestro auth login github-copilot
Advanced Configuration
For additional deployment options, see docs/deployment.md:
- Alternative DNS providers (AWS Route53, DigitalOcean, GoDaddy, Namecheap, etc.)
- HTTP challenge setup (when DNS challenge is not available)
- Traefik dashboard configuration and security
- Complete environment variables reference
License
MIT License - see LICENSE file.
Changelog
See CHANGELOG.md for release history.
Contributing
Contributions are welcome! Please feel free to submit a Pull Request.
Project details
Download files
File details
Details for the file router_maestro-0.1.26.tar.gz.
File metadata
- Download URL: router_maestro-0.1.26.tar.gz
- Upload date:
- Size: 222.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | `f1715aec0205d92cc000bbddfa8533d8d9602af3c2858655049e70250ac7c533` |
| MD5 | `655264618fee0a5e659e6a9f5ea8908c` |
| BLAKE2b-256 | `7278082add0b7f0410582b155326c73a50c59fc4b16e8969e99f557163695c37` |
Provenance
The following attestation bundles were made for router_maestro-0.1.26.tar.gz:
Publisher: `release.yml` on MadSkittles/Router-Maestro

- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: router_maestro-0.1.26.tar.gz
- Subject digest: f1715aec0205d92cc000bbddfa8533d8d9602af3c2858655049e70250ac7c533
- Sigstore transparency entry: 1203526232
- Permalink: MadSkittles/Router-Maestro@44e959292e7c6ad2709413fafbbc65e7f8078735
- Branch / Tag: refs/tags/v0.1.26
- Owner: https://github.com/MadSkittles
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@44e959292e7c6ad2709413fafbbc65e7f8078735
- Trigger Event: push
File details
Details for the file router_maestro-0.1.26-py3-none-any.whl.
File metadata
- Download URL: router_maestro-0.1.26-py3-none-any.whl
- Upload date:
- Size: 112.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | `a7c1e40850fd39b80f6aadf12d1857a46be8051b185e8700b1c86b2c90245fb7` |
| MD5 | `d71a21d1361efc376b435fa574bd7fdf` |
| BLAKE2b-256 | `f6df4ac76a96b02bcc1a38b822f7fd031ad2287486606f03cd4e0cda77075844` |
Provenance
The following attestation bundles were made for router_maestro-0.1.26-py3-none-any.whl:
Publisher: `release.yml` on MadSkittles/Router-Maestro

- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: router_maestro-0.1.26-py3-none-any.whl
- Subject digest: a7c1e40850fd39b80f6aadf12d1857a46be8051b185e8700b1c86b2c90245fb7
- Sigstore transparency entry: 1203526236
- Permalink: MadSkittles/Router-Maestro@44e959292e7c6ad2709413fafbbc65e7f8078735
- Branch / Tag: refs/tags/v0.1.26
- Owner: https://github.com/MadSkittles
- Access: public
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: release.yml@44e959292e7c6ad2709413fafbbc65e7f8078735
- Trigger Event: push