LLM Gateway Proxy with multi-provider fallback

ModelSwitch - LLM Gateway Proxy

ModelSwitch is an LLM gateway proxy that exposes OpenAI-compatible and Anthropic-compatible APIs and switches automatically among multiple backend providers.

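Key features

Multi-protocol support: exposes OpenAI-compatible and Anthropic-compatible APIs side by side, with bidirectional Tool Use conversion
Smart routing: automatic fallback in Chain mode, Circuit Breaker protection, and first-chunk probing for streams
Hot config reload: config.yaml is watched with watchdog, so changes take effect as soon as the file is saved
System service: one-command installation for systemd (Linux) and launchd (macOS)
Workspace: all runtime data (config, logs, database) is kept in one directory, ~/.modelswitch by default
Usage tracking: persisted in SQLite, with statistics broken down by provider, model, and API Key
Conversation logs: full requests and responses are written to JSONL and can be replayed in the Web UI
Web admin UI: 6 functional tabs, switchable between Chinese and English
API Key management: per-minute and daily rate limits, plus a model whitelist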
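
The smart-routing feature combines a fallback chain with circuit breaking. This README does not show how ModelSwitch implements it, so the following is only a minimal sketch of the general technique. `CircuitBreaker`, `call_chain`, the failure threshold, and the cooldown are hypothetical names and values chosen for illustration.

```python
import time
from dataclasses import dataclass
from typing import Callable, Dict, List, Optional, Tuple


@dataclass
class CircuitBreaker:
    """Hypothetical breaker: opens after `threshold` consecutive failures."""
    threshold: int = 3
    cooldown: float = 30.0          # seconds to stay open before a trial call
    failures: int = 0
    opened_at: Optional[float] = None

    def allow(self, now: float) -> bool:
        if self.opened_at is None:
            return True
        # Once the cooldown has elapsed, let one trial call through
        return now - self.opened_at >= self.cooldown

    def record(self, ok: bool, now: float) -> None:
        if ok:
            self.failures, self.opened_at = 0, None
            return
        self.failures += 1
        if self.failures >= self.threshold:
            self.opened_at = now


def call_chain(
    providers: List[Tuple[str, Callable[[str], str]]],
    breakers: Dict[str, CircuitBreaker],
    prompt: str,
    clock: Callable[[], float] = time.monotonic,
) -> Tuple[str, str]:
    """Try providers in order; skip open breakers; return (provider, reply)."""
    errors = []
    for name, call in providers:
        breaker = breakers.setdefault(name, CircuitBreaker())
        if not breaker.allow(clock()):
            errors.append(f"{name}: circuit open")
            continue
        try:
            reply = call(prompt)
        except Exception as exc:  # any upstream failure triggers fallback
            breaker.record(False, clock())
            errors.append(f"{name}: {exc}")
            continue
        breaker.record(True, clock())
        return name, reply
    raise RuntimeError("all providers failed: " + "; ".join(errors))


# Stand-in providers for the demo (not real upstream clients)
def flaky(prompt: str) -> str:
    raise ConnectionError("primary is down")


def healthy(prompt: str) -> str:
    return "echo: " + prompt


breakers: Dict[str, CircuitBreaker] = {}
chain = [("primary", flaky), ("backup", healthy)]
served_by, reply = call_chain(chain, breakers, "hello")
```

Keeping one breaker per provider name means a provider that keeps failing is skipped without a network round trip until its cooldown expires.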

Quick start

# Install
pip install modelswitch

# The first run creates ~/.modelswitch/config.yaml automatically
modelswitch --start

# Or specify a workspace directory
modelswitch --workspace /data/modelswitch --start

# Health check
curl http://localhost:8000/api/config/health

Install from source

git clone https://github.com/ddmonster/modelswitch.git
cd modelswitch
pip install -e ".[dev]"

Install as a system service

# Install the systemd (Linux) or launchd (macOS) service
modelswitch --install

# Check status
modelswitch --status

# Uninstall the service
modelswitch --uninstall

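API Key usage guide

Authentication

The gateway accepts any one of three authentication methods:

# Method 1: Authorization Bearer (recommended)
-H "Authorization: Bearer sk-your-api-key"

# Method 2: x-api-key header
-H "x-api-key: sk-your-api-key"

# Method 3: the bare sk- prefixed value in the Authorization header
-H "Authorization: sk-your-api-key"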
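
To make the three header forms concrete, here is a minimal sketch of how a server could pull the key out of a request. The function name `extract_api_key` and the order in which the headers are checked are assumptions; the README only states that any one of the three forms is accepted.

```python
from typing import Mapping, Optional


def extract_api_key(headers: Mapping[str, str]) -> Optional[str]:
    """Return the API key from any of the three documented header forms."""
    # HTTP header names are case-insensitive, so normalise them first
    lowered = {name.lower(): value.strip() for name, value in headers.items()}
    auth = lowered.get("authorization", "")

    # Method 1: "Authorization: Bearer <key>"
    if auth.lower().startswith("bearer "):
        return auth[len("bearer "):].strip() or None

    # Method 3: "Authorization: sk-..." with no scheme
    if auth.startswith("sk-"):
        return auth

    # Method 2: "x-api-key: <key>"
    return lowered.get("x-api-key") or None
```

For example, `extract_api_key({"X-API-Key": "sk-abc"})` and `extract_api_key({"Authorization": "Bearer sk-abc"})` both yield the same key string.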

Available endpoints

Endpoint	Protocol	Description
`POST /openai/chat/completions`	OpenAI	Chat completions
`POST /v1/chat/completions`	OpenAI	Chat completions (backward compatible)
`GET /openai/models`	OpenAI	Model list
`POST /anthropic/messages`	Anthropic	Messages API
`POST /v1/messages`	Anthropic	Messages API (backward compatible)

Usage examples

Calling with the OpenAI protocol

curl -s http://localhost:8000/openai/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "glm-5",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 100
  }'

Calling with the Anthropic protocol

curl -s http://localhost:8000/anthropic/messages \
  -H "x-api-key: YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "glm-5",
    "messages": [{"role": "user", "content": "Hello"}],
    "max_tokens": 100
  }'

Streaming

curl -s http://localhost:8000/openai/chat/completions \
  -H "Authorization: Bearer YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "glm-5",
    "messages": [{"role": "user", "content": "Tell me a story"}],
    "max_tokens": 500,
    "stream": true
  }'
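
With `"stream": true`, the OpenAI protocol returns server-sent events: each `data:` line carries a JSON chunk and the stream ends with `data: [DONE]`. Assuming the gateway relays that format unchanged (the README calls the endpoint OpenAI-compatible but does not show a sample stream), the text can be reassembled as in this sketch. `iter_stream_text` and the sample lines are illustrative, not captured gateway output.

```python
import json
from typing import Iterable, Iterator


def iter_stream_text(lines: Iterable[str]) -> Iterator[str]:
    """Yield the text fragments carried by OpenAI-style SSE lines."""
    for raw in lines:
        line = raw.strip()
        if not line.startswith("data:"):
            continue  # blank separators and SSE comments carry no payload
        payload = line[len("data:"):].strip()
        if payload == "[DONE]":
            break  # end-of-stream sentinel
        chunk = json.loads(payload)
        for choice in chunk.get("choices", []):
            text = (choice.get("delta") or {}).get("content")
            if text:
                yield text


# Hand-written sample in the OpenAI chunk shape (not real gateway output)
sample_lines = [
    'data: {"choices": [{"delta": {"role": "assistant"}}]}',
    "",
    'data: {"choices": [{"delta": {"content": "Once "}}]}',
    'data: {"choices": [{"delta": {"content": "upon a time"}}]}',
    "data: [DONE]",
]
story = "".join(iter_stream_text(sample_lines))
```

The same function can be fed the decoded lines of a live HTTP response instead of the sample list.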

Tool Use (function calling)

curl -s http://localhost:8000/anthropic/messages \
  -H "x-api-key: YOUR_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "model": "glm-5",
    "messages": [{"role": "user", "content": "What is the weather like in Beijing?"}],
    "max_tokens": 500,
    "tools": [{
      "name": "get_weather",
      "description": "Get the weather for a city",
      "input_schema": {
        "type": "object",
        "properties": {
          "city": {"type": "string", "description": "City name"}
        },
        "required": ["city"]
      }
    }]
  }'
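
The feature list mentions bidirectional Tool Use conversion. The two public protocols describe a tool differently: Anthropic uses `name`, `description`, and `input_schema` at the top level, while OpenAI wraps the definition in `{"type": "function", "function": {...}}` with the schema under `parameters`. The sketch below converts tool definitions only; the function names are hypothetical and this is not ModelSwitch's own converter, which also has to translate tool calls and results.

```python
from typing import Any, Dict

EMPTY_SCHEMA: Dict[str, Any] = {"type": "object", "properties": {}}


def anthropic_tool_to_openai(tool: Dict[str, Any]) -> Dict[str, Any]:
    """Map an Anthropic tool definition to the OpenAI `tools` entry shape."""
    return {
        "type": "function",
        "function": {
            "name": tool["name"],
            "description": tool.get("description", ""),
            "parameters": tool.get("input_schema", EMPTY_SCHEMA),
        },
    }


def openai_tool_to_anthropic(tool: Dict[str, Any]) -> Dict[str, Any]:
    """Map an OpenAI `tools` entry back to the Anthropic tool shape."""
    fn = tool["function"]
    return {
        "name": fn["name"],
        "description": fn.get("description", ""),
        "input_schema": fn.get("parameters", EMPTY_SCHEMA),
    }


# The weather tool from the curl example, as a Python dict
weather_tool = {
    "name": "get_weather",
    "description": "Get the weather for a city",
    "input_schema": {
        "type": "object",
        "properties": {"city": {"type": "string", "description": "City name"}},
        "required": ["city"],
    },
}
as_openai = anthropic_tool_to_openai(weather_tool)
round_trip = openai_tool_to_anthropic(as_openai)
```

Because both formats carry a JSON Schema object, the schema itself passes through untouched and only the wrapper changes.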

Python SDK examples

OpenAI SDK

from openai import OpenAI

client = OpenAI(
    api_key="YOUR_API_KEY",
    base_url="http://localhost:8000/openai"
)

response = client.chat.completions.create(
    model="glm-5",
    messages=[{"role": "user", "content": "Hello"}],
    max_tokens=100
)
print(response.choices[0].message.content)

Anthropic SDK

from anthropic import Anthropic

client = Anthropic(
    api_key="YOUR_API_KEY",
    base_url="http://localhost:8000"
)

response = client.messages.create(
    model="glm-5",
    max_tokens=100,
    messages=[{"role": "user", "content": "Hello"}]
)
print(response.content[0].text)

Configuring Claude Code

Add the following to ~/.claude/settings.json:

{
  "env": {
    "ANTHROPIC_AUTH_TOKEN": "YOUR_API_KEY",
    "ANTHROPIC_BASE_URL": "http://localhost:8000",
    "ANTHROPIC_MODEL": "glm-5"
  }
}

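Web admin UI

Open http://localhost:8000/ to reach the Web admin UI, which provides:

Providers: view and manage upstream providers, with connectivity tests
Models: configure models and their adapter chains, with step-by-step Chain tests
API Keys: create, view, and delete API Keys
Usage Stats: usage statistics grouped by provider, model, or API Key, with multi-level drill-down
Debug Logs: live request logs, filterable by level, Request ID, or API Key
Conversations: conversation viewer with full input/output/Tool Call replay and collapsible messages
i18n: Chinese/English interface switching (language button in the top-right corner)
Login: the admin UI requires logging in with an admin-role API Key before any operation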

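API Key management

Via the Web UI

Open http://localhost:8000/
Switch to the "API Keys" tab
Click "新建 Key" (New Key) to create an API Key
Configurable fields: name, description, rate limit, daily quota, allowed models, expiry time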
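
The rate limit, daily quota, and allowed-models settings can be pictured with the sketch below. It is a hypothetical in-memory model, not the gateway's code: `KeyPolicy` and the result strings are invented for illustration, the per-minute limit is assumed to be a rolling 60-second window, and the daily quota is assumed to reset at UTC midnight.

```python
import time
from collections import deque
from dataclasses import dataclass, field
from typing import Deque, Optional, Set


@dataclass
class KeyPolicy:
    rate_limit: int                             # max requests per rolling 60 s
    daily_limit: int                            # max requests per UTC day
    allowed_models: Optional[Set[str]] = None   # None means no whitelist
    _recent: Deque[float] = field(default_factory=deque)
    _day: int = -1
    _day_count: int = 0

    def check(self, model: str, now: Optional[float] = None) -> str:
        """Return "ok" and count the request, or the reason it is refused."""
        now = time.time() if now is None else now
        if self.allowed_models is not None and model not in self.allowed_models:
            return "model_not_allowed"

        # Forget requests that have left the 60-second window
        while self._recent and now - self._recent[0] >= 60:
            self._recent.popleft()
        if len(self._recent) >= self.rate_limit:
            return "rate_limited"

        # Reset the daily counter when the UTC day changes
        day = int(now // 86400)
        if day != self._day:
            self._day, self._day_count = day, 0
        if self._day_count >= self.daily_limit:
            return "daily_limit_reached"

        self._recent.append(now)
        self._day_count += 1
        return "ok"


policy = KeyPolicy(rate_limit=2, daily_limit=3, allowed_models={"glm-5"})
first = policy.check("glm-5", now=0.0)
```

Passing `now` explicitly makes the behaviour easy to test without waiting for real time to pass.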

Via the API

# List all API Keys
# Requires the admin role
curl -s http://localhost:8000/api/keys \
  -H "Authorization: Bearer YOUR_ADMIN_KEY"

# Create a new Key
# Requires the admin role
curl -s -X POST http://localhost:8000/api/keys \
  -H "Authorization: Bearer YOUR_ADMIN_KEY" \
  -H "Content-Type: application/json" \
  -d '{
    "name": "my-app",
    "description": "My application",
    "rate_limit": 60,
    "daily_limit": 1000,
    "allowed_models": ["glm-5"]
  }'

# Delete a Key
# Requires the admin role
curl -s -X DELETE http://localhost:8000/api/keys/my-app \
  -H "Authorization: Bearer YOUR_ADMIN_KEY"
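
The same create call can be issued from Python with only the standard library. This sketch mirrors the curl request above (same path, headers, and JSON fields). The helper name `build_create_key_request` is an assumption, and the response format is not documented here, so the sending step is left commented out.

```python
import json
import urllib.request

GATEWAY = "http://localhost:8000"  # default address used throughout this README


def build_create_key_request(admin_key: str, name: str, **settings) -> urllib.request.Request:
    """Build (but do not send) the POST /api/keys request."""
    body = json.dumps({"name": name, **settings}).encode("utf-8")
    return urllib.request.Request(
        f"{GATEWAY}/api/keys",
        data=body,
        method="POST",
        headers={
            "Authorization": f"Bearer {admin_key}",
            "Content-Type": "application/json",
        },
    )


request = build_create_key_request(
    "YOUR_ADMIN_KEY",
    "my-app",
    rate_limit=60,
    daily_limit=1000,
    allowed_models=["glm-5"],
)

# To send it against a running gateway:
# with urllib.request.urlopen(request) as response:
#     print(response.read().decode("utf-8"))
```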

Admin API authorization

The admin UI and API use role-based access control:

Path	Authentication required	Required role
`/`, `/health`, `/docs`, `/web/*`	❌	-
`/v1/`, `/openai/`, `/anthropic/*`	✅	Any valid Key
`/api/usage`, `/api/logs`, `/api/conversations`	✅	Any valid Key
`/api/config/`, `/api/keys/`	✅	admin
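
The access table can be read as a small decision function. The sketch below encodes the listed paths. `required_role` and `is_allowed` are hypothetical names, and treating every unlisted path as "any valid Key" is an assumption rather than documented behaviour.

```python
from typing import Optional, Set

PUBLIC_EXACT = {"/", "/health", "/docs"}
PUBLIC_PREFIXES = ("/web/",)
ADMIN_PREFIXES = ("/api/config", "/api/keys")


def required_role(path: str) -> Optional[str]:
    """None for public paths, "admin" for admin-only, "any" for any valid key."""
    if path in PUBLIC_EXACT or path.startswith(PUBLIC_PREFIXES):
        return None
    if path.startswith(ADMIN_PREFIXES):
        return "admin"
    return "any"


def is_allowed(path: str, roles: Optional[Set[str]]) -> bool:
    """`roles` is None for an unauthenticated caller, else the key's roles."""
    need = required_role(path)
    if need is None:
        return True          # public path, no key needed
    if roles is None:
        return False         # protected path, no valid key presented
    return need == "any" or need in roles
```

A key with no roles can call the model endpoints, while `/api/keys` additionally needs `"admin"` in the key's role set.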

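The config file lives in the workspace directory (~/.modelswitch/config.yaml by default; override the location with --workspace or the MODELSWITCH_WORKSPACE environment variable).

To grant administrator privileges, add a roles field to the API Key in the config file:

api_keys:
  - key: sk-gateway-admin
    name: admin
    roles:
      - admin

Front-end login: opening the Web admin UI brings up a login dialog; enter an API Key that has the admin role. The login state is persisted in the browser's localStorage.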

Available models

Models are defined in config.yaml in the workspace and can be customized to match the models your upstream providers support.

Troubleshooting

Viewing logs

# Request log (with the default workspace, logs are in ~/.modelswitch/logs/)
tail -f ~/.modelswitch/logs/gateway.log

# Conversation log (full requests and responses)
tail -f ~/.modelswitch/logs/conversations.jsonl
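
Because the conversation log is JSONL (one JSON object per line), it can also be inspected from Python. The record schema is not documented in this README, so the sketch below is schema-agnostic: it yields each record as a dict and makes no assumption about field names. The sample data uses placeholder keys.

```python
import io
import json
from typing import Iterator, TextIO


def iter_jsonl(stream: TextIO) -> Iterator[dict]:
    """Yield one dict per well-formed JSON object line."""
    for line in stream:
        line = line.strip()
        if not line:
            continue
        try:
            record = json.loads(line)
        except json.JSONDecodeError:
            # The last line may be half-written while the gateway is appending
            continue
        if isinstance(record, dict):
            yield record


# Placeholder data; real records will have the gateway's own fields
sample = io.StringIO('{"a": 1}\n\n{"b": 2}\n{"truncated": ')
records = list(iter_jsonl(sample))

# Against the real file:
# import os
# with open(os.path.expanduser("~/.modelswitch/logs/conversations.jsonl"), encoding="utf-8") as f:
#     for record in iter_jsonl(f):
#         print(sorted(record))
```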

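Common problems

401 Unauthorized: check that the API Key is correct
403 Forbidden: check that the API Key is enabled, that the model is in its allowed_models list, and, for admin API calls, that the Key has the admin role
404 Model not found: check the model name (it is case-sensitive; lowercase is recommended)
502/503 Upstream error: the upstream provider is unavailable; check the provider configuration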

License

MIT

This version

0.2.2

Apr 8, 2026

