Translate OpenAI Responses API ↔ DeepSeek Chat Completions API for codex

These details have not been verified by PyPI

Project links

Project description

DeepSeek Proxy

A lightweight proxy that translates OpenAI Responses API requests into DeepSeek Chat Completions API calls.
Designed for use with Claude Code as a backend model provider.

English

Motivation

codex (Claude Code CLI) uses the OpenAI Responses API (/v1/responses) natively, with SSE streaming and a specific event lifecycle. DeepSeek only provides a standard Chat Completions API (/v1/chat/completions). These two protocols are incompatible — you cannot simply point codex at a DeepSeek endpoint and expect it to work.

This proxy solves that problem by translating between the two protocols in real time, allowing codex to use DeepSeek as its backend model.

Overview

DeepSeek Proxy is a single-file Flask application that sits between codex and the DeepSeek API. It accepts requests in the Responses API streaming format, translates them into DeepSeek's chat completions format, and converts the streaming SSE output back into the Responses API event protocol.

Architecture

┌─────────────┐     Responses API SSE      ┌────────────────┐     DeepSeek Chat API      ┌────────────┐
│    codex    │  ───────────────────────→  │  ds_proxy.py   │  ──────────────────────→  │  DeepSeek  │
│ (CLI Client)│  ←───────────────────────  │  (Flask Proxy) │  ←──────────────────────  │   API      │
└─────────────┘     SSE Events (7 types)    └────────────────┘     SSE stream chunks      └────────────┘

Components

Layer	Technology	Role
Server	Flask + uvicorn (ASGI)	HTTP server, SSE streaming
Adapter	`asgiref.wsgi.WsgiToAsgi`	WSGI → ASGI wrapper for uvicorn
Translator	Custom logic in `ds_proxy.py`	Responses API ↔ Chat API conversion
Client	`requests` (streaming)	HTTP client to DeepSeek API

Endpoints

Route	Method	Purpose
`/v1/responses`	POST	Accept Responses API request, return SSE stream
`/v1/models`	GET	List available model (`deepseek-v4-flash`)
`/v1/models/<id>`	GET	Get model capabilities

Design Approach

The proxy follows a stream-through architecture:

No buffering — as DeepSeek streams tokens, the proxy immediately forwards them as SSE events
Minimal transformation — only the necessary field mappings are applied, keeping latency low
Fail-fast — errors from DeepSeek are propagated back as SSE error events
Single-file — the entire proxy is one Python file for easy deployment and modification

Processing Flow

Request Translation

When a client sends a request to /v1/responses, the proxy:

Extracts the input array from the Responses API payload
For each message:
- Maps role: "developer" → role: "system" (DeepSeek doesn't support "developer" role)
- Flattens content arrays into a single string (concatenates input_text parts)
- Passes through role: "user" and role: "assistant" unchanged
Constructs a DeepSeek Chat Completions payload with stream: true
Streams the request to DeepSeek's API

SSE Event Lifecycle

The proxy emits exactly these 8 events in order during a successful stream:

 1. response.created        → Stream begins, response metadata
 2. response.in_progress    → Response status confirmed
 3. response.output_item.added   → Output slot opened (type: "message")
 4. response.content_part.added  → Text part initialized
 5. response.output_text.delta   → Incremental content (one per token)
 6. response.output_text.done    → Full text with aggregated content
 7. response.output_item.done    → Output item completed
 8. response.completed           → Full response with usage info

On error, the proxy emits a single response.error event and stops.

Field Mapping

OpenAI Responses API	DeepSeek Chat API	Direction
`input[].role: "developer"`	`messages[].role: "system"`	→
`input[].content[].input_text`	`messages[].content` (string)	→
`choices[0].delta.content`	`response.output_text.delta.delta`	←
`usage.prompt_tokens`	`usage.input_tokens`	←
`usage.completion_tokens`	`usage.output_tokens`	←
`prompt_tokens_details.cached_tokens`	`input_tokens_details.cached_tokens`	←
`completion_tokens_details.reasoning_tokens`	`output_tokens_details.reasoning_tokens`	←

Quick Start

Prerequisites

Python 3.8+
A DeepSeek API key

Installation

git clone <repo-url>
cd deepseek-proxy
pip install flask requests uvicorn asgiref

Run

export DEEPSEEK_API_KEY=sk-your-key-here
python ds_proxy.py

The proxy starts on http://127.0.0.1:8787.

Verify

curl -X POST http://127.0.0.1:8787/v1/responses \
  -H "Authorization: Bearer $DEEPSEEK_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"input":[{"role":"user","content":[{"type":"input_text","text":"hello"}]}],"model":"deepseek-v4-flash"}'

Configure codex (Claude Code CLI)

codex can be configured to use the proxy in two ways.

Option A: Proxy config (simpler)

Add to ~/.claude/settings.local.json:

{
  "proxy": {
    "url": "http://127.0.0.1:8787/v1/responses",
    "model": "deepseek-v4-flash"
  }
}

This tells Claude Code to route all requests through the proxy URL and use the specified model.

Option B: Model provider config

Add to ~/.codex/config.toml:

model = "deepseek-v4-flash"
model_provider = "deepseek"

[model_providers.deepseek]
name = "DeepSeek"
base_url = "http://127.0.0.1:8787/v1"
env_key = "DEEPSEEK_API_KEY"
wire_api = "responses"

This registers DeepSeek as a custom model provider, making it selectable alongside other providers.

Environment Variable

Regardless of which option you choose, set the API key:

export DEEPSEEK_API_KEY=sk-your-actual-key-here

The proxy reads this from the environment and passes it as the Authorization header to DeepSeek's API.

Verification

Run Claude Code and send a message:

codex "hello"

You should see a response from the DeepSeek model. Check the proxy logs for details:

tail -f /tmp/ds_proxy.log

中文

初衷

codex（Claude Code CLI）原生使用 OpenAI Responses API（/v1/responses），采用 SSE 流式传输和特定的事件生命周期。而 DeepSeek 只提供标准的 Chat Completions API（/v1/chat/completions）。这两种协议互不兼容——不能简单地把 codex 指向 DeepSeek 端点就指望它能工作。

这个代理通过实时转换两种协议解决了这个问题，让 codex 可以使用 DeepSeek 作为后端模型。

概述

DeepSeek Proxy 是一个单文件 Flask 应用，充当 codex 和 DeepSeek API 之间的桥梁。它将 Responses API 流式格式的请求转换为 DeepSeek 的对话补全格式，再将 DeepSeek 的流式输出转换回 Responses API 事件协议。

架构

┌─────────────┐     Responses API SSE      ┌────────────────┐     DeepSeek Chat API      ┌────────────┐
│    codex    │  ───────────────────────→  │  ds_proxy.py   │  ──────────────────────→  │  DeepSeek  │
│  (CLI 客户端) │  ←───────────────────────  │  (Flask 代理)  │  ←──────────────────────  │   API      │
└─────────────┘     SSE 事件 (7 种类型)     └────────────────┘     SSE 流式数据块         └────────────┘

组件

层级	技术	职责
服务器	Flask + uvicorn (ASGI)	HTTP 服务、SSE 流式传输
适配器	`asgiref.wsgi.WsgiToAsgi`	WSGI → ASGI 包装，用于 uvicorn
转换器	`ds_proxy.py` 中的自定义逻辑	请求/响应格式转换
客户端	`requests` (流式)	向 DeepSeek API 发送 HTTP 请求

接口

路由	方法	用途
`/v1/responses`	POST	接收 Responses API 请求，返回 SSE 流
`/v1/models`	GET	列出可用模型 (`deepseek-v4-flash`)
`/v1/models/<id>`	GET	获取模型能力信息

设计思路

代理采用流式直通架构：

不缓冲 — DeepSeek 逐 token 返回时，代理立即转发为 SSE 事件
最小转换 — 只应用必要的字段映射，保持低延迟
快速失败 — DeepSeek 的错误通过 SSE 错误事件传播回客户端
单文件 — 整个代理只有一个 Python 文件，便于部署和修改

处理流程

请求转换

客户端向 /v1/responses 发送请求时，代理执行以下操作：

从 Responses API 请求体中提取 input 数组
对每条消息：
- 将 role: "developer" 映射为 role: "system"（DeepSeek 不支持 "developer" 角色）
- 将 content 数组合并为一个字符串（拼接所有 input_text 片段）
- role: "user" 和 role: "assistant" 保持不变
构造 DeepSeek Chat Completions 请求体，设置 stream: true
以流式方式向 DeepSeek API 发送请求

SSE 事件生命周期

一次成功的流式响应会按顺序发送以下 8 个事件：

 1. response.created        → 流开始，响应元数据
 2. response.in_progress    → 确认响应进行中
 3. response.output_item.added   → 开启输出槽 (类型: "message")
 4. response.content_part.added  → 初始化文本部分
 5. response.output_text.delta   → 增量内容（每个 token 一次）
 6. response.output_text.done    → 完整文本内容
 7. response.output_item.done    → 输出项完成
 8. response.completed           → 完整响应，含 usage 信息

发生错误时，代理发送一个 response.error 事件并停止。

字段映射

OpenAI Responses API	DeepSeek Chat API	方向
`input[].role: "developer"`	`messages[].role: "system"`	→
`input[].content[].input_text`	`messages[].content` (字符串)	→
`choices[0].delta.content`	`response.output_text.delta.delta`	←
`usage.prompt_tokens`	`usage.input_tokens`	←
`usage.completion_tokens`	`usage.output_tokens`	←
`prompt_tokens_details.cached_tokens`	`input_tokens_details.cached_tokens`	←
`completion_tokens_details.reasoning_tokens`	`output_tokens_details.reasoning_tokens`	←

快速开始

前置条件

Python 3.8+
DeepSeek API 密钥

安装

git clone <repo-url>
cd deepseek-proxy
pip install flask requests uvicorn asgiref

运行

export DEEPSEEK_API_KEY=sk-your-key-here
python ds_proxy.py

代理启动在 http://127.0.0.1:8787。

验证

curl -X POST http://127.0.0.1:8787/v1/responses \
  -H "Authorization: Bearer $DEEPSEEK_API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"input":[{"role":"user","content":[{"type":"input_text","text":"你好"}]}],"model":"deepseek-v4-flash"}'

配置 codex (Claude Code CLI)

有两种方式让 codex 使用代理。

方式 A：代理配置（更简单）

添加到 ~/.claude/settings.local.json：

{
  "proxy": {
    "url": "http://127.0.0.1:8787/v1/responses",
    "model": "deepseek-v4-flash"
  }
}

这告诉 Claude Code 将所有请求通过代理 URL 路由，并使用指定模型。

方式 B：模型提供商配置

添加到 ~/.codex/config.toml：

model = "deepseek-v4-flash"
model_provider = "deepseek"

[model_providers.deepseek]
name = "DeepSeek"
base_url = "http://127.0.0.1:8787/v1"
env_key = "DEEPSEEK_API_KEY"
wire_api = "responses"

这将 DeepSeek 注册为自定义模型提供商，可以在多个提供商之间切换选择。

环境变量

无论选择哪种方式，都需要设置 API 密钥：

export DEEPSEEK_API_KEY=sk-your-actual-key-here

代理从环境变量读取密钥，并将其作为 Authorization 头传递给 DeepSeek API。

验证

运行 Claude Code 发送消息：

codex "hello"

你应该能看到来自 DeepSeek 模型的响应。查看代理日志获取详情：

tail -f /tmp/ds_proxy.log

English | 中文

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.2.2

May 16, 2026

0.2.1

May 16, 2026

0.1.9

May 16, 2026

0.1.8

May 16, 2026

0.1.7

May 16, 2026

0.1.6

May 16, 2026

0.1.3

May 16, 2026

0.1.2

May 16, 2026

0.1.1

May 16, 2026

This version

0.1.0

May 16, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

deepseek_proxy-0.1.0.tar.gz (13.6 kB view details)

Uploaded May 16, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

deepseek_proxy-0.1.0-py3-none-any.whl (10.6 kB view details)

Uploaded May 16, 2026 Python 3

File details

Details for the file deepseek_proxy-0.1.0.tar.gz.

File metadata

Download URL: deepseek_proxy-0.1.0.tar.gz
Upload date: May 16, 2026
Size: 13.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for deepseek_proxy-0.1.0.tar.gz
Algorithm	Hash digest
SHA256	`074b0f101df6d6f9c0619e2f2e2a1eace2ec993b8cba7de5bafb3b333eafc3de`
MD5	`e69a46f23e5cac0190c82f3cf558bd42`
BLAKE2b-256	`2b28801f4bc1c1e973cd2f8472b2acdc09ebd00638dd9a5a42f5e9912e6beef9`

See more details on using hashes here.

File details

Details for the file deepseek_proxy-0.1.0-py3-none-any.whl.

File metadata

Download URL: deepseek_proxy-0.1.0-py3-none-any.whl
Upload date: May 16, 2026
Size: 10.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for deepseek_proxy-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`095e10fea21f5952bf0942a9d273c8a7642ec246ca195661e1211828ae36c3eb`
MD5	`1bc6ee36d0e0c63bec64cf2b690eabfb`
BLAKE2b-256	`24f1c4d01bd74ed4c4f54a255cb945cc332290de45bc82e4baeea304985de6da`

See more details on using hashes here.

deepseek-proxy 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

DeepSeek Proxy

English

Motivation

Overview

Architecture

Components

Endpoints

Design Approach

Processing Flow

Request Translation

SSE Event Lifecycle

Field Mapping

Quick Start

Prerequisites

Installation

Run

Verify

Configure codex (Claude Code CLI)

Option A: Proxy config (simpler)

Option B: Model provider config

Environment Variable

Verification

中文

初衷

概述

架构

组件

接口

设计思路

处理流程

请求转换

SSE 事件生命周期

字段映射

快速开始

前置条件

安装

运行

验证

配置 codex (Claude Code CLI)

方式 A：代理配置（更简单）

方式 B：模型提供商配置

环境变量

验证

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes