Coze Coding Dev SDK - 优雅的多功能 AI SDK，支持图片生成、视频生成、语音合成、语音识别、大语言模型、联网搜索和文本/多模态 Embedding。包含命令行工具 coze-coding-ai，支持 Context 上下文追踪

These details have not been verified by PyPI

Project links

Project description

Coze Coding Dev SDK

优雅、模块化的多功能 AI SDK，支持图片生成、视频生成、语音合成、语音识别、大语言模型和联网搜索。同时提供强大的命令行工具 coze-coding-ai。

✨ 特性

🎨 图片生成 - 基于豆包 SeeDream 模型的高质量图片生成 (2K/4K)
🎬 视频生成 - 文本/图片生成视频，支持多种模型和配置
🤖 大语言模型 - 支持流式对话、思考链、缓存机制
🔍 联网搜索 - Web 搜索、AI 总结、图片搜索
🎙️ 语音合成 (TTS) - 多音色、高质量的文本转语音
🎧 语音识别 (ASR) - 快速准确的语音转文字
🏗️ 模块化设计 - 清晰的模块划分，易于扩展
🔒 类型安全 - 完整的类型提示
🔄 自动重试 - 内置重试机制
📊 可观测性 - 集成 cozeloop 监控
🛠️ 命令行工具 - 提供 coze-coding-ai CLI 工具，快速使用 AI 功能

📦 安装

仅安装 SDK

pip install coze-coding-dev-sdk

安装 SDK + CLI 工具

pip install coze-coding-dev-sdk[cli]

开发安装

git clone https://github.com/coze/coze-coding-dev-sdk.git
cd coze-coding-dev-sdk/packages/python
pip install -e ".[dev]"

🚀 快速开始

环境配置

必需配置

export COZE_WORKLOAD_IDENTITY_API_KEY="your_api_key"
export COZE_INTEGRATION_BASE_URL="https://api.coze.com"
export COZE_INTEGRATION_MODEL_BASE_URL="https://model.coze.com"

可选配置（Cozeloop 监控）

如果需要使用扣子罗盘（Cozeloop）进行模型监控和可观测性追踪，需要额外配置：

export COZELOOP_WORKSPACE_ID="your_workspace_id"
export COZELOOP_API_TOKEN="your_api_token"

获取 workspace_id 的方法：

登录扣子罗盘平台
在项目设置或工作区设置中找到 workspace_id
复制该 ID 并设置到环境变量中

注意： 如果不配置 Cozeloop 相关环境变量，会看到警告信息，但不影响 SDK 的正常使用。

图片生成

from coze_coding_dev_sdk import ImageGenerationClient, ImageConfig

client = ImageGenerationClient()

files, response = client.generate(
    prompt="一只可爱的橘猫坐在窗台上",
    config=ImageConfig(size="4K", watermark=False)
)

print(f"图片 URL: {response.image_urls}")

视频生成

from coze_coding_dev_sdk import VideoGenerationClient, VideoConfig

client = VideoGenerationClient()

task = client.text_to_video(
    prompt="一只可爱的小猫在草地上玩耍",
    config=VideoConfig(resolution="1080p", ratio="16:9", duration=5)
)

print(f"视频 URL: {task.video_url}")

大语言模型

from coze_coding_dev_sdk import LLMClient
from langchain_core.messages import HumanMessage

client = LLMClient()

messages = [HumanMessage(content="你好，介绍一下你自己")]

for chunk in client.stream(messages=messages):
    if chunk.content:
        print(chunk.content, end="", flush=True)

联网搜索

from coze_coding_dev_sdk import SearchClient

client = SearchClient()

results = client.web_search("Python 最新特性")
for item in results:
    print(f"{item.title}: {item.url}")

语音合成 (TTS)

from coze_coding_dev_sdk import TTSClient

client = TTSClient()

audio_file, size = client.synthesize(
    uid="user123",
    text="你好，欢迎使用 Coze SDK！",
    speaker="zh_female_xiaohe_uranus_bigtts"
)

print(f"音频文件: {audio_file.url}")

语音识别 (ASR)

from coze_coding_dev_sdk import ASRClient

client = ASRClient()

text, data = client.recognize(
    uid="user123",
    url="https://example.com/audio.mp3"
)

print(f"识别结果: {text}")

上下文追踪 (Context)

SDK 支持通过 Context 对象进行请求追踪和上下文管理。Context 会自动注入到 HTTP 请求头中，用于链路追踪、日志关联等场景。

from coze_coding_dev_sdk import ImageGenerationClient
from coze_coding_utils.runtime_ctx.context import Context

# 创建 Context 对象
ctx = Context(
    request_id="req-123456",
    trace_id="trace-789",
    user_id="user-001"
)

# 在初始化 Client 时传入 ctx
client = ImageGenerationClient(ctx=ctx)

# 后续所有请求都会自动携带 Context 信息
response = client.generate(prompt="一只可爱的小猫")

支持 Context 的所有 Client：

ImageGenerationClient - 图片生成
VideoGenerationClient - 视频生成
LLMClient - 大语言模型
SearchClient - 联网搜索
TTSClient - 语音合成
ASRClient - 语音识别

注意事项：

Context 参数是可选的，不传入时不影响正常使用
Context 只在 Client 初始化时传入，不需要在每个方法中传递
Context 信息会通过 default_headers() 自动转换为 HTTP 请求头

📁 项目结构

coze-coding-dev-sdk/
├── coze_coding_dev_sdk/          # SDK 主包
│   ├── core/                # 核心层（配置、异常、基础客户端）
│   ├── image/               # 图片生成模块
│   ├── video/               # 视频生成模块
│   ├── llm/                 # 大语言模型模块
│   ├── search/              # 联网搜索模块
│   └── voice/               # 语音模块（TTS + ASR）
├── examples/                # 使用示例
│   ├── image_examples/      # 图片生成示例
│   ├── video_examples/      # 视频生成示例
│   ├── llm_examples/        # LLM 示例
│   ├── search_examples/     # 搜索示例
│   └── voice_examples/      # 语音功能示例
├── LICENSE                  # MIT 许可证
├── CHANGELOG.md             # 版本变更记录
└── README.md                # 项目文档

🎯 核心模块

1. 图片生成 (Image)

from coze_coding_dev_sdk import ImageGenerationClient, ImageConfig

client = ImageGenerationClient()

files, response = client.generate(
    prompt="壮丽的雪山日出",
    config=ImageConfig(size="4K", watermark=False)
)

results = await client.batch_generate([
    {"prompt": "春天的樱花", "size": "2K"},
    {"prompt": "夏日的海滩", "size": "4K"},
])

功能特性：

支持 2K/4K 或自定义尺寸
同步/异步双模式
批量并发生成
参考图片风格迁移
组图生成

2. 视频生成 (Video)

from coze_coding_dev_sdk import VideoGenerationClient, VideoConfig

client = VideoGenerationClient()

task = client.image_to_video(
    prompt="小猫从坐着到站起来",
    first_frame_url="https://example.com/cat_sitting.jpg",
    last_frame_url="https://example.com/cat_standing.jpg",
    config=VideoConfig(resolution="1080p", duration=5)
)

功能特性：

文本生成视频（text-to-video）
图片生成视频（image-to-video）
支持首帧、尾帧、参考图片
异步任务轮询
多种分辨率和比例

3. 大语言模型 (LLM)

from coze_coding_dev_sdk import LLMClient

client = LLMClient()

response = client.chat(
    messages=[{"role": "user", "content": "解释量子计算"}],
    enable_thinking=True,
    enable_cache=True
)

for chunk in client.chat_stream(messages=[...]):
    print(chunk, end="")

功能特性：

流式和非流式对话
思考链（thinking）
缓存机制
多模态支持
集成 LangChain

4. 联网搜索 (Search)

from coze_coding_dev_sdk import SearchClient

client = SearchClient()

web_results = client.web_search("AI 最新进展")

summary, results = client.web_search_with_summary("量子计算原理")

images = client.image_search("可爱的猫咪")

功能特性：

Web 搜索
Web 搜索 + AI 总结
图片搜索
结构化结果返回

5. 语音合成 (TTS)

from coze_coding_dev_sdk import TTSClient, TTSConfig

client = TTSClient()

audio, size = client.synthesize(
    uid="user123",
    text="你好世界",
    speaker="zh_female_xiaohe_uranus_bigtts",
    audio_format="mp3",
    speech_rate=0
)

功能特性：

30+ 音色选择
支持 SSML 格式
可调节语速和音量
多种音频格式（MP3/PCM/OGG）
流式返回

6. 语音识别 (ASR)

from coze_coding_dev_sdk import ASRClient

client = ASRClient()

text, data = client.recognize(
    uid="user123",
    url="https://example.com/audio.mp3"
)

功能特性：

支持 URL 和 Base64 输入
多种音频格式
详细的时间戳信息
最长 2 小时音频

🔧 高级配置

统一配置

from coze_coding_dev_sdk import Config, ImageGenerationClient, TTSClient

config = Config(
    api_key="your_api_key",
    base_url="https://api.coze.com",
    retry_times=5,
    retry_delay=2.0,
    timeout=120
)

image_client = ImageGenerationClient(config=config)
tts_client = TTSClient(config=config)

异常处理

from coze_coding_dev_sdk import (
    CozeSDKError,
    APIError,
    NetworkError,
    ValidationError,
    ConfigurationError
)

try:
    files, response = client.generate(prompt="测试")
except ValidationError as e:
    print(f"参数错误: {e.field} = {e.value}")
except APIError as e:
    print(f"API 错误: {e.message}, 状态码: {e.status_code}")
except NetworkError as e:
    print(f"网络错误: {e}")
except ConfigurationError as e:
    print(f"配置错误: {e.missing_key}")

💡 使用示例

查看 examples/ 目录获取更多示例：

examples/image_examples/ - 图片生成示例
examples/video_examples/ - 视频生成示例
examples/llm_examples/ - 大语言模型示例
examples/search_examples/ - 联网搜索示例
examples/voice_examples/ - 语音功能示例

运行示例：

python examples/image_examples/simple_example.py
python examples/video_examples/text_to_video.py
python examples/llm_examples/simple_chat.py
python examples/search_examples/web_search.py
python examples/voice_examples/tts_example.py

🛠️ 命令行工具 (CLI)

安装 CLI 工具后,可以直接在命令行中使用 AI 功能:

# 安装 CLI
pip install coze-coding-dev-sdk[cli]

# 查看帮助
coze-coding-ai --help

图片生成

支持的图片尺寸:

2K (默认, ~2560x1440)
4K (~3840x2160)
自定义: WIDTHxHEIGHT (宽度 2560-4096, 高度 1440-4096)

# 文生图 (默认 2K)
coze-coding-ai image -p "A beautiful landscape" -o "./image.png"

# 4K 分辨率
coze-coding-ai image -p "Professional portrait" -o "./portrait.png" -s 4K

# 图生图
coze-coding-ai image \
  -p "Transform into watercolor style" \
  -i "https://example.com/photo.jpg" \
  -o "./result.png"

视频生成

# 文生视频
coze-coding-ai video -p "A cat playing with a ball" --poll

# 图生视频
coze-coding-ai video \
  -i "https://example.com/image.png" \
  -p "Make the scene come alive" \
  --poll

# 查询任务状态
coze-coding-ai video-status <task-id>

# Mock 模式（测试运行，不消耗配额）
coze-coding-ai video -p "Test video" --mock --poll
coze-coding-ai video-status <task-id> --mock

Chat 对话

# 基础对话
coze-coding-ai chat -p "Hello, how are you?"

# 自定义系统提示词
coze-coding-ai chat \
  -p "Review this code: function add(a,b) { return a+b; }" \
  -s "You are an expert code reviewer" \
  -o review.json

# 启用思考链 (Chain of Thought)
coze-coding-ai chat \
  -p "Solve this math problem: If a train travels 120km in 2 hours, what's its speed?" \
  -t \
  -o solution.json

# 流式输出
coze-coding-ai chat \
  -p "Write a short story about a robot" \
  --stream

联网搜索

# 网页搜索（默认）
coze-coding-ai search "AI 最新进展"

# 网页搜索 + 指定结果数量
coze-coding-ai search "Python 教程" --count 20

# 网页搜索 + AI 智能摘要
coze-coding-ai search "量子计算原理" --type web_summary

# 图片搜索
coze-coding-ai search "可爱的猫咪" --type image
coze-coding-ai search "可爱的猫咪" -t image -c 20

# 高级过滤 - 指定站点
coze-coding-ai search "Python 教程" --sites "python.org,github.com"

# 高级过滤 - 屏蔽站点
coze-coding-ai search "新闻" --block-hosts "example.com"

# 高级过滤 - 时间范围（1d=1天, 1w=1周, 1m=1月）
coze-coding-ai search "最新科技" --time-range "1d"

# 高级过滤 - 仅返回有正文的结果
coze-coding-ai search "技术文章" --need-content

# 保存结果到 JSON 文件
coze-coding-ai search "AI 研究" -o results.json

# 不同输出格式
coze-coding-ai search "Python" --format table    # 表格格式（默认）
coze-coding-ai search "Python" --format simple   # 简单文本格式
coze-coding-ai search "Python" --format json     # JSON 格式

搜索类型说明：

web - 普通网页搜索（默认）
image - 图片搜索
web_summary - 网页搜索 + AI 智能摘要

高级过滤选项：

--count, -c - 返回结果数量（默认 10）
--summary, -s - 启用 AI 摘要（仅 web 类型）
--need-content - 仅返回有正文的结果
--need-url - 仅返回有原文链接的结果
--sites - 指定搜索的站点范围（逗号分隔）
--block-hosts - 屏蔽的站点（逗号分隔）
--time-range - 发文时间范围（1d, 1w, 1m）
--output, -o - 保存结果到 JSON 文件
--format, -f - 输出格式（table, json, simple）

更多 CLI 使用说明请查看 CLI 文档

📊 版本历史

查看 CHANGELOG.md 了解详细的版本变更记录。

🤝 贡献

欢迎提交 Issue 和 Pull Request！

查看 CONTRIBUTING.md 了解如何参与贡献。

📄 许可证

本项目采用 MIT License 开源协议。

🙏 致谢

基于 Coze AI Integrations 和豆包大模型构建。

📞 联系方式

项目主页: https://github.com/coze/coze-sdk
问题反馈: https://github.com/coze/coze-sdk/issues
邮箱: support@coze.com

⭐ Star History

如果这个项目对你有帮助，请给我们一个 Star ⭐️

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.5.14

Apr 14, 2026

0.5.13

Apr 7, 2026

0.5.12

Mar 31, 2026

This version

0.5.11

Feb 28, 2026

0.5.10

Feb 27, 2026

0.5.9

Feb 10, 2026

0.5.8

Feb 4, 2026

0.5.7

Jan 30, 2026

0.5.6

Jan 26, 2026

0.5.5

Jan 19, 2026

0.5.4

Jan 15, 2026

0.5.3

Jan 9, 2026

0.5.2

Jan 8, 2026

0.5.1

Jan 5, 2026

0.5.0

Dec 30, 2025

0.4.5

Dec 27, 2025

0.4.4

Dec 27, 2025

0.4.3

Dec 26, 2025

0.4.2

Dec 26, 2025

0.4.1

Dec 25, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

coze_coding_dev_sdk-0.5.11.tar.gz (104.7 kB view details)

Uploaded Feb 28, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

coze_coding_dev_sdk-0.5.11-py3-none-any.whl (127.6 kB view details)

Uploaded Feb 28, 2026 Python 3

File details

Details for the file coze_coding_dev_sdk-0.5.11.tar.gz.

File metadata

Download URL: coze_coding_dev_sdk-0.5.11.tar.gz
Upload date: Feb 28, 2026
Size: 104.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.3.2 CPython/3.11.2 Linux/5.4.210.bsk.6-amd64

File hashes

Hashes for coze_coding_dev_sdk-0.5.11.tar.gz
Algorithm	Hash digest
SHA256	`10097b5088e8aab8e55fc696bcc13afc39dc4c3ffc8eee31ae0c6cbd7b32d488`
MD5	`4736e1318618ddf3e58fb6eeae5ac13f`
BLAKE2b-256	`86560b52b6953f5b0b9d6bf66c7d4301c8f88dca25b6a42bb06064e397e72cfb`

See more details on using hashes here.

File details

Details for the file coze_coding_dev_sdk-0.5.11-py3-none-any.whl.

File metadata

Download URL: coze_coding_dev_sdk-0.5.11-py3-none-any.whl
Upload date: Feb 28, 2026
Size: 127.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: poetry/2.3.2 CPython/3.11.2 Linux/5.4.210.bsk.6-amd64

File hashes

Hashes for coze_coding_dev_sdk-0.5.11-py3-none-any.whl
Algorithm	Hash digest
SHA256	`34fe407ce17be2cf87449e5f07f71c9eacb29f46ec144da688e1ba723dc24c66`
MD5	`fa34e6105018ff6a8ec57b5095c9db82`
BLAKE2b-256	`d597a65b377c72218503854a6c13971c567eb592ff1993dfce0525a7aad5f8ee`

See more details on using hashes here.

coze-coding-dev-sdk 0.5.11

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Coze Coding Dev SDK

✨ 特性

📦 安装

仅安装 SDK

安装 SDK + CLI 工具

开发安装

🚀 快速开始

环境配置

必需配置

可选配置（Cozeloop 监控）

图片生成

视频生成

大语言模型

联网搜索

语音合成 (TTS)

语音识别 (ASR)

上下文追踪 (Context)

📁 项目结构

🎯 核心模块

1. 图片生成 (Image)

2. 视频生成 (Video)

3. 大语言模型 (LLM)

4. 联网搜索 (Search)

5. 语音合成 (TTS)

6. 语音识别 (ASR)

🔧 高级配置

统一配置

异常处理

💡 使用示例

🛠️ 命令行工具 (CLI)

图片生成

视频生成

Chat 对话

联网搜索

📊 版本历史

🤝 贡献

📄 许可证

🙏 致谢

📞 联系方式

⭐ Star History

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes