
DashScope Realtime ASR & TTS SDK


DashScope Realtime

🚀 Async Python SDK for DashScope Realtime ASR (Speech Recognition) & TTS (Speech Synthesis)


Introduction

DashScope Realtime is an asynchronous, WebSocket-based Python SDK for Alibaba DashScope's real-time streaming speech recognition (ASR) and streaming speech synthesis (TTS) services.


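Why this project?

The official DashScope Python SDK from Alibaba Cloud is built on a synchronous WebSocket implementation, which causes the following problems:

  • No support for async / await

  • Callbacks do not run on the caller's event loop, so they cannot be used directly from an async context

  • Incompatible with open-source projects in the OpenAI API ecosystem (such as FastAPI and Chainlit)

To solve these problems, this project reimplements the ASR (speech recognition) and TTS (speech synthesis) SDK asynchronously on top of the DashScope WebSocket API. It offers:

  • A purely asynchronous API design

  • Streaming audio input and output

  • Seamless, transparent context switching

  • Easier integration with open-source projects that follow the OpenAI API style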
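
The event-loop problem listed above can be illustrated without either SDK. The sketch below uses a hypothetical `start_sync_client` stand-in for a callback-based client that fires callbacks on its own thread, and shows the bridging code an application would otherwise need (`loop.call_soon_threadsafe` plus an `asyncio.Queue`). It is an illustration of the general pattern, not the implementation used by this project or by the official SDK.

```python
import asyncio
import threading


def start_sync_client(on_message):
    """Hypothetical stand-in for a callback-based client running on its own thread."""
    def worker():
        for i in range(3):
            on_message(f"partial result {i}")
        on_message(None)  # sentinel: the stream has finished

    thread = threading.Thread(target=worker)
    thread.start()
    return thread


async def collect_results():
    loop = asyncio.get_running_loop()
    queue: asyncio.Queue = asyncio.Queue()

    def on_message(msg):
        # Runs on the worker thread. asyncio.Queue is not thread-safe,
        # so schedule the put on the event loop's own thread.
        loop.call_soon_threadsafe(queue.put_nowait, msg)

    thread = start_sync_client(on_message)
    results = []
    while (msg := await queue.get()) is not None:
        results.append(msg)
    thread.join()  # the worker has already sent its sentinel, so this returns quickly
    return results


if __name__ == "__main__":
    print(asyncio.run(collect_results()))
```

With a natively asynchronous client, this hand-off layer is unnecessary because results are awaited on the same event loop that sends the requests.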


Installation

pip install dashscope-realtime

Quick start

Real-time speech recognition (ASR)

import asyncio
from dashscope_realtime import DashScopeRealtimeASR

async def main():
    async with DashScopeRealtimeASR(api_key="your-api-key") as asr:
        await asr.send_audio_chunk(b"...")  # send one audio chunk

asyncio.run(main())
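
In practice the audio arrives as a stream and is sent chunk by chunk. The following is a minimal sketch of splitting raw PCM into fixed-duration chunks and passing each one to an async sender. The sample rate, sample width, and chunk duration are assumptions made for illustration (this README does not state the required audio format), and `iter_chunks` and `stream_audio` are hypothetical helpers, not part of the SDK.

```python
import asyncio
from typing import Awaitable, Callable, Iterator

SAMPLE_RATE = 16_000   # assumption: 16 kHz audio
BYTES_PER_SAMPLE = 2   # assumption: 16-bit mono PCM
CHUNK_MS = 100         # assumption: 100 ms per chunk


def iter_chunks(pcm: bytes, chunk_ms: int = CHUNK_MS) -> Iterator[bytes]:
    """Yield consecutive slices of `pcm`, each covering `chunk_ms` of audio."""
    size = SAMPLE_RATE * BYTES_PER_SAMPLE * chunk_ms // 1000
    for start in range(0, len(pcm), size):
        yield pcm[start:start + size]


async def stream_audio(pcm: bytes, send: Callable[[bytes], Awaitable[None]]) -> int:
    """Send every chunk through `send` and return the number of chunks sent."""
    sent = 0
    for chunk in iter_chunks(pcm):
        await send(chunk)
        sent += 1
    return sent
```

Inside the `async with` block, `asr.send_audio_chunk` could be passed as `send`. Check the DashScope documentation for the audio format and chunk size the service actually expects.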

Real-time speech synthesis (TTS)

import asyncio
from dashscope_realtime import DashScopeRealtimeTTS

async def main():
    async with DashScopeRealtimeTTS(api_key="your-api-key") as tts:
        await tts.send_text("Hello, DashScope!")  # send text
        await tts.finish()                        # finish the task

asyncio.run(main())
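
When the text comes from a streaming source such as an LLM, one common approach is to buffer tokens until a sentence ends and then send each sentence for synthesis. The sketch below shows that idea with plain async callables. `speak_stream` is a hypothetical helper, and whether sentence-level batching is needed or beneficial for this SDK is an assumption, since the README does not say how `send_text` segments its input.

```python
import asyncio
from typing import AsyncIterator, Awaitable, Callable

SENTENCE_END = tuple(".!?。!?")  # ASCII and full-width sentence terminators


async def speak_stream(
    tokens: AsyncIterator[str],
    send_text: Callable[[str], Awaitable[None]],
    finish: Callable[[], Awaitable[None]],
) -> None:
    """Buffer tokens into sentences, send each sentence, then finish."""
    buffer = ""
    async for token in tokens:
        buffer += token
        # Flush once the buffered text ends with a sentence terminator.
        if buffer.rstrip().endswith(SENTENCE_END):
            await send_text(buffer)
            buffer = ""
    if buffer:
        await send_text(buffer)  # flush any trailing partial sentence
    await finish()
```

Inside the `async with` block, `tts.send_text` and `tts.finish` could be passed as the two callables.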

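Features

  • ✅ Fully asynchronous design (async / await)
  • ✅ ASR supports streaming audio input
  • ✅ TTS supports streaming audio output
  • ✅ Automatic reconnection & error handling
  • ✅ Interface style aligned with OpenAI Realtime
  • ✅ Easy to integrate into any asynchronous Python project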

License

MIT License — see LICENSE for details.


Made with ❤️ by mikuh


