Skip to main content

A Python client library for Xiaozhi AI assistant

Project description

Xiaozhi Client

这是一个用于连接小智AI助手的Python客户端库。它提供了简单的接口来进行语音对话和文本交互。

小智前端-硬件版 小智后端-本地部署版

源码安装

git clone https://github.com/Eric0308/xiaozhi-client.git
cd xiaozhi-client
conda create -n xiaozhi-client python=3.10 -y
conda activate xiaozhi-client 
pip install -e .
python examples/simple_client.py

pip 安装

pip install xiaozhi-client

快速开始

这是一个基础的文本对话示例:

import asyncio
from xiaozhi_client import XiaozhiClient, ClientConfig

async def main():
    # 配置客户端
    config = ClientConfig(
        ws_url="ws://localhost:8000",
    )
    
    client = XiaozhiClient(config)
    
    try:
        await client.connect()
        # 发送文本消息
        await client.send_txt_message("你好")
    finally:
        await client.close()

if __name__ == "__main__":
    asyncio.run(main())

完整示例代码请参考 simple_client.py

语音对话示例

这是一个基础的语音对话示例:

import asyncio
import sounddevice as sd
from xiaozhi_client import XiaozhiClient, ClientConfig, AudioConfig

async def main():
    # 配置客户端
    config = ClientConfig(
        ws_url="ws://localhost:8000",
    )
    
    audio_config = AudioConfig(
        sample_rate=16000,
        channels=1,
        frame_size=960,
        frame_duration=20,
        format="opus"
    )
    
    client = XiaozhiClient(config, audio_config)
    
    try:
        await client.connect()
        # 开始录音并发送音频数据
        with sd.InputStream(samplerate=audio_config.sample_rate, channels=audio_config.channels) as stream:
            while True:
                data, _ = stream.read(audio_config.frame_size)
                await client.send_audio_data(data)
    finally:
        await client.close()

if __name__ == "__main__":
    asyncio.run(main())

完整示例代码请参考 audio_chat.py

特性

  • WebSocket连接管理
  • 音频编解码(Opus格式)
  • 支持实时语音对话
  • 支持文本消息交互
  • 内置设备标识和认证
  • 支持不同的语音识别模式

配置项

ClientConfig

  • ws_url: WebSocket服务器地址
  • device_token: 设备认证token
  • enable_token: 是否启用token认证
  • protocol_version: 协议版本(默认1)

AudioConfig

  • sample_rate: 采样率(默认16000)
  • channels: 声道数(默认1)
  • frame_size: 帧大小(默认960)
  • frame_duration: 帧时长(默认20ms)
  • format: 音频格式(默认"opus")

支持的消息类型

语音识别

# 开始监听
{
    "type": "listen",
    "state": "start",
    "mode": "auto"  # auto/manual/realtime
}

# 停止监听
{
    "type": "listen",
    "state": "stop"
}

TTS状态回调

{
    "type": "tts",
    "state": "start|stop|sentence_start",
    "text": "要说的文本"  # 仅在 sentence_start 时存在
}

示例

  1. 基础文本对话 - examples/simple_client.py
  2. 实时语音对话 - examples/audio_chat.py

开发说明

音频处理

客户端发送和接收的音频数据都使用Opus编码:

  • 采样率:16000Hz
  • 声道数:1(单声道)
  • 帧大小:960样本/帧
  • 帧时长:20ms

错误处理

客户端会自动处理连接断开等错误:

  • WebSocket连接断开时会触发重连
  • 音频解码错误会被捕获并记录
  • 网络错误会抛出相应异常

依赖

  • websockets==10.4
  • opuslib==3.0.1
  • numpy==2.2.3
  • sounddevice==0.5.1
  • loguru==0.7.3
  • pyaudio==0.2.14

协议文档

完整的协议文档请参考API文档。主要包括:

  • 设备认证
  • 音频流协议
  • 消息类型定义
  • 错误处理机制

致谢

本项目参考和借鉴了以下优秀的开源项目:

xiaozhi-py - Python版本的小智客户端实现

xiaozhi-web-client - Web版本的小智客户端实现

感谢所有为此项目做出贡献的开发者和社区成员。

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xiaozhi_client-0.1.3.tar.gz (13.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

xiaozhi_client-0.1.3-py3-none-any.whl (12.7 kB view details)

Uploaded Python 3

File details

Details for the file xiaozhi_client-0.1.3.tar.gz.

File metadata

  • Download URL: xiaozhi_client-0.1.3.tar.gz
  • Upload date:
  • Size: 13.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for xiaozhi_client-0.1.3.tar.gz
Algorithm Hash digest
SHA256 6ac8b40f8118d37d25c7e91b2d1ffb0b43e6fbfd99a05c9ab3e3898ead37b205
MD5 c53916a9765bbe73dbb2c75ffe7523e1
BLAKE2b-256 13721264ad914c0fe661c8949afa03c10bc4205831170d74033c97554d8b1e63

See more details on using hashes here.

File details

Details for the file xiaozhi_client-0.1.3-py3-none-any.whl.

File metadata

  • Download URL: xiaozhi_client-0.1.3-py3-none-any.whl
  • Upload date:
  • Size: 12.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for xiaozhi_client-0.1.3-py3-none-any.whl
Algorithm Hash digest
SHA256 1ecf65d61c70e684abd52c5f7cab21e58e7be928f945ee3fde062eabcb40457b
MD5 68982b0bee787afe3ca22767ede536f1
BLAKE2b-256 73588df3fc973ba65598f8aaa53fa332c3f8578def765b3e81e9ca421dd6a077

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page