Skip to main content

Qwen3 TTS Service with Voice Cloning

Project description

Chichi Speech

A high-quality, voice-cloning TTS service powered by Qwen3.

Chichi Speech provides a robust REST API and CLI tools for text-to-speech synthesis, featuring efficient voice cloning capabilities. It is designed to be easily deployed or integrated into other AI agents and workflows.

Acknowledgement: This project is just a simple wrapper of Qwen3-TTS, the SOTA TTS model as of 2/5/2025.

Features

  • High Quality: Utilizes the Qwen3-TTS model for state-of-the-art speech synthesis.
  • Voice Cloning: Clone voices from reference audio files.
  • Efficient: Optimized for reusing voice prompts to minimize computation for repeated requests.
  • Standardized API: Simple REST API (/synthesize) for easy integration.
  • CLI Tools: Includes chichi-speech-server and chichi-speech-client for immediate use.

Installation

Prerequisites: python >= 3.10.

pip install chichi-speech

Usage

1. Start the Service

The service runs on port 9090 by default.

# Start the server (runs in foreground, use & for background or a separate terminal)
chichi-speech-server
# OR specify the port explicitly
chichi-speech-server --port 9090 --host 0.0.0.0
# OR specify your reference audio and text for voice cloning (Recommended)
chichi-speech-server --ref-audio /path/to/my/voice.wav --ref-text "caption of the reference audio"

2. Verify Service is Running

Check the health/docs:

curl http://localhost:9090/docs

3. Generate Speech

Use cURL:

curl -X POST "http://localhost:9090/synthesize" \
     -H "Content-Type: application/json" \
     -d '{
           "text": "Nice to meet you",
           "language": "English"
         }' \
     --output output/nice_to_meet.wav

Functionality

  • Endpoint: POST /synthesize
  • Default Port: 9090
  • Voice Cloning: Uses a pre-computed voice prompt from reference files to ensure the cloned voice is consistent and generation is fast.

Development

Install dev dependencies:

pip install -e ".[dev]"

Run tests:

pytest

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

chichi_speech-0.1.1-py3-none-any.whl (5.4 kB view details)

Uploaded Python 3

File details

Details for the file chichi_speech-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: chichi_speech-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 5.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for chichi_speech-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 d27803fe6ae3c53064ec8d781cab6abd6bd0c3079375ef13ea8eb3d87472c836
MD5 7447547479f84805945a037f54e4368d
BLAKE2b-256 f4268fe1fc410d74fb56ce32b6a4d482fbf470e567e3b1a38e467332c06d8fe0

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page