Qwen3 TTS Service with Voice Cloning

Chichi Speech

A high-quality, voice-cloning TTS service powered by Qwen3.

Chichi Speech provides a REST API and CLI tools for text-to-speech synthesis with voice cloning. It is designed to be deployed on its own or integrated into AI agents and other workflows.

Features

  • High Quality: Utilizes the Qwen3-TTS model for state-of-the-art speech synthesis.
  • Voice Cloning: Clone voices from reference audio files.
  • Efficient: Reuses the voice prompt across requests, so repeated requests avoid recomputing it.
  • Standardized API: Simple REST API (/synthesize) for easy integration.
  • CLI Tools: Includes chichi-speech-server and chichi-speech-client for immediate use.

Installation

Prerequisites: git, uv, Python >= 3.10.

# Use $HOME rather than ~, since ~ is not expanded inside double quotes
export CHICHI_SPEECH_HOME="$HOME/chichi-speech"
export CHICHI_SPEECH_ENV="$CHICHI_SPEECH_HOME/.venv"
git clone https://github.com/yourusername/chichi-speech.git "$CHICHI_SPEECH_HOME"
cd "$CHICHI_SPEECH_HOME"

uv venv "$CHICHI_SPEECH_ENV" --python 3.10
source "$CHICHI_SPEECH_ENV/bin/activate"

uv pip install -e .

Usage

1. Start the Service

The service runs on port 9090 by default.

# Start the server (runs in the foreground; append & to run it in the background, or use a separate terminal)
source "$CHICHI_SPEECH_ENV/bin/activate"
chichi-speech-server
# OR specify the port explicitly
chichi-speech-server --port 9090 --host 0.0.0.0
# OR specify your reference audio and text for voice cloning (Recommended)
chichi-speech-server --ref-audio /path/to/my/voice.wav --ref-text "caption of the reference audio"

2. Verify Service is Running

Request the API docs page; a successful response means the server is up:

curl http://localhost:9090/docs
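
When the server is started from a script, it can help to wait until it answers before sending requests. The following is a minimal sketch using only the Python standard library. It assumes that /docs returns HTTP 200 once the server is ready; the function name wait_for_server and its parameters are illustrative and not part of Chichi Speech.

```python
import time
import urllib.request


def wait_for_server(url: str = "http://localhost:9090/docs",
                    attempts: int = 30,
                    delay: float = 1.0,
                    timeout: float = 2.0) -> bool:
    """Poll `url` until it answers with HTTP 200 or the attempts run out."""
    for attempt in range(attempts):
        try:
            with urllib.request.urlopen(url, timeout=timeout) as response:
                if response.status == 200:
                    return True
        except OSError:
            # Covers refused connections, timeouts, and HTTP error statuses
            # (urllib's URLError and HTTPError both derive from OSError).
            pass
        if attempt < attempts - 1:
            time.sleep(delay)
    return False
```

A caller might run wait_for_server() right after launching chichi-speech-server and abort if it returns False.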

3. Generate Speech

Use cURL:

curl -X POST "http://localhost:9090/synthesize" \
     -H "Content-Type: application/json" \
     -d '{
           "text": "Nice to meet you",
           "language": "English"
         }' \
     --create-dirs --output output/nice_to_meet.wav
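
The same request can be sent from Python. This is a sketch using only the standard library, not the project's own client. It assumes, as the cURL example suggests, that the response body is the raw audio data; the helper names build_synthesize_request and synthesize_to_file are hypothetical.

```python
import json
import urllib.request
from pathlib import Path


def build_synthesize_request(text: str,
                             language: str = "English",
                             base_url: str = "http://localhost:9090") -> urllib.request.Request:
    """Build the POST /synthesize request with the JSON body from the cURL example."""
    payload = json.dumps({"text": text, "language": language}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url.rstrip('/')}/synthesize",
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize_to_file(text: str,
                       out_path: str,
                       language: str = "English",
                       base_url: str = "http://localhost:9090",
                       timeout: float = 120.0) -> Path:
    """Send the request and write the response body to `out_path`.

    Assumption: the server returns the audio bytes directly in the body.
    """
    request = build_synthesize_request(text, language, base_url)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        audio = response.read()
    out = Path(out_path)
    out.parent.mkdir(parents=True, exist_ok=True)  # create output/ if missing
    out.write_bytes(audio)
    return out
```

With the server running, synthesize_to_file("Nice to meet you", "output/nice_to_meet.wav") would mirror the cURL call above.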

Functionality

  • Endpoint: POST /synthesize
  • Default Port: 9090
  • Voice Cloning: Uses a pre-computed voice prompt from reference files to ensure the cloned voice is consistent and generation is fast.
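
The idea of reusing a pre-computed voice prompt can be illustrated with a small memoization sketch. This is not the project's actual implementation: build_prompt is a hypothetical stand-in for whatever step turns the reference audio and its text into a voice prompt, and make_cached_prompt_builder is an illustrative name.

```python
from functools import lru_cache
from typing import Any, Callable


def make_cached_prompt_builder(build_prompt: Callable[[str, str], Any],
                               maxsize: int = 8) -> Callable[[str, str], Any]:
    """Wrap `build_prompt` so each (ref_audio_path, ref_text) pair is computed once."""

    @lru_cache(maxsize=maxsize)
    def cached(ref_audio_path: str, ref_text: str) -> Any:
        # The expensive step runs only on a cache miss; later calls with the
        # same arguments return the stored prompt object.
        return build_prompt(ref_audio_path, ref_text)

    return cached
```

In this sketch the cache is keyed on the file path and text, so replacing the audio file under the same path would not invalidate the stored prompt.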

Development

Install dev dependencies:

uv pip install -e ".[dev]"

Run tests:

pytest

License

MIT
