Qwen3 TTS Service with Voice Cloning

Project description

Chichi Speech

A high-quality, voice-cloning TTS service powered by Qwen3.

Chichi Speech provides a robust REST API and CLI tools for text-to-speech synthesis, featuring efficient voice cloning capabilities. It is designed to be easily deployed or integrated into other AI agents and workflows.

Features

High Quality: Utilizes the Qwen3-TTS model for state-of-the-art speech synthesis.
Voice Cloning: Clone voices from reference audio files.
Efficient: Optimized for reusing voice prompts to minimize computation for repeated requests.
Standardized API: Simple REST API (/synthesize) for easy integration.
CLI Tools: Includes chichi-speech-server and chichi-speech-client for immediate use.

Installation

Prerequisites: git, uv, python >= 3.10.

export CHICHI_SPEECH_HOME="~/chichi-speech/"
export CHICHI_SPEECH_ENV="~/chichi-speech/.venv"
git clone https://github.com/yourusername/chichi-speech.git $CHICHI_SPEECH_HOME
cd $CHICHI_SPEECH_HOME

uv venv $CHICHI_SPEECH_ENV --python 3.10
source $CHICHI_SPEECH_ENV/bin/activate

uv pip install -e .

Usage

1. Start the Service

The service runs on port 9090 by default.

# Start the server (runs in foreground, use & for background or a separate terminal)
source $$CHICHI_SPEECH_ENV/bin/activate
chichi-speech-server
# OR specify the port explicitly
chichi-speech-server --port 9090 --host 0.0.0.0
# OR specify your reference audio and text for voice cloning (Recommended)
chichi-speech-server --ref-audio /path/to/my/voice.wav --ref-text "caption of the reference audio"

2. Verify Service is Running

Check the health/docs:

curl http://localhost:9090/docs

3. Generate Speech

Use cURL:

curl -X POST "http://localhost:9090/synthesize" \
     -H "Content-Type: application/json" \
     -d '{
           "text": "Nice to meet you",
           "language": "English"
         }' \
     --output output/nice_to_meet.wav

Functionality

Endpoint: POST /synthesize
Default Port: 9090
Voice Cloning: Uses a pre-computed voice prompt from reference files to ensure the cloned voice is consistent and generation is fast.

Development

Install dev dependencies:

uv pip install -e ".[dev]"

Run tests:

pytest

License

MIT

Project details

Release history Release notifications | RSS feed

0.1.1

Feb 6, 2026

This version

0.1.0

Feb 6, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

chichi_speech-0.1.0-py3-none-any.whl (5.4 kB view details)

Uploaded Feb 6, 2026 Python 3

File details

Details for the file chichi_speech-0.1.0-py3-none-any.whl.

File metadata

Download URL: chichi_speech-0.1.0-py3-none-any.whl
Upload date: Feb 6, 2026
Size: 5.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.12

File hashes

Hashes for chichi_speech-0.1.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6fea5b622a718dda9dd0a1816443520cc79fc463ab9ad39a7d431606cd63f046`
MD5	`91b465b7201531c1970f2d402aeb9043`
BLAKE2b-256	`2044df1fd6ef96eb149017ea7b2d6c192359b62f310f5b6e69f46697e39ea2d4`

See more details on using hashes here.

chichi-speech 0.1.0

Navigation

Verified details

Maintainers

Unverified details

Meta