A serving system for speech language models.
Project description
VoxServe: a serving system for SpeechLMs
VoxServe is a serving system for Speech Language Models (SpeechLMs). VoxServe provides low-latency & high-throughput inference for language models trained for speech tokens, specifically text-to-speech (TTS) and speech-to-speech (STS) models.
News
- [2025-02] We released our paper: VoxServe: A Streaming-Centric Serving System for Speech Language Models
Usage
You can install VoxServe via pip:
pip install vox-serve
vox-serve --model <model-name> --port <port-number>
Or, you can clone the code and start the inference server with launch.py:
git clone https://github.com/vox-serve/vox-serve.git
cd vox-serve
python -m vox_serve.launch --model <model-name> --port <port-number>
And call the server like this:
# Generate audio from text
curl -X POST "http://localhost:<port-number>/generate" -F "text=Hello world" -F "streaming=true" -o output.wav
# For models supporting audio input
curl -X POST "http://localhost:<port-number>/generate" -F "text=Hello world" -F "@input.wav" -F "streaming=true" -o output.wav
We currently support the following TTS and STS models:
chatterbox: Chatterbox TTScosyvoice2: CosyVoice2-0.5Bcsm: CSM-1Borpheus: Orpheus-3Bqwen3-tts: Qwen3-TTS-1.7Bzonos: Zonos-v0.1glm: GLM-4-Voice-9Bstep: Step-Audio-2-Mini
And we are actively working on expanding the support.
./examples folder has more example usage.
Playground
VoxServe includes a web-based playground for interactive testing. Start the playground server and use the browser UI to manage VoxServe, generate audio, and view real-time logs.
See examples/playground/README.md for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vox_serve-0.1.0.tar.gz.
File metadata
- Download URL: vox_serve-0.1.0.tar.gz
- Upload date:
- Size: 272.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
98a369ed1da7abdff6512c6b211293b3c5025c2645d0537f50888e4291ecb0f8
|
|
| MD5 |
193f1a30c012be7a445a8ce63af3755c
|
|
| BLAKE2b-256 |
cf9e968150750d05850dcf6cd243cb4907eb0d4539ee89599f914c7cee33ef3a
|
Provenance
The following attestation bundles were made for vox_serve-0.1.0.tar.gz:
Publisher:
publish.yml on vox-serve/vox-serve
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
vox_serve-0.1.0.tar.gz -
Subject digest:
98a369ed1da7abdff6512c6b211293b3c5025c2645d0537f50888e4291ecb0f8 - Sigstore transparency entry: 929769682
- Sigstore integration time:
-
Permalink:
vox-serve/vox-serve@469c5b77854269a5eb99b558bb60551409ded16e -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/vox-serve
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@469c5b77854269a5eb99b558bb60551409ded16e -
Trigger Event:
release
-
Statement type:
File details
Details for the file vox_serve-0.1.0-py3-none-any.whl.
File metadata
- Download URL: vox_serve-0.1.0-py3-none-any.whl
- Upload date:
- Size: 302.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
85e9d39789a0efec9cdb8233b3802b03685afcc291565c01f86737af498c5ee8
|
|
| MD5 |
063dd11ab3074b677289bf3a1a0b8b27
|
|
| BLAKE2b-256 |
69a9b0e4e1499e7cbbf0a3267ab890188ee416057d16e0a385f92e1c16417fe7
|
Provenance
The following attestation bundles were made for vox_serve-0.1.0-py3-none-any.whl:
Publisher:
publish.yml on vox-serve/vox-serve
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
vox_serve-0.1.0-py3-none-any.whl -
Subject digest:
85e9d39789a0efec9cdb8233b3802b03685afcc291565c01f86737af498c5ee8 - Sigstore transparency entry: 929769683
- Sigstore integration time:
-
Permalink:
vox-serve/vox-serve@469c5b77854269a5eb99b558bb60551409ded16e -
Branch / Tag:
refs/tags/v0.1.0 - Owner: https://github.com/vox-serve
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@469c5b77854269a5eb99b558bb60551409ded16e -
Trigger Event:
release
-
Statement type: