Skip to main content

A serving system for speech language models.

Project description

VoxServe: a serving system for SpeechLMs

VoxServe Logo

arXiv Documentation

VoxServe is a serving system for Speech Language Models (SpeechLMs). VoxServe provides low-latency & high-throughput inference for language models trained for speech tokens, specifically text-to-speech (TTS) and speech-to-speech (STS) models.

News

Usage

You can install VoxServe via pip:

pip install vox-serve 
vox-serve --model <model-name> --port <port-number>

Or, you can clone the code and start the inference server with launch.py:

git clone https://github.com/vox-serve/vox-serve.git
cd vox-serve
python -m vox_serve.launch --model <model-name> --port <port-number>

And call the server like this:

# Generate audio from text
curl -X POST "http://localhost:<port-number>/generate" -F "text=Hello world" -F "streaming=true" -o output.wav

# For models supporting audio input
curl -X POST "http://localhost:<port-number>/generate" -F "text=Hello world" -F "@input.wav" -F "streaming=true" -o output.wav

We currently support the following TTS and STS models:

And we are actively working on expanding the support.

./examples folder has more example usage.

Playground

VoxServe includes a web-based playground for interactive testing. Start the playground server and use the browser UI to manage VoxServe, generate audio, and view real-time logs.

VoxServe Playground

See examples/playground/README.md for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vox_serve-0.1.0.tar.gz (272.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vox_serve-0.1.0-py3-none-any.whl (302.8 kB view details)

Uploaded Python 3

File details

Details for the file vox_serve-0.1.0.tar.gz.

File metadata

  • Download URL: vox_serve-0.1.0.tar.gz
  • Upload date:
  • Size: 272.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vox_serve-0.1.0.tar.gz
Algorithm Hash digest
SHA256 98a369ed1da7abdff6512c6b211293b3c5025c2645d0537f50888e4291ecb0f8
MD5 193f1a30c012be7a445a8ce63af3755c
BLAKE2b-256 cf9e968150750d05850dcf6cd243cb4907eb0d4539ee89599f914c7cee33ef3a

See more details on using hashes here.

Provenance

The following attestation bundles were made for vox_serve-0.1.0.tar.gz:

Publisher: publish.yml on vox-serve/vox-serve

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file vox_serve-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: vox_serve-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 302.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for vox_serve-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 85e9d39789a0efec9cdb8233b3802b03685afcc291565c01f86737af498c5ee8
MD5 063dd11ab3074b677289bf3a1a0b8b27
BLAKE2b-256 69a9b0e4e1499e7cbbf0a3267ab890188ee416057d16e0a385f92e1c16417fe7

See more details on using hashes here.

Provenance

The following attestation bundles were made for vox_serve-0.1.0-py3-none-any.whl:

Publisher: publish.yml on vox-serve/vox-serve

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page