OpenAI-compatible HTTP server for OmniVoice TTS
Project description
omnivoice-server
OpenAI-compatible HTTP server for OmniVoice text-to-speech.
Author: zamery (@maemreyo) | Email: matthew.ngo1114@gmail.com
Early Development Notice
This is a new repository built on top of OmniVoice (released 2026). Both the upstream model and this server wrapper are under active development. Expect API changes, breaking updates, and performance improvements as PyTorch MPS support matures.
Current Status: Functional on CPU and CUDA. MPS (Apple Silicon) has known issues.
Quick Links
| Category | Sections |
|---|---|
| Getting Started | Features - Quick Start - Verification Status |
| Usage | API Usage - CLI Usage - Configuration |
| Reference | API Reference - Advanced Features - Examples |
| Deployment | Docker Deployment - Hardware Requirements - Performance |
| Development | Development - Troubleshooting - Known Limitations |
| Project | Documentation Index - License - Contributing - Acknowledgments - Support |
Quick Start
Prerequisites: PyTorch must be installed first. See Quick Start for details.
# Install
pip install omnivoice-server
# Start server
omnivoice-server
# Test with curl
curl -X POST http://127.0.0.1:8880/v1/audio/speech \
-H "Content-Type: application/json" \
-d '{"model": "omnivoice", "input": "Hello world!"}' \
--output speech.wav
Overview
omnivoice-server wraps the OmniVoice TTS model with an OpenAI-compatible HTTP API:
- Voice Design: Control gender, age, pitch, accent, dialect
- Voice Cloning: Clone from reference audio
- Streaming: Real-time audio streaming with chunked transfer
- Voice Profiles: Persistent storage for cloned voices
- OpenAI-Compatible: Drop-in replacement for OpenAI TTS endpoints
See Features for complete capability list.
Verification Status
- System: Working on CPU and CUDA
- MPS: Broken on Apple Silicon (use CPU instead)
- Performance: RTF ~4.92 on CPU, ~0.2 on GPU
See Verification Status for benchmarks and audio samples.
Documentation
This README provides quick links to detailed documentation. For complete information, see:
- Individual section files in
docs/readme/sections/ - Technical docs in
docs/verification/,docs/system/,docs/architecture/
License
MIT - See License
Support
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file omnivoice_server-0.2.3.tar.gz.
File metadata
- Download URL: omnivoice_server-0.2.3.tar.gz
- Upload date:
- Size: 18.8 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
69fa38a3774510d099b445743f281f503fdfdb27ac717142c85a7e7427eeaaa7
|
|
| MD5 |
4e61305315e8bd24ccc406fbc1f86e58
|
|
| BLAKE2b-256 |
4caaa1b4cb9d9718efb53d901de57fe17ddccaf36124f6deab2655b3ed9b86b7
|
File details
Details for the file omnivoice_server-0.2.3-py3-none-any.whl.
File metadata
- Download URL: omnivoice_server-0.2.3-py3-none-any.whl
- Upload date:
- Size: 42.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d02422a74e0d6b8b0effa2c296a13bac27238abf2af93e8dd5124d6c469694ad
|
|
| MD5 |
f91294ab1d2cc3f8af6a78f825cb85f7
|
|
| BLAKE2b-256 |
739594e101de1a3b3f6367a9f3d4946e43f840267b6fc20ba905d431764a099c
|