OpenAI-compatible HTTP server for OmniVoice TTS

omnivoice-server

License: MIT | Python 3.10+

OpenAI-compatible HTTP server for OmniVoice text-to-speech.

Author: zamery (@maemreyo) | Email: matthew.ngo1114@gmail.com

Early Development Notice

This is a new repository built on top of OmniVoice (released 2026). Both the upstream model and this server wrapper are under active development. Expect API changes, breaking updates, and performance improvements as PyTorch MPS support matures.

Current Status: Functional on CPU and CUDA. MPS (Apple Silicon) has known issues.

Quick Links

  • Getting Started: Features, Quick Start, Verification Status
  • Usage: API Usage, CLI Usage, Configuration
  • Reference: API Reference, Advanced Features, Examples
  • Deployment: Docker Deployment, Hardware Requirements, Performance
  • Development: Development, Troubleshooting, Known Limitations
  • Project: Documentation Index, License, Contributing, Acknowledgments, Support

Quick Start

Prerequisites: PyTorch must be installed before omnivoice-server. See the full Quick Start section in docs/readme/sections/ for details.

# Install
pip install omnivoice-server

# Start server
omnivoice-server

# Test with curl
curl -X POST http://127.0.0.1:8880/v1/audio/speech \
  -H "Content-Type: application/json" \
  -d '{"model": "omnivoice", "input": "Hello world!"}' \
  --output speech.wav
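
The same request can be issued from Python. The following is a minimal sketch using only the standard library. It assumes the server is running on the address shown in the curl example and that the endpoint returns the audio bytes in the response body. The helper names `build_speech_request` and `synthesize` are illustrative and not part of omnivoice-server.

```python
import json
import urllib.request

# Address taken from the curl example above; adjust if the server runs elsewhere.
SPEECH_URL = "http://127.0.0.1:8880/v1/audio/speech"


def build_speech_request(text, model="omnivoice", url=SPEECH_URL):
    """Build the same POST request that the curl example sends."""
    payload = json.dumps({"model": model, "input": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=payload,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize(text, out_path="speech.wav"):
    """Send the request and save the response body (hypothetical helper)."""
    with urllib.request.urlopen(build_speech_request(text)) as response:
        audio = response.read()
    with open(out_path, "wb") as f:
        f.write(audio)
    return out_path


# Usage (requires a running omnivoice-server instance):
#   synthesize("Hello world!")
```

Keeping request construction separate from the network call makes the payload easy to inspect without a running server.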

Overview

omnivoice-server wraps the OmniVoice TTS model with an OpenAI-compatible HTTP API:

  • Voice Design: Control gender, age, pitch, accent, dialect
  • Voice Cloning: Clone from reference audio
  • Streaming: Real-time audio streaming with chunked transfer
  • Voice Profiles: Persistent storage for cloned voices
  • OpenAI-Compatible: Drop-in replacement for OpenAI TTS endpoints

See Features for the complete capability list.
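
On the client side, a chunked response can be consumed incrementally instead of buffering the whole file. The sketch below shows only the generic read loop over a file-like response object (such as the one returned by `urllib.request.urlopen`). How streaming is requested from omnivoice-server is not covered here, and the function name and chunk size are illustrative assumptions.

```python
import io


def copy_in_chunks(response, sink, chunk_size=4096):
    """Copy a file-like response to sink chunk by chunk; return bytes written."""
    total = 0
    while True:
        chunk = response.read(chunk_size)
        if not chunk:  # empty bytes means the stream has ended
            break
        sink.write(chunk)
        total += len(chunk)
    return total


# Stand-in for an HTTP response body, so the loop can be tried without a server.
fake_response = io.BytesIO(b"\x00" * 10000)
buffer = io.BytesIO()
written = copy_in_chunks(fake_response, buffer)
print(written)  # prints 10000
```

In a real client, `sink` could be an open file or an audio playback buffer, so output can start before synthesis finishes.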

Verification Status

  • System: Working on CPU and CUDA
  • MPS: Broken on Apple Silicon (use CPU instead)
  • Performance: real-time factor (RTF) ~4.92 on CPU, ~0.2 on GPU (lower is faster)

See Verification Status for benchmarks and audio samples.
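
To put the RTF figures in perspective, the arithmetic below assumes the conventional definition of real-time factor (synthesis time divided by the duration of the generated audio). The exact measurement method behind these benchmarks is not stated here, and the function name is illustrative.

```python
def estimated_synthesis_seconds(audio_seconds, rtf):
    """Estimate synthesis time, assuming RTF = synthesis time / audio duration."""
    return audio_seconds * rtf


# RTF values quoted above; the 10-second clip length is an arbitrary example.
cpu_seconds = estimated_synthesis_seconds(10.0, 4.92)
gpu_seconds = estimated_synthesis_seconds(10.0, 0.2)
print(f"CPU: ~{cpu_seconds:.1f} s, GPU: ~{gpu_seconds:.1f} s")
```

Under that assumption, a 10-second clip would take roughly 49 seconds to synthesize on CPU and about 2 seconds on GPU, so only the GPU figure is faster than real time.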

Documentation

This README provides quick links to detailed documentation. For complete information, see:

  • Individual section files in docs/readme/sections/
  • Technical docs in docs/verification/, docs/system/, docs/architecture/

License

MIT - See License

Support
