Skip to main content

OpenAI-compatible HTTP server for OmniVoice TTS

Project description

omnivoice-server

License: MIT Python 3.10+ CI PyPI version

OpenAI-compatible HTTP server for OmniVoice text-to-speech.

Author: zamery (@maemreyo) | Email: matthew.ngo1114@gmail.com

Early Development Notice

This is a new repository built on top of OmniVoice (released 2026). Both the upstream model and this server wrapper are under active development. Expect API changes, breaking updates, and performance improvements as PyTorch MPS support matures.

Current Status: Functional on CPU and CUDA. MPS (Apple Silicon) has known issues.

Quick Links

Category Sections
Getting Started Features - Quick Start - Verification Status
Usage API Usage - CLI Usage - Configuration
Reference API Reference - Advanced Features - Examples
Deployment Docker Deployment - Hardware Requirements - Performance
Development Development - Troubleshooting - Known Limitations
Project Documentation Index - License - Contributing - Acknowledgments - Support

Quick Start

Prerequisites: PyTorch must be installed first. See Quick Start for details.

# Install
pip install omnivoice-server

# Start server
omnivoice-server

# Test with curl
curl -X POST http://127.0.0.1:8880/v1/audio/speech \
  -H "Content-Type: application/json" \
  -d '{"model": "omnivoice", "input": "Hello world!"}' \
  --output speech.wav

Overview

omnivoice-server wraps the OmniVoice TTS model with an OpenAI-compatible HTTP API:

  • Voice Design: Control gender, age, pitch, accent, dialect
  • Voice Cloning: Clone from reference audio
  • Streaming: Real-time audio streaming with chunked transfer
  • Voice Profiles: Persistent storage for cloned voices
  • OpenAI-Compatible: Drop-in replacement for OpenAI TTS endpoints

See Features for complete capability list.

Verification Status

  • System: Working on CPU and CUDA
  • MPS: Broken on Apple Silicon (use CPU instead)
  • Performance: RTF ~4.92 on CPU, ~0.2 on GPU

See Verification Status for benchmarks and audio samples.

Documentation

This README provides quick links to detailed documentation. For complete information, see:

  • Individual section files in docs/readme/sections/
  • Technical docs in docs/verification/, docs/system/, docs/architecture/

License

MIT - See License

Support

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

omnivoice_server-0.2.3.tar.gz (18.8 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

omnivoice_server-0.2.3-py3-none-any.whl (42.8 kB view details)

Uploaded Python 3

File details

Details for the file omnivoice_server-0.2.3.tar.gz.

File metadata

  • Download URL: omnivoice_server-0.2.3.tar.gz
  • Upload date:
  • Size: 18.8 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omnivoice_server-0.2.3.tar.gz
Algorithm Hash digest
SHA256 69fa38a3774510d099b445743f281f503fdfdb27ac717142c85a7e7427eeaaa7
MD5 4e61305315e8bd24ccc406fbc1f86e58
BLAKE2b-256 4caaa1b4cb9d9718efb53d901de57fe17ddccaf36124f6deab2655b3ed9b86b7

See more details on using hashes here.

File details

Details for the file omnivoice_server-0.2.3-py3-none-any.whl.

File metadata

File hashes

Hashes for omnivoice_server-0.2.3-py3-none-any.whl
Algorithm Hash digest
SHA256 d02422a74e0d6b8b0effa2c296a13bac27238abf2af93e8dd5124d6c469694ad
MD5 f91294ab1d2cc3f8af6a78f825cb85f7
BLAKE2b-256 739594e101de1a3b3f6367a9f3d4946e43f840267b6fc20ba905d431764a099c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page