omnivoice-server

License: MIT | Python 3.10+

OpenAI-compatible HTTP server for OmniVoice text-to-speech.

Author: zamery (@maemreyo) | Email: matthew.ngo1114@gmail.com

Early Development Notice

This is a new repository built on top of OmniVoice (released 2026). Both the upstream model and this server wrapper are under active development. Expect API changes and breaking updates, along with performance improvements as PyTorch MPS support matures.

Current Status: Functional on CPU and CUDA. MPS (Apple Silicon) has known issues.

Quick Links

  • Getting Started: Features, Quick Start, Verification Status
  • Usage: API Usage, CLI Usage, Configuration
  • Reference: API Reference, Advanced Features, Examples
  • Deployment: Docker Deployment, Hardware Requirements, Performance
  • Development: Development, Troubleshooting, Known Limitations
  • Project: Documentation Index, License, Contributing, Acknowledgments, Support

Quick Start

Prerequisite: PyTorch must be installed before omnivoice-server. See the full Quick Start guide for details.

# Install
pip install omnivoice-server

# Start server
omnivoice-server

# Test with curl
curl -X POST http://127.0.0.1:8880/v1/audio/speech \
  -H "Content-Type: application/json" \
  -d '{"model": "omnivoice", "input": "Hello world!"}' \
  --output speech.wav
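The same request can be sent from Python. The following is a minimal sketch using only the standard library; it reuses the URL, headers, and JSON body from the curl command above. The helper names (build_speech_request, synthesize_to_file) and the 300-second timeout are illustrative assumptions, not part of omnivoice-server.

```python
import json
import urllib.request

# Endpoint copied from the curl example; change it if the server
# listens on a different host or port.
SPEECH_URL = "http://127.0.0.1:8880/v1/audio/speech"


def build_speech_request(text, url=SPEECH_URL, model="omnivoice"):
    """Build the POST request for the speech endpoint without sending it."""
    body = json.dumps({"model": model, "input": text}).encode("utf-8")
    return urllib.request.Request(
        url,
        data=body,
        headers={"Content-Type": "application/json"},
        method="POST",
    )


def synthesize_to_file(text, path="speech.wav", timeout=300):
    """Send the request and write the returned audio bytes to `path`.

    The timeout is an arbitrary, generous value (hypothetical);
    CPU synthesis can be slow.
    """
    request = build_speech_request(text)
    with urllib.request.urlopen(request, timeout=timeout) as response:
        audio = response.read()
    with open(path, "wb") as f:
        f.write(audio)
    return len(audio)
```

With the server running, synthesize_to_file("Hello world!") should write speech.wav in the current directory, mirroring the curl command.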

Overview

omnivoice-server wraps the OmniVoice TTS model with an OpenAI-compatible HTTP API:

  • Voice Design: Control gender, age, pitch, accent, dialect
  • Voice Cloning: Clone from reference audio
  • Streaming: Real-time audio streaming with chunked transfer
  • Voice Profiles: Persistent storage for cloned voices
  • OpenAI-Compatible: Drop-in replacement for OpenAI TTS endpoints

See Features for complete capability list.
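On the client side, a chunked response is usually consumed incrementally rather than read in one call. The sketch below shows that pattern with a generic helper, assuming the response is available as a file-like object. The helper name copy_stream and the 4096-byte chunk size are illustrative assumptions; the request parameters that enable streaming on the server are not shown here and should be taken from the API documentation.

```python
import io


def copy_stream(source, sink, chunk_size=4096):
    """Copy bytes from `source` to `sink` one chunk at a time.

    Both arguments are file-like objects. Returns the total number
    of bytes copied.
    """
    total = 0
    while True:
        chunk = source.read(chunk_size)
        if not chunk:  # empty bytes means the stream has ended
            break
        sink.write(chunk)
        total += len(chunk)
    return total


# Self-contained demonstration with in-memory buffers.
demo_source = io.BytesIO(b"\x00" * 10000)
demo_sink = io.BytesIO()
copied = copy_stream(demo_source, demo_sink)
```

The object returned by urllib.request.urlopen is file-like and decodes chunked transfer encoding transparently, so it can be passed as source with an open binary file as sink.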

Verification Status

  • System: Working on CPU and CUDA
  • MPS: Broken on Apple Silicon (use CPU instead)
  • Performance: real-time factor (RTF) ~4.92 on CPU, ~0.2 on GPU (lower is faster)

See Verification Status for benchmarks and audio samples.
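To put the RTF figures in concrete terms, the sketch below assumes the common definition of real-time factor: synthesis time divided by the duration of the generated audio. The README does not state its definition, so treat this as an assumption; the function names are illustrative.

```python
def real_time_factor(synthesis_seconds, audio_seconds):
    """RTF under the assumed definition: processing time / audio duration."""
    if audio_seconds <= 0:
        raise ValueError("audio_seconds must be positive")
    return synthesis_seconds / audio_seconds


def estimated_synthesis_seconds(audio_seconds, rtf):
    """Estimate how long it takes to synthesize a clip of the given duration."""
    return audio_seconds * rtf


# Rough estimates for a 10-second clip using the figures quoted above.
cpu_estimate = estimated_synthesis_seconds(10.0, 4.92)  # about 49 seconds
gpu_estimate = estimated_synthesis_seconds(10.0, 0.2)   # about 2 seconds
```

Under this definition, an RTF below 1 means audio is generated faster than it plays back, which is what real-time streaming needs.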

Documentation

This README provides quick links to detailed documentation. For complete information, see:

  • Individual section files in docs/readme/sections/
  • Technical docs in docs/verification/, docs/system/, docs/architecture/

License

MIT - See License

Support
