Skip to main content

OpenAI-compatible HTTP server for OmniVoice TTS

Project description

omnivoice-server

License: MIT Python 3.10+ CI PyPI version

OpenAI-compatible HTTP server for OmniVoice text-to-speech.

Author: zamery (@maemreyo) | Email: matthew.ngo1114@gmail.com

Early Development Notice

This is a new repository built on top of OmniVoice (released 2026). Both the upstream model and this server wrapper are under active development. Expect API changes, breaking updates, and performance improvements as PyTorch MPS support matures.

Current Status: Functional on CPU and CUDA. MPS (Apple Silicon) has known issues.

Quick Links

Category Sections
Getting Started Features - Quick Start - Verification Status
Usage API Usage - CLI Usage - Configuration
Reference API Reference - Advanced Features - Examples
Deployment Docker Deployment - Hardware Requirements - Performance
Development Development - Troubleshooting - Known Limitations
Project Documentation Index - License - Contributing - Acknowledgments - Support

Quick Start

Prerequisites: PyTorch must be installed first. See Quick Start for details.

# Install
pip install omnivoice-server

# Start server
omnivoice-server

# Test with curl
curl -X POST http://127.0.0.1:8880/v1/audio/speech \
  -H "Content-Type: application/json" \
  -d '{"model": "omnivoice", "input": "Hello world!"}' \
  --output speech.wav

Overview

omnivoice-server wraps the OmniVoice TTS model with an OpenAI-compatible HTTP API:

  • Voice Design: Control gender, age, pitch, accent, dialect
  • Voice Cloning: Clone from reference audio
  • Streaming: Real-time audio streaming with chunked transfer
  • Voice Profiles: Persistent storage for cloned voices
  • OpenAI-Compatible: Drop-in replacement for OpenAI TTS endpoints

See Features for complete capability list.

Verification Status

  • System: Working on CPU and CUDA
  • MPS: Broken on Apple Silicon (use CPU instead)
  • Performance: RTF ~4.92 on CPU, ~0.2 on GPU

See Verification Status for benchmarks and audio samples.

Documentation

This README provides quick links to detailed documentation. For complete information, see:

  • Individual section files in docs/readme/sections/
  • Technical docs in docs/verification/, docs/system/, docs/architecture/

License

MIT - See License

Support

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

omnivoice_server-0.2.1.tar.gz (4.9 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

omnivoice_server-0.2.1-py3-none-any.whl (39.4 kB view details)

Uploaded Python 3

File details

Details for the file omnivoice_server-0.2.1.tar.gz.

File metadata

  • Download URL: omnivoice_server-0.2.1.tar.gz
  • Upload date:
  • Size: 4.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for omnivoice_server-0.2.1.tar.gz
Algorithm Hash digest
SHA256 5722cc5ff8438219cceb5891e51461764f2f0d87feb401017870787ca0914f33
MD5 10d2138bdfe6337590d3efb762b73898
BLAKE2b-256 133d959536b5350a2ed04bfa1f876e1e4fb50ccb77851b6af8ba79d303da82ca

See more details on using hashes here.

File details

Details for the file omnivoice_server-0.2.1-py3-none-any.whl.

File metadata

File hashes

Hashes for omnivoice_server-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 b08270f8ef116fbb6f7dd67d862f99fb73c68e87175786f5902f2d0d95439556
MD5 da2908e4fd3edc8caebaffb18dd7f4b2
BLAKE2b-256 bde5e94148ebe4c33ca5b1ff0ba92ff52f503817743e510e0eccb0154b8c172f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page