Skip to main content

No project description provided

Project description

OpenVoiceLab (Beta)

Documentation | Join Discord

[!IMPORTANT] OpenVoiceLab is currently in beta. Some things still need to be improved - especially in the finetuning process. Feedback and contributions are welcome!

A beginner-friendly interface for finetuning and running text-to-speech models. Currently supports VibeVoice.

What is this?

OpenVoiceLab provides a simple web interface for working with the VibeVoice text-to-speech model. You can:

  • Finetune models on your own voice data to create custom voices
  • Generate speech from text using pretrained or finetuned models
  • Experiment with models through an easy-to-use web UI

The goal is to make state-of-the-art voice synthesis accessible to anyone interested in exploring TTS technology, whether you're a developer, researcher, content creator, or hobbyist.

Requirements

Before you begin, make sure you have:

  • Python 3.9 or newer - Check your version with python3 --version
  • CUDA-compatible NVIDIA GPU - At least 16 GB of VRAM is recommended for training the 1.5B parameter model
    • For inference (generating speech), you can get by with less VRAM or even CPU-only mode, though it will be slower
  • Operating System - Linux, macOS, or Windows

Quick Start

The easiest way to get started is using the provided setup scripts. These will create a Python virtual environment, install all dependencies, and launch the interface.

Linux/macOS

./scripts/setup.sh
./scripts/run.sh

Windows

scripts\setup.bat
scripts\run.bat

After running these commands, the web interface will launch automatically. Open your browser and navigate to:

http://localhost:7860

If the browser doesn't open automatically, you can manually visit this address.

Manual Setup

If you prefer to set things up yourself, or if the scripts don't work on your system:

  1. Create a virtual environment (recommended to avoid conflicts with other Python packages):

    python3 -m venv venv
    
  2. Activate the virtual environment:

    source venv/bin/activate  # Linux/macOS
    venv\Scripts\activate     # Windows
    
  3. Install dependencies:

    pip install -r requirements.txt
    
  4. Launch the interface:

    python -m ovl.cli
    

    Then open your browser to http://localhost:7860

What's Next?

Once you have OpenVoiceLab running, you can:

  • Start with inference to generate speech from a pretrained model
  • Prepare your own voice dataset for finetuning
  • Experiment with different model parameters and training configurations

Detailed usage instructions are available in the interface itself.

Troubleshooting

Out of memory errors during training: Try reducing the batch size or using a smaller model variant.

CUDA not available: Make sure you have NVIDIA drivers and PyTorch with CUDA support installed. The setup scripts should handle this automatically.

Import errors: Ensure you've activated the virtual environment before running the CLI.

License

OpenVoiceLab is licensed under the BSD-3-Clause license. See the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openvoicelab-0.0.1.tar.gz (17.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

openvoicelab-0.0.1-py3-none-any.whl (23.0 kB view details)

Uploaded Python 3

File details

Details for the file openvoicelab-0.0.1.tar.gz.

File metadata

  • Download URL: openvoicelab-0.0.1.tar.gz
  • Upload date:
  • Size: 17.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.10

File hashes

Hashes for openvoicelab-0.0.1.tar.gz
Algorithm Hash digest
SHA256 c28c872f258b257b846b177b3152afbd3634e9a5222faae81f3e8e1c37d4acca
MD5 a18e8a14d3e47015ecef9a3f576a0b01
BLAKE2b-256 a34005bb243f0929963b5eb943959b3bc096ab9437dcd7d2a9ad91287383b22e

See more details on using hashes here.

File details

Details for the file openvoicelab-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: openvoicelab-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 23.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.0.1 CPython/3.12.10

File hashes

Hashes for openvoicelab-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3cfede0979186bfa89b2504769d0a4a9227346183fb37b10503a18999e7aa631
MD5 1c1c4129639d7e868d4fb21f15e83869
BLAKE2b-256 25ab2becf961a766186328d228f24fa06e0de380baa18ac8b9c57fac861ba653

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page