No project description provided
Project description
OpenVoiceLab (Beta)
[!IMPORTANT] OpenVoiceLab is currently in beta. Some things still need to be improved - especially in the finetuning process. Feedback and contributions are welcome!
A beginner-friendly interface for finetuning and running text-to-speech models. Currently supports VibeVoice.
What is this?
OpenVoiceLab provides a simple web interface for working with the VibeVoice text-to-speech model. You can:
- Finetune models on your own voice data to create custom voices
- Generate speech from text using pretrained or finetuned models
- Experiment with models through an easy-to-use web UI
The goal is to make state-of-the-art voice synthesis accessible to anyone interested in exploring TTS technology, whether you're a developer, researcher, content creator, or hobbyist.
Requirements
Before you begin, make sure you have:
- Python 3.9 or newer - Check your version with
python3 --version - CUDA-compatible NVIDIA GPU - At least 16 GB of VRAM is recommended for training the 1.5B parameter model
- For inference (generating speech), you can get by with less VRAM or even CPU-only mode, though it will be slower
- Operating System - Linux, macOS, or Windows
Quick Start
The easiest way to get started is using the provided setup scripts. These will create a Python virtual environment, install all dependencies, and launch the interface.
Linux/macOS
./scripts/setup.sh
./scripts/run.sh
Windows
scripts\setup.bat
scripts\run.bat
After running these commands, the web interface will launch automatically. Open your browser and navigate to:
http://localhost:7860
If the browser doesn't open automatically, you can manually visit this address.
Manual Setup
If you prefer to set things up yourself, or if the scripts don't work on your system:
-
Create a virtual environment (recommended to avoid conflicts with other Python packages):
python3 -m venv venv
-
Activate the virtual environment:
source venv/bin/activate # Linux/macOS venv\Scripts\activate # Windows
-
Install dependencies:
pip install -r requirements.txt
-
Launch the interface:
python -m ovl.cli
Then open your browser to
http://localhost:7860
What's Next?
Once you have OpenVoiceLab running, you can:
- Start with inference to generate speech from a pretrained model
- Prepare your own voice dataset for finetuning
- Experiment with different model parameters and training configurations
Detailed usage instructions are available in the interface itself.
Troubleshooting
Out of memory errors during training: Try reducing the batch size or using a smaller model variant.
CUDA not available: Make sure you have NVIDIA drivers and PyTorch with CUDA support installed. The setup scripts should handle this automatically.
Import errors: Ensure you've activated the virtual environment before running the CLI.
License
OpenVoiceLab is licensed under the BSD-3-Clause license. See the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file openvoicelab-0.0.1.tar.gz.
File metadata
- Download URL: openvoicelab-0.0.1.tar.gz
- Upload date:
- Size: 17.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c28c872f258b257b846b177b3152afbd3634e9a5222faae81f3e8e1c37d4acca
|
|
| MD5 |
a18e8a14d3e47015ecef9a3f576a0b01
|
|
| BLAKE2b-256 |
a34005bb243f0929963b5eb943959b3bc096ab9437dcd7d2a9ad91287383b22e
|
File details
Details for the file openvoicelab-0.0.1-py3-none-any.whl.
File metadata
- Download URL: openvoicelab-0.0.1-py3-none-any.whl
- Upload date:
- Size: 23.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.12.10
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3cfede0979186bfa89b2504769d0a4a9227346183fb37b10503a18999e7aa631
|
|
| MD5 |
1c1c4129639d7e868d4fb21f15e83869
|
|
| BLAKE2b-256 |
25ab2becf961a766186328d228f24fa06e0de380baa18ac8b9c57fac861ba653
|