No project description provided

These details have not been verified by PyPI

Project description

OpenVoiceLab (Beta)

[!IMPORTANT] OpenVoiceLab is currently in beta. Some things still need to be improved - especially in the finetuning process. Feedback and contributions are welcome!

A beginner-friendly interface for finetuning and running text-to-speech models. Currently supports VibeVoice.

What is this?

OpenVoiceLab provides a simple web interface for working with the VibeVoice text-to-speech model. You can:

Finetune models on your own voice data to create custom voices
Generate speech from text using pretrained or finetuned models
Experiment with models through an easy-to-use web UI

The goal is to make state-of-the-art voice synthesis accessible to anyone interested in exploring TTS technology, whether you're a developer, researcher, content creator, or hobbyist.

Requirements

Before you begin, make sure you have:

Python 3.9 or newer - Check your version with python3 --version
CUDA-compatible NVIDIA GPU - At least 16 GB of VRAM is recommended for training the 1.5B parameter model
- For inference (generating speech), you can get by with less VRAM or even CPU-only mode, though it will be slower
Operating System - Linux, macOS, or Windows

Quick Start

The easiest way to get started is using the provided setup scripts. These will create a Python virtual environment, install all dependencies, and launch the interface.

Linux/macOS

./scripts/setup.sh
./scripts/run.sh

Windows

scripts\setup.bat
scripts\run.bat

After running these commands, the web interface will launch automatically. Open your browser and navigate to:

http://localhost:7860

If the browser doesn't open automatically, you can manually visit this address.

Manual Setup

If you prefer to set things up yourself, or if the scripts don't work on your system:

Create a virtual environment (recommended to avoid conflicts with other Python packages):
```
python3 -m venv venv
```

Activate the virtual environment:

source venv/bin/activate  # Linux/macOS
venv\Scripts\activate     # Windows

Install dependencies:
```
pip install -r requirements.txt
```
Launch the interface:
```
python -m ovl.cli
```
Then open your browser to http://localhost:7860

What's Next?

Once you have OpenVoiceLab running, you can:

Start with inference to generate speech from a pretrained model
Prepare your own voice dataset for finetuning
Experiment with different model parameters and training configurations

Detailed usage instructions are available in the interface itself.

Troubleshooting

Out of memory errors during training: Try reducing the batch size or using a smaller model variant.

CUDA not available: Make sure you have NVIDIA drivers and PyTorch with CUDA support installed. The setup scripts should handle this automatically.

Import errors: Ensure you've activated the virtual environment before running the CLI.

License

OpenVoiceLab is licensed under the BSD-3-Clause license. See the LICENSE file for details.

Project details

These details have not been verified by PyPI

Development Status
- 3 - Alpha
Intended Audience
- Developers
License
- OSI Approved :: MIT License
Programming Language

Release history Release notifications | RSS feed

This version

0.0.1

Oct 9, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

openvoicelab-0.0.1.tar.gz (17.8 kB view details)

Uploaded Oct 9, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

openvoicelab-0.0.1-py3-none-any.whl (23.0 kB view details)

Uploaded Oct 9, 2025 Python 3

File details

Details for the file openvoicelab-0.0.1.tar.gz.

File metadata

Download URL: openvoicelab-0.0.1.tar.gz
Upload date: Oct 9, 2025
Size: 17.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.12.10

File hashes

Hashes for openvoicelab-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`c28c872f258b257b846b177b3152afbd3634e9a5222faae81f3e8e1c37d4acca`
MD5	`a18e8a14d3e47015ecef9a3f576a0b01`
BLAKE2b-256	`a34005bb243f0929963b5eb943959b3bc096ab9437dcd7d2a9ad91287383b22e`

See more details on using hashes here.

File details

Details for the file openvoicelab-0.0.1-py3-none-any.whl.

File metadata

Download URL: openvoicelab-0.0.1-py3-none-any.whl
Upload date: Oct 9, 2025
Size: 23.0 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.0.1 CPython/3.12.10

File hashes

Hashes for openvoicelab-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3cfede0979186bfa89b2504769d0a4a9227346183fb37b10503a18999e7aa631`
MD5	`1c1c4129639d7e868d4fb21f15e83869`
BLAKE2b-256	`25ab2becf961a766186328d228f24fa06e0de380baa18ac8b9c57fac861ba653`

See more details on using hashes here.

openvoicelab 0.0.1

Navigation

Verified details

Maintainers

Unverified details

Meta

Classifiers

Project description

OpenVoiceLab (Beta)

What is this?

Requirements

Quick Start

Linux/macOS

Windows

Manual Setup

What's Next?

Troubleshooting

License

Project details

Verified details

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes