Command-line interface (CLI) to train WaveGlow using .wav files.
Project description
waveglow-cli
Command-line interface (CLI) to train WaveGlow using .wav files.
Features
- train/synthesize on CPU or GPU
- download pre-trained models by Nvidia
Installation
pip install waveglow-cli --user
Usage
usage: waveglow-cli [-h] [-v] {download,train,continue-train,validate,synthesize,synthesize-wav} ...
This program trains WaveGlow.
positional arguments:
{download,train,continue-train,validate,synthesize,synthesize-wav}
description
download download pre-trained checkpoint from Nvidia
train start training
continue-train continue training
validate validate checkpoint(s)
synthesize synthesize mel-spectrograms into an audio signal
synthesize-wav synthesize audio file into an audio signal
options:
-h, --help show this help message and exit
-v, --version show program's version number and exit
Pretrained Models
- LJS-v3-580000: Adapted model trained on LJ Speech dataset by Nvidia.
Audio Example
"The North Wind and the Sun were disputing which was the stronger, when a traveler came along wrapped in a warm cloak." Listen here (headphones recommended)
Roadmap
- Outsource method to convert audio files to mel-spectrograms before training
- Improve logging
- Add more audio examples
- Adding tests
Development setup
# update
sudo apt update
# install Python 3.8-3.11 for ensuring that tests can be run
sudo apt install python3-pip \
python3.8 python3.8-dev python3.8-distutils python3.8-venv \
python3.9 python3.9-dev python3.9-distutils python3.9-venv \
python3.10 python3.10-dev python3.10-distutils python3.10-venv \
python3.11 python3.11-dev python3.11-distutils python3.11-venv
# install pipenv for creation of virtual environments
python3.8 -m pip install pipenv --user
# check out repo
git clone https://github.com/stefantaubert/waveglow.git
cd waveglow
# create virtual environment
python3.8 -m pipenv install --dev
Running the tests
# first install the tool like in "Development setup"
# then, navigate into the directory of the repo (if not already done)
cd waveglow
# activate environment
python3.8 -m pipenv shell
# run tests
tox
Final lines of test result output:
py38: commands succeeded
py39: commands succeeded
py310: commands succeeded
py311: commands succeeded
congratulations :)
License
MIT License
Acknowledgments
Model code adapted from Nvidia.
Paper:
Funded by the Deutsche Forschungsgemeinschaft (DFG, German Research Foundation) – Project-ID 416228727 – CRC 1410
Citation
If you want to cite this repo, you can use the BibTeX-entry generated by GitHub (see About => Cite this repository).
Taubert, S. (2024). waveglow-cli (Version 0.0.2) [Computer software]. [https://doi.org/10.5281/zenodo.10569141](https://doi.org/10.5281/zenodo.10569141)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file waveglow-cli-0.0.2.tar.gz
.
File metadata
- Download URL: waveglow-cli-0.0.2.tar.gz
- Upload date:
- Size: 428.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 09f770d9dbf6ea0970fdc9abec332662c3650e6712f57e1103bc52ff8a34c3f3 |
|
MD5 | e0982eed14573ef23903989e04628786 |
|
BLAKE2b-256 | 0c60679d27aca5083d9bbfad9dc9d44faf907f4225787e90cefc2bc4da023405 |
File details
Details for the file waveglow_cli-0.0.2-py3-none-any.whl
.
File metadata
- Download URL: waveglow_cli-0.0.2-py3-none-any.whl
- Upload date:
- Size: 60.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5d50a68bcef9d5923d6a5d67ad17a2e37d11b0b38276d838de763c6543d97279 |
|
MD5 | 78c151a666d378ba3e0f12dcdd6b1a27 |
|
BLAKE2b-256 | 8d36a251676f73d6c695f385b3870b96e80d243fbe0db17cd6fc7715b9656cb3 |