Parallel WaveGAN implementation

These details have not been verified by PyPI

Project links

Homepage

Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Operating System
- POSIX :: Linux
Programming Language
- Python :: 3
Topic
- Software Development :: Libraries :: Python Modules

Project description

Parallel WaveGAN implementation with PyTorch

This repository provides an UNOFFICIAL Parallel WaveGAN implementation in PyTorch.

The goal of this repository is to provide a real-time neural vocoder that is compatible with ESPnet-TTS.
Audio samples and pretrained models will be made available on our Google Drive.

Source of the figure: https://arxiv.org/pdf/1910.11480.pdf

Requirements

This repository is tested on Ubuntu 16.04 with a Titan V GPU.

Python 3.6+
CUDA 10.0
cuDNN 7+

All of the code is tested on PyTorch 1.0.1, 1.1, 1.2, and 1.3.
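
As a quick sanity check against the requirements above, the following minimal sketch reports the local Python version and, if PyTorch is installed, its version and whether CUDA is visible. The helper name `check_environment` is hypothetical and is not part of this package; it only uses the standard library plus `torch.__version__` and `torch.cuda.is_available()`.

```python
import importlib.util
import sys


def check_environment(min_python=(3, 6)):
    """Return a small report on the local Python / PyTorch setup.

    Hypothetical helper for illustration; not part of parallel_wavegan.
    """
    report = {
        "python": ".".join(str(n) for n in sys.version_info[:3]),
        "python_ok": sys.version_info[:2] >= min_python,
        "torch": None,
        "cuda_available": None,
    }
    # Import torch only if it is installed, so a missing install is
    # reported as None instead of raising ImportError.
    if importlib.util.find_spec("torch") is not None:
        import torch

        report["torch"] = torch.__version__
        report["cuda_available"] = torch.cuda.is_available()
    return report


if __name__ == "__main__":
    for key, value in check_environment().items():
        print(f"{key}: {value}")
```

The report does not check the CUDA or cuDNN versions themselves; it only shows whether PyTorch can see a CUDA device.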

Setup

You can install the package in one of two ways.

A. Use pip

$ git clone https://github.com/kan-bayashi/ParallelWaveGAN.git
$ cd ParallelWaveGAN
$ pip install -e .

B. Make virtualenv

$ git clone https://github.com/kan-bayashi/ParallelWaveGAN.git
$ cd ParallelWaveGAN/tools
$ make
$ source venv/bin/activate

Run

This repository provides Kaldi-style recipes, in the same way as ESPnet.
Currently, three recipes are supported.

LJSpeech: English female speaker
JSUT: Japanese female speaker
CSMSC: Mandarin female speaker

To run a recipe, follow the instructions below.

# Move to the recipe directory
$ cd egs/ljspeech/voc1

# Run the recipe from scratch
$ ./run.sh

# You can select the stages at which to start and stop
$ ./run.sh --stage 2 --stop_stage 2
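
Kaldi-style scripts commonly run a numbered stage only when its number lies between the start and stop values, inclusive. The sketch below illustrates that convention, assuming this recipe follows it; the function `stages_to_run` and the stage numbers are hypothetical and are not taken from `run.sh`.

```python
def stages_to_run(stage, stop_stage, all_stages):
    """Return the stage numbers i with stage <= i <= stop_stage."""
    return [i for i in all_stages if stage <= i <= stop_stage]


# Hypothetical stage numbering, for illustration only.
ALL_STAGES = [0, 1, 2, 3]

print(stages_to_run(2, 2, ALL_STAGES))  # [2]
print(stages_to_run(1, 3, ALL_STAGES))  # [1, 2, 3]
```

Under this convention, passing the same number to `--stage` and `--stop_stage` runs that single stage only.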

All of the hyperparameters are written in a single YAML configuration file.
Please check this example in the LJSpeech recipe.
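
To illustrate how a single YAML file can hold all hyperparameters, here is a minimal sketch that parses YAML text with PyYAML's `yaml.safe_load` and overrides one value. The keys, values, and the helper `load_config` are made up for illustration; they are not copied from the recipe's configuration file or from this package's API.

```python
import yaml  # PyYAML

# Hypothetical configuration text; the keys and values are made up
# for illustration and are not copied from the recipe's file.
EXAMPLE_CONFIG = """
sampling_rate: 22050
batch_size: 8
optimizer:
  lr: 0.0001
"""


def load_config(text, **overrides):
    """Parse YAML text into a dict and apply top-level overrides."""
    config = yaml.safe_load(text)
    config.update(overrides)
    return config


config = load_config(EXAMPLE_CONFIG, batch_size=4)
print(config["sampling_rate"])    # 22050
print(config["batch_size"])       # 4
print(config["optimizer"]["lr"])  # 0.0001
```

Keeping every setting in one file means an experiment can be reproduced by saving that file alongside the trained model.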

Training is still ongoing. Please check the status at https://github.com/kan-bayashi/ParallelWaveGAN/issues/1.

Acknowledgement

The author would like to thank Ryuichi Yamamoto (@r9y9) for his great repository, paper and valuable discussions.

Author

Tomoki Hayashi (@kan-bayashi)
E-mail: hayashi.tomoki<at>g.sp.m.is.nagoya-u.ac.jp

Release history

- 0.6.1: Nov 13, 2023
- 0.5.5: May 17, 2022
- 0.5.4: Feb 10, 2022
- 0.5.3: Aug 26, 2021
- 0.5.2: Aug 24, 2021
- 0.5.1: Aug 8, 2021
- 0.5.0: Aug 7, 2021
- 0.4.8: Nov 2, 2020
- 0.4.7: Oct 6, 2020
- 0.4.6: Aug 31, 2020
- 0.4.5: Aug 18, 2020
- 0.4.4: Aug 18, 2020
- 0.4.3: Aug 16, 2020
- 0.4.2: Aug 15, 2020
- 0.4.1: Jun 28, 2020
- 0.4.0: May 22, 2020
- 0.3.5: Apr 24, 2020
- 0.3.4: Mar 9, 2020
- 0.3.3: Mar 7, 2020
- 0.3.2: Mar 6, 2020
- 0.3.1.post1: Feb 28, 2020
- 0.3.1: Feb 26, 2020
- 0.3.0: Feb 15, 2020
- 0.2.8.post1: Feb 8, 2020
- 0.2.7.post1: Jan 14, 2020
- 0.2.7: Jan 14, 2020
- 0.2.6.post1: Dec 26, 2019
- 0.2.6: Dec 25, 2019
- 0.2.5.post2: Nov 25, 2019
- 0.2.5.post1: Nov 24, 2019
- 0.2.5: Nov 13, 2019
- 0.2.4.post2: Nov 12, 2019
- 0.2.4.post1: Nov 12, 2019
- 0.2.4: Nov 11, 2019
- 0.2.3: Nov 8, 2019
- 0.2.2.post1: Nov 6, 2019
- 0.2.2: Nov 5, 2019
- 0.2.1.post3: Nov 5, 2019
- 0.2.1.post2: Nov 4, 2019
- 0.2.1: Nov 4, 2019 (this version)

Download files

Source Distribution

parallel_wavegan-0.2.1.tar.gz (13.2 kB)

Uploaded Nov 4, 2019 Source

Hashes for parallel_wavegan-0.2.1.tar.gz
Algorithm	Hash digest
SHA256	`73a0726fdda89379e667bfbff9d35853b331cc753248912331b3c8010f593194`
MD5	`cab57fd5176f234f44326778679feefb`
BLAKE2b-256	`f1404d8142efd3bcd4519753a2c17d154efae4abe858ea8ab9dccd07d9443a38`