Parallel WaveGAN implementation

Parallel WaveGAN implementation with PyTorch

This repository provides an UNOFFICIAL Parallel WaveGAN implementation in PyTorch.

The goal of this repository is to provide a real-time neural vocoder that is compatible with ESPnet-TTS.
Audio samples and pretrained models will be made available on our Google Drive.

Source of the figure: https://arxiv.org/pdf/1910.11480.pdf

Requirements

This repository is tested on Ubuntu 16.04 with a Titan V GPU.

  • Python 3.6+
  • CUDA 10.0
  • cuDNN 7+

All of the code is tested on PyTorch 1.0.1, 1.1, 1.2, and 1.3.
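
To compare a local environment against the versions listed above, a small check along the following lines may help. This is a minimal sketch: the helper names python_ok and pytorch_tested are hypothetical and not part of this repository, and the version lists are simply copied from this section.

```python
import sys

# Versions listed as tested in the Requirements section.
TESTED_PYTORCH = ("1.0.1", "1.1", "1.2", "1.3")
MIN_PYTHON = (3, 6)


def python_ok(version_info=sys.version_info):
    """Return True if the interpreter is Python 3.6 or newer."""
    return tuple(version_info[:2]) >= MIN_PYTHON


def pytorch_tested(version):
    """Return True if a PyTorch version string falls in a tested release series."""
    # Drop local build suffixes such as "+cu100" before comparing.
    base = version.split("+")[0]
    return any(base == t or base.startswith(t + ".") for t in TESTED_PYTORCH)


if __name__ == "__main__":
    print("Python OK:", python_ok())
    try:
        import torch
    except ImportError:
        print("PyTorch is not installed")
    else:
        print("PyTorch", torch.__version__, "tested:", pytorch_tested(torch.__version__))
        print("CUDA available:", torch.cuda.is_available())
```

A version outside the tested list is not necessarily broken; it only means this README makes no statement about it.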

Setup

Choose one of the two installation methods below.

A. Use pip

$ git clone https://github.com/kan-bayashi/ParallelWaveGAN.git
$ cd ParallelWaveGAN
$ pip install -e .

B. Make virtualenv

$ git clone https://github.com/kan-bayashi/ParallelWaveGAN.git
$ cd ParallelWaveGAN/tools
$ make
$ source venv/bin/activate
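
After either installation method, a quick check from Python can confirm that the package is visible in the active environment. The sketch below assumes the import name is parallel_wavegan, which this README does not state explicitly, and the helper is_installed is hypothetical.

```python
import importlib.util


def is_installed(module_name):
    """Return True if a top-level module can be found without importing it."""
    return importlib.util.find_spec(module_name) is not None


if __name__ == "__main__":
    # "parallel_wavegan" is an assumed import name, not confirmed by this README.
    print("parallel_wavegan importable:", is_installed("parallel_wavegan"))
```

If the check reports False after method B, the virtualenv is most likely not activated in the current shell.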

Run

This repository provides Kaldi-style recipes, in the same manner as ESPnet.
Currently, three recipes are supported.

  • LJSpeech: English female speaker
  • JSUT: Japanese female speaker
  • CSMSC: Mandarin female speaker

To run a recipe, follow the instructions below.

# Move to the recipe directory
$ cd egs/ljspeech/voc1

# Run the recipe from scratch
$ ./run.sh

# You can specify the stages at which to start and stop
$ ./run.sh --stage 2 --stop_stage 2
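
In Kaldi-style recipes, each stage conventionally runs only when its index lies between the start and stop stages, inclusive. The following Python sketch illustrates that selection rule only; run.sh itself is a shell script, and the function name stages_to_run and the total of four stages are assumptions made for illustration, not details taken from this repository.

```python
def stages_to_run(stage, stop_stage, num_stages=4):
    """Return the stage indices n that satisfy stage <= n <= stop_stage."""
    # num_stages is a hypothetical total; the real recipe defines its own stages.
    return [n for n in range(num_stages) if stage <= n <= stop_stage]


if __name__ == "__main__":
    # Mirrors "./run.sh --stage 2 --stop_stage 2": only stage 2 is selected.
    print(stages_to_run(2, 2))
```

Under this rule, passing the same value for both options runs exactly one stage, and a stop stage below the start stage selects nothing.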

All of the hyperparameters are defined in a single YAML configuration file.
See the example configuration in the LJSpeech recipe.

Training is still ongoing. Please check the progress at https://github.com/kan-bayashi/ParallelWaveGAN/issues/1.

Acknowledgement

The author would like to thank Ryuichi Yamamoto (@r9y9) for his great repository, paper and valuable discussions.

Author

Tomoki Hayashi (@kan-bayashi)
E-mail: hayashi.tomoki<at>g.sp.m.is.nagoya-u.ac.jp
