Text-to-Speech Synthesis for the Speech Generation for Indigenous Language Education Small Teams Project

Reason this release was yanked:

We no longer use a date-based versioning system.

Project description

EveryVoice TTS Toolkit 💬

This is an implementation of the Text-to-Speech (TTS) model used by the Small Teams "Speech Generation for Indigenous Language Education" project.

It is largely based on the FastSpeech2 and FastPitch models.

Quickstart

Install

Clone the repo and pip install it locally:

$ git clone https://github.com/roedoejet/EveryVoice.git
$ cd EveryVoice
$ git submodule update --init
$ pip install -e .
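
To confirm that the install step registered the package, a small check like the one below may help. It is only a sketch: it assumes the distribution is registered under the name "everyvoice" (as the release file names suggest), and the helper installed_version is a hypothetical name, not part of the toolkit.

```python
from importlib import metadata
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return metadata.version(dist_name)
    except metadata.PackageNotFoundError:
        return None


if __name__ == "__main__":
    # Assumption: pip registered the package under the name "everyvoice".
    version = installed_version("everyvoice")
    if version is None:
        print("everyvoice is not installed in this environment")
    else:
        print(f"everyvoice {version} is installed")
```

Run it inside the same environment you installed into; a missing package usually means the wrong environment is active.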

Dependencies

I recommend using Conda and Python 3.9. To do that, create a new environment:

conda create --name EveryVoice python=3.9
conda activate EveryVoice
conda install pytorch torchvision torchaudio cudatoolkit=11.3 -c pytorch

You can then install the rest of the Python dependencies with pip:

pip3 install -r requirements.txt

Alternatively, you can just create the whole thing from our environment.yml file:

conda env create -f environment.yml
conda activate EveryVoice
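
Once the environment is active, it can be worth checking that PyTorch imports and can see a GPU before training anything. The following is a minimal sketch, not part of the toolkit; describe_torch is a hypothetical helper name, and the check falls back to a plain message if PyTorch is not installed.

```python
import importlib.util


def describe_torch() -> str:
    """Summarize whether PyTorch is importable and whether CUDA is usable."""
    if importlib.util.find_spec("torch") is None:
        return "PyTorch is not installed in this environment"

    import torch  # imported lazily so the check also runs without PyTorch

    if torch.cuda.is_available():
        device = torch.cuda.get_device_name(0)
        return f"PyTorch {torch.__version__} with CUDA device: {device}"
    return f"PyTorch {torch.__version__} (CPU only, no CUDA device visible)"


if __name__ == "__main__":
    print(describe_torch())
```

A "CPU only" result on a machine with a GPU often points to a mismatch between the installed cudatoolkit version and the driver.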

Contributing

Feel free to dive in! Open an issue or submit PRs.

This repo follows the Contributor Covenant Code of Conduct.

You can install our standard Git hooks by running these commands in your sandbox:

pip install -r requirements.dev.txt
pre-commit install
gitlint install-hook

Have a look at Contributing.md for the full details on the Conventional Commit messages we prefer, our code formatting conventions, and our Git hooks.

You can then install the package in editable (development) mode by running the following command from the project root:

pip install -e .

Acknowledgements

This repo draws on many other wonderful code bases. Many thanks to:

https://github.com/nocotan/pytorch-lightning-gans
https://github.com/rishikksh20/iSTFTNet-pytorch
https://github.com/jik876/hifi-gan
https://github.com/ming024/FastSpeech2
https://github.com/MiniXC/LightningFastSpeech2
https://github.com/DigitalPhonetics/IMS-Toucan

Tests

Run a single unit test module with python3 -m unittest tests/test_configs.py, or run a suite of tests with everyvoice test dev if you have the package installed in editable mode.

Download files

Source Distribution

everyvoice-0.1.20230307.tar.gz (47.6 kB)

Uploaded Source

Built Distribution

everyvoice-0.1.20230307-py3-none-any.whl (57.6 kB)

Uploaded Python 3

File details

Details for the file everyvoice-0.1.20230307.tar.gz.

File metadata

  • Download URL: everyvoice-0.1.20230307.tar.gz
  • Upload date:
  • Size: 47.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.14

File hashes

Hashes for everyvoice-0.1.20230307.tar.gz
Algorithm Hash digest
SHA256 b0f4a4e1b3fa751a0e099992de1ec30bc54fb76a0316a9cb6b05b8840da18bb1
MD5 8cd91f863cc3227365f2a62562519e37
BLAKE2b-256 63f9a0530fd354133452a817c6d9a4e185eb1a84575072a99c57bc8933ddde9b

File details

Details for the file everyvoice-0.1.20230307-py3-none-any.whl.

File hashes

Hashes for everyvoice-0.1.20230307-py3-none-any.whl
Algorithm Hash digest
SHA256 15c9fa0b7e5913ffd7980ff642b50bb6e75f8805370964d7e1df87305fdbb5e2
MD5 8652bfb515e8a65dd9eb6a8dca865287
BLAKE2b-256 aabdd2c606fdfce41c6384f6432de41fbc0366959b28d3bbbb81649d60d525e5
