Text-to-Speech Synthesis for the Speech Generation for Indigenous Language Education Small Teams Project
Project description
EveryVoice TTS Toolkit 💬
This is the Text-to-Speech (TTS) toolkit used by the Small Teams "Speech Generation for Indigenous Language Education" project.
Quickstart from PyPI
-
Install Python 3.10, 3.11, or 3.12 and create a venv or a conda env for EveryVoice.
-
Install
sox.- On Ubuntu,
sudo apt-get install sox libsox-devshould work. - Other Linux distros should have equivalent packages.
- With Conda,
conda install sox -c conda-forgeis reliable.
- On Ubuntu,
-
Install
ffmpeg:- On Ubuntu,
sudo apt-get install ffmpegshould work. - Other Linux distros should have an equivalent package.
- With Conda,
conda install ffmpegis reliable. - Or, use the official bundles from https://www.ffmpeg.org/download.html
- On Ubuntu,
-
Install
torchandtorchaudioversion 2.1.0 for your platform and CUDA version: follow the instructions at https://pytorch.org/get-started/locally/ but specifytorch==2.1.0 torchaudio==2.1.0in the install command and removetorchvision. -
Run
pip install everyvoice(change the version to the current version if needed).
Quickstart from source
Install conda
First, you'll need to install conda. Miniforge3 is a fully open-source option which is free for all users and works well. You can also use Anaconda3 or Miniconda3 if you have or can get a license.
Clone the repo
git clone https://github.com/EveryVoiceTTS/EveryVoice.git
cd EveryVoice
git submodule update --init
Environment and installation – automated
To run EveryVoice, you need to create a new environment using Conda and Python 3.12, install all our dependencies and EveryVoice itself.
We have automated the procedure required to do all this in the script make-everyvoice-env, which you can run like this:
./make-everyvoice-env --path <env-path-of-your-choice>
conda activate <env-path-of-your-choice>
Add the option --cuda CUDA_VERSION if you need to override the default CUDA version, or --cpu to use Torch compiled for CPU use only.
Environment and installation – manual
If the automated installation process does not work for you, or if you prefer to do the full installation manually, please refer to EveryVoice / Installation.
Documentation
Read the full EveryVoice documentation.
In particular, read the Guides to get familiar with the whole process.
To build and view the documentation locally:
pip install -e '.[docs]'
mkdocs serve
and browse to http://127.0.0.1:8000/.
Contributing
Feel free to dive in! Open an issue or submit PRs.
This repo follows the Contributor Covenant Code of Conduct.
Please make sure our standard Git hooks are activated, by running these commands in your sandbox (if you used our make-everyvoice-env script then this step is already done for you):
pip install -e '.[dev]'
pre-commit install
gitlint install-hook
git submodule foreach 'pre-commit install'
git submodule foreach 'gitlint install-hook'
Have a look at Contributing.md for the full details on the Conventional Commit messages we prefer, our code formatting conventions, our Git hooks, and recommendations on how to make effective pull requests.
Publishing Instructions
To publish a new version of the project, follow these steps:
-
Determine the Version Bump Decide whether your changes constitute a:
- Major version bump (breaking changes),
- Minor version bump (new features, backward-compatible, any change to the schema), or
- Patch version bump (bug fixes, small changes).
-
Update Version Files
- Update the
everyvoice._versionfile to reflect the new version. - Keep all
submodule._versionfiles in sync with this version, except for thewav2vec2aligner submodule (which can be installed separately). - Commit the resulting changes, including all submodules.
- Update the
-
Update Schema (for Major/Minor bumps) If you bumped a major or minor version:
- Run
everyvoice update-schema. You may need to delete existing schema files if you get an error message, but you should only do so if you are sure that those schema files have not already been published. I.e. we might already have schema files related to an alpha release - those can be overwritten, but we should never change published schema files. - Commit the resulting changes.
- Run
-
Open a Pull Request
- Create a PR with your changes.
- Wait for tests to pass and for the PR to be merged into
main.
-
Tag the Release After merging:
git tag -a -m vX.Y.Z vX.Y.Z git push 'vX.Y.Z'
-
Update SchemaStore (for Major/Minor bumps) Once the CI has built and released your version, if you bumped a major or minor version:
Submit a PR to SchemaStore to update the schema reference.
The only file you need to change is:
src/api/json/catalog.json
Acknowledgements
This repository draws on many other wonderful code bases. Many thanks to:
- https://github.com/nocotan/pytorch-lightning-gans
- https://github.com/rishikksh20/iSTFTNet-pytorch
- https://github.com/jik876/hifi-gan
- https://github.com/ming024/FastSpeech2
- https://github.com/MiniXC/LightningFastSpeech2
- https://github.com/DigitalPhonetics/IMS-Toucan
Tests
Run unit tests by python -m unittest tests/test_configs.py or suites of tests by running everyvoice test dev if you have the package installed interactively.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file everyvoice-0.3.0.tar.gz.
File metadata
- Download URL: everyvoice-0.3.0.tar.gz
- Upload date:
- Size: 168.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7cccacfdb2aea1631ef6f565d00effea55c7d40043e983c19d29840c655ac8ae
|
|
| MD5 |
c3c22c054fd3b560fe172816c13c6667
|
|
| BLAKE2b-256 |
5633187d1e2b8de8cd410095860c287f40f1e83f317d40aa0942560bc5c08007
|
File details
Details for the file everyvoice-0.3.0-py3-none-any.whl.
File metadata
- Download URL: everyvoice-0.3.0-py3-none-any.whl
- Upload date:
- Size: 220.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d997f618a866e02aef35bb79e6d2bc7ef9c8363c3cce3dc96c893d23ef849bc6
|
|
| MD5 |
dc8177630f102983eb27f310b2604570
|
|
| BLAKE2b-256 |
7898d3cbe10066feaa05e5c44032f18ee6b0cda84a55ca888702a26d164aa6b7
|