E2-TTS in Pytorch
Project description
E2 TTS - Pytorch
Implementation of E2-TTS, Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS, in Pytorch
You can chat with other researchers about this work here
Appreciation
- Manmay for contributing working end-to-end training code!
Install
$ pip install e2-tts-pytorch
Usage
import torch
from e2_tts_pytorch import (
E2TTS,
DurationPredictor
)
duration_predictor = DurationPredictor(
transformer = dict(
dim = 512,
depth = 2,
)
)
mel = torch.randn(2, 1024, 512)
text = ['Hello', 'Goodbye']
loss = duration_predictor(mel, text = text)
loss.backward()
e2tts = E2TTS(
duration_predictor = duration_predictor,
transformer = dict(
dim = 512,
depth = 4,
skip_connect_type = 'concat'
),
)
loss = e2tts(mel, text = text)
loss.backward()
sampled = e2tts.sample(mel[:, :5], text = text)
Citations
@inproceedings{Eskimez2024E2TE,
title = {E2 TTS: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS},
author = {Sefik Emre Eskimez and Xiaofei Wang and Manthan Thakker and Canrun Li and Chung-Hsien Tsai and Zhen Xiao and Hemin Yang and Zirun Zhu and Min Tang and Xu Tan and Yanqing Liu and Sheng Zhao and Naoyuki Kanda},
year = {2024},
url = {https://api.semanticscholar.org/CorpusID:270738197}
}
@inproceedings{Li2024ImmiscibleDA,
title = {Immiscible Diffusion: Accelerating Diffusion Training with Noise Assignment},
author = {Yiheng Li and Heyang Jiang and Akio Kodaira and Masayoshi Tomizuka and Kurt Keutzer and Chenfeng Xu},
year = {2024},
url = {https://api.semanticscholar.org/CorpusID:270562607}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
e2_tts_pytorch-0.0.24.tar.gz
(178.9 kB
view hashes)
Built Distribution
Close
Hashes for e2_tts_pytorch-0.0.24-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2b094a137ce713e7a73d3b1e20da2ef7af512852f7e99e80b7e9539b5b3ea12b |
|
MD5 | 9ad14e0b915431a2f7d1e59bd9e2690d |
|
BLAKE2b-256 | 02d526c6ccb61fcb8ebbcb337fd9586f68894379db076959fec40b68176ef782 |