Skip to main content

Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS in MLX

Project description

e2tts-mlx: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS in MLX

A lightweight implementation of Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS model using MLX, with minimal dependencies and efficient computation on Apple Silicon.

Quick Start

Install

# Quick install (note: PyPI version may not always be up to date)
pip install e2tts-mlx

# For the latest version, you can install directly from the repository:
# git clone https://github.com/JosefAlbers/e2tts-mlx.git
# cd e2tts-mlx
# pip install -e .

Usage

To use a pre-trained model for text-to-speech:

e2tts 'We must achieve our own salvation.'

This will write tts_0.wav to the current directory, which you can then play.

https://github.com/user-attachments/assets/89265113-0785-42f1-b867-c38c886acbef

To train a new model with default settings:

e2tts

e2tts

To train with custom options:

e2tts --batch_size=16 --n_epoch=100 --lr=1e-4 --depth=8 --n_ode=32

Select training options:

  • --batch_size: Set the batch size (default: 32)
  • --n_epoch: Set the number of epochs (default: 200)
  • --lr: Set the learning rate (default: 2e-4)
  • --depth: Set the model depth (default: 8)
  • --n_ode: Set the number of steps for sampling (default: 1)

Acknowledgements

Thanks to lucidrains' fantastic code that inspired this project.

License

Apache License 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

e2tts_mlx-0.0.4.tar.gz (8.6 kB view details)

Uploaded Source

Built Distribution

e2tts_mlx-0.0.4-py3-none-any.whl (8.5 kB view details)

Uploaded Python 3

File details

Details for the file e2tts_mlx-0.0.4.tar.gz.

File metadata

  • Download URL: e2tts_mlx-0.0.4.tar.gz
  • Upload date:
  • Size: 8.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for e2tts_mlx-0.0.4.tar.gz
Algorithm Hash digest
SHA256 1e49e1f8e43baeedd6b99ad372b48edcd6f900b27d485a80152eb699db422b47
MD5 183130328d68a7a4eb14650fa0a2ce53
BLAKE2b-256 4540486fb5545eb910c2e9ee728efcecf4f96460fb41195d99dcb1165b23de32

See more details on using hashes here.

File details

Details for the file e2tts_mlx-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: e2tts_mlx-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 8.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for e2tts_mlx-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 9618f50f4478e23f991b484afab0d9d357cd56ac9fa477ee4e4ab7f8af981ee6
MD5 bb29fadbcf3d17814a42188b9dfead64
BLAKE2b-256 aefe8f7b0435eb995690cbd659bbda31c64aa1848339cfab3fc8fa46599619b1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page