e2tts-mlx: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS in MLX

A lightweight MLX implementation of the Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS model, with minimal dependencies and efficient computation on Apple Silicon.

Quick Start

Install

# Quick install (note: PyPI version may not always be up to date)
pip install e2tts-mlx

# For the latest version, you can install directly from the repository
# git clone https://github.com/JosefAlbers/e2tts-mlx.git
# cd e2tts-mlx
# pip install -e .
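Because the PyPI release may lag behind the repository, it can help to check which version is actually installed. The following is a minimal sketch using only the Python standard library; the helper name `installed_version` is hypothetical and not part of e2tts-mlx.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name="e2tts-mlx"):
    """Return the installed version string of a distribution, or None if absent.

    `installed_version` is an illustrative helper, not an e2tts-mlx API.
    """
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


if __name__ == "__main__":
    found = installed_version()
    print(found if found is not None else "e2tts-mlx is not installed")
```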

Usage

To use a pre-trained model for text-to-speech:

e2tts 'We must achieve our own salvation.'

https://github.com/user-attachments/assets/c022d622-2437-4dbf-b3ac-d0ce89322402
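The same command can be driven from a Python script. This is a minimal sketch that shells out to the `e2tts` command shown above; it assumes the command is on `PATH`, and the helper names `synthesis_command` and `synthesize` are hypothetical, not part of the package. Where the generated audio is written is decided by the CLI itself and is not handled here.

```python
import shutil
import subprocess


def synthesis_command(text, executable="e2tts"):
    """Build the argument list for synthesizing `text` with the e2tts CLI."""
    # Per the usage notes, a bare `e2tts` call starts training, so refuse
    # empty input rather than risk launching a training run by accident.
    if not text.strip():
        raise ValueError("text must not be empty")
    # A list of arguments (no shell string) avoids quoting problems with
    # apostrophes and other special characters in the text.
    return [executable, text]


def synthesize(text, executable="e2tts"):
    """Run the CLI on `text`; raises if the command is missing or fails."""
    if shutil.which(executable) is None:
        raise FileNotFoundError(f"{executable!r} not found on PATH")
    subprocess.run(synthesis_command(text, executable), check=True)


if __name__ == "__main__":
    synthesize("We must achieve our own salvation.")
```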

To train a new model with default settings:

e2tts

To train with custom options:

e2tts --batch_size=16 --n_epoch=100 --lr=1e-4 --depth=8 --n_ode=32

Available training options:

  • --batch_size: Set the batch size (default: 32)
  • --n_epoch: Set the number of epochs (default: 200)
  • --lr: Set the learning rate (default: 2e-4)
  • --depth: Set the model depth (default: 8)
  • --n_ode: Set the number of steps for sampling (default: 1)
  • --postfix: Add a custom postfix to output file names
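For scripted experiments, the options above can be assembled into a command line programmatically. The sketch below mirrors the documented flags and defaults; the helper `training_command` is hypothetical, and it assumes the CLI accepts decimal notation for floats (for example `--lr=0.0001` in place of `--lr=1e-4`).

```python
import subprocess

# Defaults copied from the option list above.
DEFAULTS = {"batch_size": 32, "n_epoch": 200, "lr": 2e-4, "depth": 8, "n_ode": 1}


def training_command(executable="e2tts", postfix=None, **overrides):
    """Build an e2tts training command, emitting only non-default options."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        raise ValueError(f"unknown options: {sorted(unknown)}")
    args = [executable]
    for name, default in DEFAULTS.items():
        value = overrides.get(name, default)
        if value != default:
            args.append(f"--{name}={value}")
    if postfix is not None:
        args.append(f"--postfix={postfix}")
    return args


if __name__ == "__main__":
    cmd = training_command(batch_size=16, n_epoch=100, lr=1e-4, n_ode=32)
    print(" ".join(cmd))
    # Uncomment to start training (assumes `e2tts` is installed and on PATH):
    # subprocess.run(cmd, check=True)
```

Omitting options that match the defaults keeps logged commands short, so the flags that remain show exactly how a run differs from the baseline.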

Acknowledgements

Thanks to lucidrains' fantastic code that inspired this project.

License

Apache License 2.0


