e2tts-mlx: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS in MLX

A lightweight implementation of the Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) model in MLX, with minimal dependencies and efficient computation on Apple Silicon.

Quick Start

Install

# Quick install (note: PyPI version may not always be up to date)
pip install e2tts-mlx

# For the latest version, you can install directly from the repository:
# git clone https://github.com/JosefAlbers/e2tts-mlx.git
# cd e2tts-mlx
# pip install -e .
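
Because the PyPI release may lag behind the repository, it can help to check which version pip actually installed. The following is a minimal sketch using the standard library's `importlib.metadata`; the helper name `installed_version` is hypothetical, and the distribution name `e2tts-mlx` is taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version


def installed_version(dist_name: str = "e2tts-mlx"):
    """Return the installed version string of a distribution, or None if absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        # The distribution is not installed in this environment.
        return None


if __name__ == "__main__":
    found = installed_version()
    print(found if found else "e2tts-mlx is not installed")
```

If the reported version is older than the one you need, the repository install shown above is the alternative.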

Usage

To use a pre-trained model for text-to-speech:

e2tts 'We must achieve our own salvation.'

This writes tts_0.wav to the current directory; you can play it with any audio player.
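
To sanity-check the generated file before playing it, a short sketch with the standard library's `wave` module can report its sample rate and duration. The helper name `wav_summary` is hypothetical, and the sketch assumes the output is a PCM-encoded WAV file (the `wave` module does not read other encodings); the source does not state the encoding.

```python
import wave


def wav_summary(path: str = "tts_0.wav") -> dict:
    """Read header fields from a PCM WAV file and compute its duration."""
    with wave.open(path, "rb") as wav_file:
        frames = wav_file.getnframes()
        rate = wav_file.getframerate()
        return {
            "channels": wav_file.getnchannels(),
            "sample_rate": rate,
            # Duration in seconds is the frame count divided by frames per second.
            "duration_s": frames / rate,
        }
```

Calling `wav_summary()` from the directory where `e2tts` was run inspects `tts_0.wav`; pass another path to inspect a different file.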

https://github.com/user-attachments/assets/89265113-0785-42f1-b867-c38c886acbef

To train a new model with default settings:

e2tts

To train with custom options:

e2tts --batch_size=16 --n_epoch=100 --lr=1e-4 --depth=8 --n_ode=32
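
When launching several training runs from a script, the same command line can be assembled programmatically. This is a minimal sketch, not part of e2tts-mlx: the helper `build_train_command` is hypothetical and only formats `--name=value` flags in the style shown above; it does not validate option names.

```python
import shlex


def build_train_command(**options) -> list:
    """Turn keyword options into an argv list such as ['e2tts', '--depth=8']."""
    return ["e2tts"] + [f"--{name}={value}" for name, value in options.items()]


if __name__ == "__main__":
    argv = build_train_command(batch_size=16, n_epoch=100, lr="1e-4", depth=8, n_ode=32)
    # Print the shell-quoted command; to launch it, pass argv to
    # subprocess.run(argv, check=True) instead.
    print(shlex.join(argv))
```

Building an argv list rather than a single string avoids shell quoting issues when the command is handed to `subprocess.run`.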

Selected training options:

  • --batch_size: Set the batch size (default: 32)
  • --n_epoch: Set the number of epochs (default: 200)
  • --lr: Set the learning rate (default: 2e-4)
  • --depth: Set the model depth (default: 8)
  • --n_ode: Set the number of steps for sampling (default: 1)
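
The `--n_ode` option is described above as the number of sampling steps. As a rough illustration of what a step count means for an ODE-based sampler, the sketch below integrates a toy velocity field with fixed-step Euler updates. It is not e2tts-mlx code: the function `euler_sample`, the toy field, and the choice of Euler integration are all assumptions made for illustration.

```python
import math


def euler_sample(velocity, x0: float, n_ode: int) -> float:
    """Integrate dx/dt = velocity(x, t) from t=0 to t=1 in n_ode equal Euler steps."""
    x, dt = x0, 1.0 / n_ode
    for step in range(n_ode):
        t = step * dt
        x = x + dt * velocity(x, t)
    return x


def toy_velocity(x: float, t: float) -> float:
    """Toy field with a known solution: x(1) = x0 * exp(-1)."""
    return -x


if __name__ == "__main__":
    exact = math.exp(-1.0)
    for steps in (1, 8, 32):
        approx = euler_sample(toy_velocity, 1.0, steps)
        print(f"n_ode={steps:>2}  x(1)={approx:.4f}  error={abs(approx - exact):.4f}")
```

In this toy setting, more steps reduce the integration error while the number of velocity evaluations grows in proportion, which is the general trade-off a step-count option controls.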

Acknowledgements

Thanks to lucidrains' fantastic code that inspired this project.

License

Apache License 2.0
