e2tts-mlx: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS in MLX

A lightweight implementation of the Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS (E2 TTS) model in MLX, with minimal dependencies and efficient computation on Apple Silicon.

Quick Start

Install

# Quick install (note: PyPI version may not always be up to date)
pip install e2tts-mlx

# For the latest version, you can install directly from the repository
# git clone https://github.com/JosefAlbers/e2tts-mlx.git
# cd e2tts-mlx
# pip install -e .

Usage

To use a pre-trained model for text-to-speech:

e2tts 'We must achieve our own salvation.'

https://github.com/user-attachments/assets/c022d622-2437-4dbf-b3ac-d0ce89322402

To train a new model with default settings:

e2tts

To train with custom options:

e2tts --batch_size=16 --n_epoch=100 --lr=1e-4 --depth=8 --n_ode=32
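When training runs are launched from a script rather than typed by hand, the same command can be assembled programmatically. The sketch below is only an illustration and is not part of e2tts-mlx: the helper name `build_train_command` is hypothetical, and it assumes nothing beyond the `--flag=value` form shown above.

```python
import shlex


def build_train_command(**options):
    """Assemble an `e2tts` training command as an argument list.

    Hypothetical helper, not part of e2tts-mlx. It only formats flags in
    the `--name=value` style shown in this README and does not validate them.
    """
    cmd = ["e2tts"]
    for name, value in options.items():
        cmd.append(f"--{name}={value}")
    return cmd


# The learning rate is passed as a string so it is emitted as "1e-4"
# rather than Python's float formatting ("0.0001").
cmd = build_train_command(batch_size=16, n_epoch=100, lr="1e-4", depth=8, n_ode=32)
print(shlex.join(cmd))
```

The resulting list can be handed to `subprocess.run(cmd, check=True)` to start training from Python.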

Available training options:

  • --batch_size: Set the batch size (default: 32)
  • --n_epoch: Set the number of epochs (default: 200)
  • --lr: Set the learning rate (default: 2e-4)
  • --depth: Set the model depth (default: 8)
  • --n_ode: Set the number of steps for sampling (default: 1)
  • --postfix: Add a custom postfix to output file names
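E2 TTS is a flow-matching model, so samples are produced by numerically integrating an ODE. Assuming `--n_ode` sets the number of fixed-size solver steps (the option name suggests this, but this README does not say which solver is used), the effect of the step count can be illustrated with a plain Euler integrator on a toy one-dimensional problem. The names `euler_sample` and `toy_velocity` are hypothetical, and the toy field stands in for the trained network.

```python
import math


def euler_sample(velocity, x0, n_steps):
    """Integrate dx/dt = velocity(x, t) from t=0 to t=1 using n_steps Euler steps."""
    x = x0
    dt = 1.0 / n_steps
    for i in range(n_steps):
        t = i * dt
        x = x + dt * velocity(x, t)
    return x


def toy_velocity(x, t):
    # Toy field, not a trained model: dx/dt = -x, whose exact solution is x0 * exp(-t).
    return -x


exact = math.exp(-1.0)  # exact value of x(1) when x(0) = 1
errors = {n: abs(euler_sample(toy_velocity, 1.0, n) - exact) for n in (1, 8, 32)}
for n, err in errors.items():
    print(f"n_steps={n:>2}  abs error={err:.4f}")
```

In this toy setting, more steps track the exact trajectory more closely at proportionally higher cost, which is the usual trade-off behind choosing a step count for sampling.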

Acknowledgements

Thanks to lucidrains' fantastic code that inspired this project.

License

Apache License 2.0
