Skip to main content

Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS in MLX

Project description

e2tts-mlx: Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS in MLX

A lightweight implementation of Embarrassingly Easy Fully Non-Autoregressive Zero-Shot TTS model using MLX, with minimal dependencies and efficient computation on Apple Silicon.

Quick Start

Install

# Quick install (note: PyPI version may not always be up to date)
pip install e2tts-mlx

# For the latest version, you can install directly from the repository
# git clone https://github.com/JosefAlbers/e2tts-mlx.git
# cd e2tts-mlx
# pip install -e .

Usage

To use a pre-trained model for text-to-speech:

e2tts 'We must achieve our own salvation.'

https://github.com/user-attachments/assets/c022d622-2437-4dbf-b3ac-d0ce89322402

To train a new model with default settings:

e2tts

e2tts

To train with custom options:

e2tts --batch_size=16 --n_epoch=100 --lr=1e-4 --depth=8 --n_ode=32

Available training options:

  • --batch_size: Set the batch size (default: 32)
  • --n_epoch: Set the number of epochs (default: 200)
  • --lr: Set the learning rate (default: 2e-4)
  • --depth: Set the model depth (default: 8)
  • --n_ode: Set the number of steps for sampling (default: 1)
  • --postfix: Add a custom postfix to output file names

Acknowledgements

Thanks to lucidrains' fantastic code that inspired this project.

License

Apache License 2.0

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

e2tts_mlx-0.0.3a0.tar.gz (8.3 kB view details)

Uploaded Source

Built Distribution

e2tts_mlx-0.0.3a0-py3-none-any.whl (8.2 kB view details)

Uploaded Python 3

File details

Details for the file e2tts_mlx-0.0.3a0.tar.gz.

File metadata

  • Download URL: e2tts_mlx-0.0.3a0.tar.gz
  • Upload date:
  • Size: 8.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for e2tts_mlx-0.0.3a0.tar.gz
Algorithm Hash digest
SHA256 38e0c55edf44375122a951109aa198c9c92251aeaaefd9ec58b83ad18b85f9e8
MD5 597565a069a45048bd40ba0cdaa0f780
BLAKE2b-256 d9b4e482d913f1db683a1635fc0b069e773144e56a98d9ce36c105fd99c61d87

See more details on using hashes here.

File details

Details for the file e2tts_mlx-0.0.3a0-py3-none-any.whl.

File metadata

  • Download URL: e2tts_mlx-0.0.3a0-py3-none-any.whl
  • Upload date:
  • Size: 8.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for e2tts_mlx-0.0.3a0-py3-none-any.whl
Algorithm Hash digest
SHA256 66a1c7e0bae3461a6040c1688b043a555a4baebb9fbd9184053d150a6290efe9
MD5 3fc612094adcc079cd69c8cce6ab6949
BLAKE2b-256 cd0069f1880f820f6eecb3dd5b263b7c68b8ac91c9735cb1a583fe7504b7a9e1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page