Tortoise TTS is a high-quality text-to-speech model with voice cloning capabilities
Project description
Tortoise TTS Extension for TTS-WebUI
License - the source code within this repository is licensed under the MIT license.
This extension provides a high-quality text-to-speech model with voice cloning capabilities.
Features
- High-quality speech synthesis
- Voice cloning capabilities
- Multiple quality presets (ultra_fast, fast, standard, high_quality)
- Adjustable parameters for both autoregressive and diffusion models
- Support for custom models and tokenizers
- Split prompt functionality for long text
Usage
- Select a model (Default or custom)
- Choose a voice from the dropdown or upload your own voice samples
- Select a preset quality level
- Adjust parameters as needed
- Enter your text in the prompt field
- Click "Generate" to create speech
Advanced Options
Model Settings
- KV Cache: Enable for faster inference at the cost of more VRAM
- DeepSpeed: Enable for optimized performance on supported hardware
- Half Precision: Enable for reduced memory usage
- Custom Tokenizer: Upload a custom tokenizer file for specialized use cases
Autoregressive Parameters
- Num Autoregressive Samples: Higher values produce better quality but slower generation
- Temperature: Controls randomness in the autoregressive model
- Length Penalty: Penalizes longer sequences
- Repetition Penalty: Discourages repetitive outputs
- Top P: Controls diversity of outputs
- Max Mel Tokens: Maximum length of generated speech
Diffusion Parameters
- Diffusion Iterations: Higher values produce better quality but slower generation
- Cond Free: Enable for better quality
- Cond Free K: Controls the strength of conditioning
- Temperature: Controls randomness in the diffusion model
This extension uses the Tortoise TTS model.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file tts_webui_extension_tortoise_tts-0.2.0-py3-none-any.whl.
File metadata
- Download URL: tts_webui_extension_tortoise_tts-0.2.0-py3-none-any.whl
- Upload date:
- Size: 8.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.10.19
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7e662265159178ecb8b15d9f2c2660359d90bb36bde41be717c9ac8787144886
|
|
| MD5 |
81dde225993ae181bd232960da3f3c08
|
|
| BLAKE2b-256 |
2d36c693851c07808fe44c88ea28b6eef4cd36ebdf18444995f54b7e611d38bb
|