Tortoise TTS is a high-quality text-to-speech model with voice cloning capabilities

Tortoise TTS Extension for TTS-WebUI

License: the source code in this repository is licensed under the MIT license.

This extension adds Tortoise TTS, a high-quality text-to-speech model with voice cloning capabilities, to TTS-WebUI.

Features

  • High-quality speech synthesis
  • Voice cloning capabilities
  • Multiple quality presets (ultra_fast, fast, standard, high_quality)
  • Adjustable parameters for both autoregressive and diffusion models
  • Support for custom models and tokenizers
  • Split prompt functionality for long text
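The split prompt feature can be pictured as breaking long text into sentence-aligned chunks that are synthesized one at a time. The following is a minimal sketch of that idea, not the extension's actual implementation; the function name `split_prompt` and the `max_chars` limit are hypothetical.

```python
import re


def split_prompt(text: str, max_chars: int = 200) -> list[str]:
    """Group sentences into chunks of roughly max_chars characters.

    Hypothetical helper: the extension's real splitting rules are not
    documented here. A single sentence longer than max_chars is kept whole.
    """
    # Split after sentence-ending punctuation followed by whitespace.
    sentences = re.split(r"(?<=[.!?])\s+", text.strip())
    chunks: list[str] = []
    current = ""
    for sentence in sentences:
        if not sentence:
            continue
        candidate = f"{current} {sentence}".strip()
        if len(candidate) <= max_chars:
            current = candidate
        else:
            if current:
                chunks.append(current)
            current = sentence
    if current:
        chunks.append(current)
    return chunks
```

Each chunk could then be passed to the model separately and the resulting audio concatenated.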

Usage

  1. Select a model (Default or custom)
  2. Choose a voice from the dropdown or upload your own voice samples
  3. Select a preset quality level
  4. Adjust parameters as needed
  5. Enter your text in the prompt field
  6. Click "Generate" to create speech
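Steps 3 and 4 above amount to starting from a preset and then overriding individual values. The sketch below illustrates that layering. The preset contents are illustrative assumptions based on upstream Tortoise TTS defaults and may not match what this extension uses; `PRESETS` and `resolve_settings` are hypothetical names.

```python
# Illustrative values only: the extension's actual preset contents are an assumption.
PRESETS = {
    "ultra_fast": {"num_autoregressive_samples": 16, "diffusion_iterations": 30, "cond_free": False},
    "fast": {"num_autoregressive_samples": 96, "diffusion_iterations": 80},
    "standard": {"num_autoregressive_samples": 256, "diffusion_iterations": 200},
    "high_quality": {"num_autoregressive_samples": 256, "diffusion_iterations": 400},
}


def resolve_settings(preset: str, **overrides) -> dict:
    """Return a copy of the preset's settings with manual overrides applied on top."""
    if preset not in PRESETS:
        raise ValueError(f"Unknown preset: {preset!r}")
    settings = dict(PRESETS[preset])  # copy so the preset table is never mutated
    settings.update(overrides)
    return settings


# Start from "fast", then raise the diffusion iterations by hand.
settings = resolve_settings("fast", diffusion_iterations=120)
```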

Advanced Options

Model Settings

  • KV Cache: Enable for faster inference at the cost of more VRAM
  • DeepSpeed: Enable for optimized performance on supported hardware
  • Half Precision: Enable for reduced memory usage
  • Custom Tokenizer: Upload a custom tokenizer file for specialized use cases
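To see why Half Precision reduces memory usage, consider the storage needed for model weights alone: 32-bit floats take 4 bytes per parameter and 16-bit floats take 2. The sketch below does that arithmetic. The parameter count in the example is a hypothetical figure, not a measured size of the Tortoise TTS model, and activations and the KV cache are not included.

```python
BYTES_PER_PARAM = {"float32": 4, "float16": 2}


def weight_memory_mib(num_params: int, dtype: str) -> float:
    """Approximate memory for model weights only, in MiB."""
    return num_params * BYTES_PER_PARAM[dtype] / (1024 ** 2)


# Hypothetical parameter count, chosen only for illustration.
example_params = 400_000_000
full = weight_memory_mib(example_params, "float32")
half = weight_memory_mib(example_params, "float16")
print(f"float32: {full:.0f} MiB, float16: {half:.0f} MiB")
```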

Autoregressive Parameters

  • Num Autoregressive Samples: Higher values improve quality but slow down generation
  • Temperature: Controls randomness in the autoregressive model
  • Length Penalty: Penalizes longer sequences
  • Repetition Penalty: Discourages repetitive outputs
  • Top P: Nucleus sampling threshold; lower values restrict sampling to the most likely tokens and reduce output diversity
  • Max Mel Tokens: Maximum length of the generated speech, measured in mel tokens
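Temperature and Top P both reshape the probability distribution that the autoregressive model samples from. The following is a generic, self-contained sketch of those two operations on a toy distribution; it illustrates the standard techniques and is not code from this extension or from Tortoise TTS. The function names are hypothetical.

```python
import math


def apply_temperature(logits: list[float], temperature: float) -> list[float]:
    """Softmax over logits divided by temperature (lower means a sharper distribution)."""
    scaled = [x / temperature for x in logits]
    peak = max(scaled)  # subtract the max for numerical stability
    exps = [math.exp(x - peak) for x in scaled]
    total = sum(exps)
    return [e / total for e in exps]


def top_p_filter(probs: list[float], top_p: float) -> list[float]:
    """Keep the most likely tokens until their mass reaches top_p, then renormalize."""
    order = sorted(range(len(probs)), key=lambda i: probs[i], reverse=True)
    kept: list[int] = []
    mass = 0.0
    for i in order:
        kept.append(i)
        mass += probs[i]
        if mass >= top_p:
            break
    filtered = [0.0] * len(probs)
    for i in kept:
        filtered[i] = probs[i] / mass
    return filtered
```

With a low temperature most of the probability concentrates on the top token, and with a low Top P the unlikely tail is removed before sampling.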

Diffusion Parameters

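  • Diffusion Iterations: Higher values improve quality but slow down generation
  • Cond Free: Enable conditioning-free diffusion, which runs each diffusion step both with and without conditioning and blends the two results; this improves quality at the cost of extra computation
  • Cond Free K: Controls how the conditioned and conditioning-free results are blended when Cond Free is enabled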
  • Temperature: Controls randomness in the diffusion model
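The effect of Cond Free K can be sketched numerically. The blend below follows the formula described in upstream Tortoise TTS, where the output is the conditioned result scaled by (k + 1) minus the unconditioned result scaled by k. Whether this extension applies it unchanged is an assumption, and `blend_cond_free` is a hypothetical name operating on plain lists rather than tensors.

```python
def blend_cond_free(
    conditioned: list[float],
    unconditioned: list[float],
    cond_free_k: float,
) -> list[float]:
    """Blend two diffusion outputs: (1 + k) * conditioned - k * unconditioned."""
    return [
        (1 + cond_free_k) * c - cond_free_k * u
        for c, u in zip(conditioned, unconditioned)
    ]


# With k = 0 the conditioned output is returned unchanged.
unchanged = blend_cond_free([1.0, 2.0], [0.5, 1.0], 0.0)
# Larger k pushes the result further away from the unconditioned output.
pushed = blend_cond_free([1.0, 2.0], [0.5, 1.0], 2.0)
```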

This extension uses the Tortoise TTS model.

File details

Details for the file tts_webui_extension_tortoise_tts-0.2.0-py3-none-any.whl.

File hashes

Hashes for tts_webui_extension_tortoise_tts-0.2.0-py3-none-any.whl:

  • SHA256: 7e662265159178ecb8b15d9f2c2660359d90bb36bde41be717c9ac8787144886
  • MD5: 81dde225993ae181bd232960da3f3c08
  • BLAKE2b-256: 2d36c693851c07808fe44c88ea28b6eef4cd36ebdf18444995f54b7e611d38bb
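
A downloaded wheel can be checked against the SHA256 digest listed above using Python's standard `hashlib` module. This is a minimal sketch; the helper names `sha256_of` and `verify_wheel` are hypothetical, and the local file path is whatever location the wheel was saved to.

```python
import hashlib
from pathlib import Path

# SHA256 digest copied from the hash listing for the 0.2.0 wheel.
EXPECTED_SHA256 = "7e662265159178ecb8b15d9f2c2660359d90bb36bde41be717c9ac8787144886"


def sha256_of(path: str, chunk_size: int = 1 << 20) -> str:
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_wheel(path: str, expected: str = EXPECTED_SHA256) -> bool:
    """Return True if the file's SHA256 digest matches the expected value."""
    return sha256_of(path) == expected.lower()
```

For example, `verify_wheel("tts_webui_extension_tortoise_tts-0.2.0-py3-none-any.whl")` returns `True` only when the local file matches the published digest.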
