Skip to main content

Tortoise TTS is a high-quality text-to-speech model with voice cloning capabilities

Project description

Tortoise TTS Extension for TTS-WebUI

License - the source code within this repository is licensed under the MIT license.

This extension provides a high-quality text-to-speech model with voice cloning capabilities.

Features

  • High-quality speech synthesis
  • Voice cloning capabilities
  • Multiple quality presets (ultra_fast, fast, standard, high_quality)
  • Adjustable parameters for both autoregressive and diffusion models
  • Support for custom models and tokenizers
  • Split prompt functionality for long text

Usage

  1. Select a model (Default or custom)
  2. Choose a voice from the dropdown or upload your own voice samples
  3. Select a preset quality level
  4. Adjust parameters as needed
  5. Enter your text in the prompt field
  6. Click "Generate" to create speech

Advanced Options

Model Settings

  • KV Cache: Enable for faster inference at the cost of more VRAM
  • DeepSpeed: Enable for optimized performance on supported hardware
  • Half Precision: Enable for reduced memory usage
  • Custom Tokenizer: Upload a custom tokenizer file for specialized use cases

Autoregressive Parameters

  • Num Autoregressive Samples: Higher values produce better quality but slower generation
  • Temperature: Controls randomness in the autoregressive model
  • Length Penalty: Penalizes longer sequences
  • Repetition Penalty: Discourages repetitive outputs
  • Top P: Controls diversity of outputs
  • Max Mel Tokens: Maximum length of generated speech

Diffusion Parameters

  • Diffusion Iterations: Higher values produce better quality but slower generation
  • Cond Free: Enable for better quality
  • Cond Free K: Controls the strength of conditioning
  • Temperature: Controls randomness in the diffusion model

This extension uses the Tortoise TTS model.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

File details

Details for the file tts_webui_extension_tortoise_tts-0.1.0-py3-none-any.whl.

File metadata

File hashes

Hashes for tts_webui_extension_tortoise_tts-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8b3f219e92d6a1b241b2fb7b289060ee8851a52deea4b6bc8c71de43140b4d82
MD5 172f02c8ff6f7a1a30edd7e69d1842ac
BLAKE2b-256 ddde2bafd1f06e5b85fc8b51837ead0c4eecec1bf0044ccdf3de5240000fb1ae

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page