OuteAI Text-to-Speech (TTS)
Project description
OuteTTS
OuteTTS is an experimental text-to-speech model that uses a pure language modeling approach to generate speech.
Installation
pip install outetts
Important: For GGUF support, you must manually install llama-cpp-python
first.
Visit https://github.com/abetlen/llama-cpp-python for specific installation instructions
Usage
Interface Usage
from outetts.v0_1.interface import InterfaceHF, InterfaceGGUF
# Initialize the interface with the Hugging Face model
interface = InterfaceHF("OuteAI/OuteTTS-0.1-350M")
# Or initialize the interface with a GGUF model
# interface = InterfaceGGUF("path/to/model.gguf")
# Generate TTS output
# Without a speaker reference, the model generates speech with random speaker characteristics
output = interface.generate(
text="Hello, am I working?",
temperature=0.1,
repetition_penalty=1.1,
max_length=4096
)
# Play the generated audio
output.play()
# Save the generated audio to a file
output.save("output.wav")
Voice Cloning
# Create a custom speaker from an audio file
speaker = interface.create_speaker(
"path/to/reference.wav",
"reference text matching the audio"
)
# Save the speaker to a file
interface.save_speaker(speaker, "speaker.pkl")
# Load the speaker from a file
speaker = interface.load_speaker("speaker.pkl")
# Generate TTS with the custom voice
output = interface.generate(
text="This is a cloned voice speaking",
speaker=speaker,
temperature=0.1,
repetition_penalty=1.1,
max_length=4096
)
Technical Blog
https://www.outeai.com/blog/OuteTTS-0.1-350M
Credits
- WavTokenizer: GitHub Repository
- Decoder and encoder folder files are from this repository
- CTC Forced Alignment: PyTorch Tutorial
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
outetts-0.1.7.tar.gz
(57.8 kB
view details)
Built Distribution
outetts-0.1.7-py3-none-any.whl
(73.7 kB
view details)
File details
Details for the file outetts-0.1.7.tar.gz
.
File metadata
- Download URL: outetts-0.1.7.tar.gz
- Upload date:
- Size: 57.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | cfb84df50cc80c91e82aa2f14635b1706863b47e61942ba5e27bc8fde11fbaa1 |
|
MD5 | e19f2b1ffccce42252393dd2f56b50d0 |
|
BLAKE2b-256 | 787aa141edf0aa14353f8ce289ecb771b38d583c34ce3d10f46c0cae602b2f9a |
File details
Details for the file outetts-0.1.7-py3-none-any.whl
.
File metadata
- Download URL: outetts-0.1.7-py3-none-any.whl
- Upload date:
- Size: 73.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.0 CPython/3.12.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 13ee51c0c787c26ceda04982efc6094cf6bd64d1f3180e4882d7bcbe10f28e71 |
|
MD5 | ef55757582cbbfbee5d72fbc86ab7484 |
|
BLAKE2b-256 | 48e422373534e64dc5b331b805fe85badbc9321cc83eb7fc67fc6f36b6d76091 |