Text to speech with voice cloning
Project description
Speaky
Text‑to‑Speech with Voice Cloning made easy with Chatterbox TTS
Generate natural‑sounding speech from plain text—locally on your GPU/CPU using a single, ergonomic command‑line tool.
Features
- Voice cloning from a short audio prompt (optional)
- Built‑in voices accessible via
-v NAME(or-v ALLfor every voice) - Emotion control via exaggeration & classifier‑free guidance
- Device auto‑detection — Apple Silicon (mps), CUDA GPUs, or CPU
- Smart sentence chunking (NLTK) to handle long passages gracefully
- Trailing‑silence trimming so outputs end crisply
- Glitch / clipping detection heuristic for cleaner audio
- Verification via transcription (Distil‑Whisper) to catch missing words
🚀 Quickstart (CLI)
Run without installation:
uvx speak "Do or do not. There is no try." --voice vader
or install first:
uv tool install speaky
speak --file speech.txt --voice attenborough
Run speak --help for the full list of options.
🐍 Python API
from pathlib import Path
from speaky.core import batch_synthesize
batch_synthesize(
inputs=[("Hello there!", "greeting")], # (text, stem)
output_dir=Path("out"),
)
Examples
https://github.com/user-attachments/assets/f1956d1a-c407-4944-b9f8-a5402f71cbd8
https://github.com/user-attachments/assets/8e6a81df-773c-4d48-876b-4cced0b4f643
https://github.com/user-attachments/assets/2c2884e2-759a-4b1d-884c-c1d5eed9ddf0
https://github.com/user-attachments/assets/2d70a497-b156-40d8-907f-0beb33ea68d6
https://github.com/user-attachments/assets/0ab6195d-ae9e-420e-bde2-719dad772563
https://github.com/user-attachments/assets/aeabd1d9-3839-4190-8460-8ad2e364dd02
Credit
This project is built on top of chatterbox open-source TTS.
Development
Quick Commands
make initcreate the environment and install dependenciesmake helpsee available commandsmake afformat codemake lintrun lintermake typecheckrun type checkermake testrun testsmake checkrun all checks (format, lint, typecheck, test)uv add pkgadd a python dependencyuv run -- python foo/bar.pyrun arbitrary command in python env
Code Conventions
- Always run
make checkuafter making changes.
Testing
- Use pytest (no test classes).
- Always set
match=inpytest.raises. - Prefer
monkeypatchover other mocks. - Mirror the source-tree layout in
tests/.
Exceptions
- Catch only specific exceptions—never blanket
except:blocks. - Don't raise bare
Exception.
Python
- Manage env/deps with uv (
uv add|remove,uv run -- ...). - No logging config or side-effects at import time.
- Keep interfaces (CLI, web, etc.) thin; put logic elsewhere.
- Use
typerfor CLI interfaces,fastapifor web interfaces, andpydanticfor data models.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file speaky-0.1.1.tar.gz.
File metadata
- Download URL: speaky-0.1.1.tar.gz
- Upload date:
- Size: 4.3 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e16a5327d7f3d826598fe74eaabacc0316b2439c2d1ca22c9c764b2e2db40ca1
|
|
| MD5 |
5707856d9a0dd908ad6069ce5c5f2ba7
|
|
| BLAKE2b-256 |
d0ce74d5356d8773384f7acbceeb437b8296017a8da226799db916125704704a
|
File details
Details for the file speaky-0.1.1-py3-none-any.whl.
File metadata
- Download URL: speaky-0.1.1-py3-none-any.whl
- Upload date:
- Size: 1.7 MB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
e9fa6392ab6d396c6feace2b75ee2b1fd3c4c4bfd375e3875227405b5b3e2cb1
|
|
| MD5 |
9cdec986120f47eead9a5e88dd74a7dc
|
|
| BLAKE2b-256 |
e52808e2119d60aa2e423b053069ec7ec17d62fb30b18cd7491e0633bac65807
|