Replace voices in YouTube videos
TurnVoice
A command-line tool to replace voices in YouTube videos.
Installation
pip install turnvoice
For faster rendering, prepare your CUDA environment by installing the PyTorch build that matches your CUDA version:
CUDA 11.8:
pip install torch==2.1.1+cu118 torchaudio==2.1.1+cu118 --index-url https://download.pytorch.org/whl/cu118
CUDA 12.1:
pip install torch==2.1.1+cu121 torchaudio==2.1.1+cu121 --index-url https://download.pytorch.org/whl/cu121
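After installing one of the CUDA builds, it can help to confirm that PyTorch actually sees the GPU before starting a long render. The following is a minimal sketch, not part of TurnVoice; `describe_torch_cuda` is a hypothetical helper name, and it only reports what the installed `torch` package says about itself.

```python
import importlib.util


def describe_torch_cuda() -> str:
    """Summarize the installed torch build and whether CUDA is usable."""
    # Avoid an ImportError if torch has not been installed yet.
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed"

    import torch

    if torch.cuda.is_available():
        # torch.version.cuda is the CUDA version the wheel was built against.
        device = torch.cuda.get_device_name(0)
        return f"torch {torch.__version__} with CUDA {torch.version.cuda}: {device}"
    return f"torch {torch.__version__} without usable CUDA (CPU only)"


print(describe_torch_cuda())
```

If the output reports a CPU-only build, the installed wheel most likely does not match the local CUDA setup.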
Usage
turnvoice -u <YouTube Video URL> -rw <Reference WAV File> -ov <Output Video Filename>
Parameters Explained:
- -u, --url: (required) The YouTube video URL you want to transform.
- -l, --language: The language for transcription and synthesis (default: 'en').
- -dd, --download_directory: Where to save the video downloads (default: 'downloads').
- -sd, --synthesis_directory: Where to save the text-to-speech audio files (default: 'synthesis').
- -e, --extract: Extract the audio directly from the video (may lead to lower quality but can reduce the likelihood of errors).
- -rw, --reference_wav: Your chosen voice in WAV format (24 kHz, 16-bit, mono, ~10-30 s).
- -ov, --output_video: The grand finale video file name (default: 'final_cut.mp4').
Example Command:
Ever wanted Arthur Morgan to narrate a cooking tutorial? Here's how:
turnvoice -u https://www.youtube.com/watch?v=AmC9SmCBUj4 -rw arthur.wav -ov cooking_with_arthur.mp4
This example needs an arthur.wav reference file (the voice to clone) in the same directory. It works when executed from the tests directory.
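For scripted or batch use, the same command can be assembled from Python. This is only a sketch: `build_turnvoice_command` and `run_turnvoice` are hypothetical helpers, they use only the flags listed above, and they assume that -e is a plain switch that takes no value.

```python
import shlex
import subprocess


def build_turnvoice_command(url, reference_wav, output_video="final_cut.mp4",
                            language="en", extract=False):
    """Return the turnvoice argument list for one video (hypothetical helper)."""
    cmd = ["turnvoice", "-u", url, "-l", language,
           "-rw", reference_wav, "-ov", output_video]
    if extract:
        # Assumption: -e is an on/off switch without a value.
        cmd.append("-e")
    return cmd


def run_turnvoice(cmd):
    """Run the command; raises CalledProcessError if turnvoice exits non-zero."""
    subprocess.run(cmd, check=True)


if __name__ == "__main__":
    cmd = build_turnvoice_command(
        "https://www.youtube.com/watch?v=AmC9SmCBUj4",
        "arthur.wav",
        "cooking_with_arthur.mp4",
    )
    # Print the shell-quoted command; call run_turnvoice(cmd) to execute it.
    print(shlex.join(cmd))
```

Passing the arguments as a list avoids shell-quoting problems with characters such as `?` and `=` in the video URL.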
Pro Tips
The Art of Choosing a Reference Wav
- A 16-bit mono WAV file of 10-30 seconds at 24000, 44100 or 22050 Hz is your golden ticket.
- 24 kHz, 16-bit mono is my default, but I have also had voices where 44100 Hz, 32-bit yielded the best results.
- I test voices with this tool before rendering
- Audacity is your friend for adjusting sample rates. Experiment with different sample rates for best results!
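The recommendations above can be checked quickly before rendering. The sketch below uses Python's standard `wave` module; `describe_wav` and `check_reference_wav` are hypothetical helper names, and the thresholds simply restate the tips in this section, so treat the findings as advisory rather than as hard requirements.

```python
import wave

RECOMMENDED_RATES = (22050, 24000, 44100)


def describe_wav(path):
    """Read basic format information from a PCM WAV file."""
    with wave.open(path, "rb") as wav:
        rate = wav.getframerate()
        return {
            "sample_rate_hz": rate,
            "bit_depth": wav.getsampwidth() * 8,   # sample width is in bytes
            "channels": wav.getnchannels(),
            "duration_s": wav.getnframes() / rate,
        }


def check_reference_wav(path):
    """Return a list of deviations from the recommended reference format."""
    info = describe_wav(path)
    problems = []
    if info["sample_rate_hz"] not in RECOMMENDED_RATES:
        problems.append(f"sample rate is {info['sample_rate_hz']} Hz")
    if info["bit_depth"] != 16:
        problems.append(f"bit depth is {info['bit_depth']}")
    if info["channels"] != 1:
        problems.append(f"{info['channels']} channels instead of mono")
    if not 10 <= info["duration_s"] <= 30:
        problems.append(f"duration is {info['duration_s']:.1f} s")
    return problems
```

Calling `check_reference_wav("arthur.wav")` returns an empty list when the file matches the recommended format. Note that the `wave` module only reads PCM data, so a 32-bit float WAV raises `wave.Error` instead of being reported.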
Fixed TTS Model Download Folder
Keep your models organized! Set the COQUI_MODEL_PATH environment variable to your preferred folder.
Windows example:
setx COQUI_MODEL_PATH "C:\Downloads\CoquiModels"
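Note that setx only affects terminals opened after the command runs, not the current one. To confirm that the variable is visible, a small check like the following can be run from a new terminal. It is a sketch that only inspects the environment; `coqui_model_dir` is a hypothetical helper, and how TurnVoice or the TTS library reads the variable internally is not shown here.

```python
import os
from pathlib import Path


def coqui_model_dir(env=None):
    """Return COQUI_MODEL_PATH as a Path, or None if it is unset or empty."""
    env = os.environ if env is None else env
    value = env.get("COQUI_MODEL_PATH", "").strip()
    return Path(value) if value else None


model_dir = coqui_model_dir()
if model_dir is None:
    print("COQUI_MODEL_PATH is not set in this process")
else:
    state = "exists" if model_dir.is_dir() else "does not exist yet"
    print(f"COQUI_MODEL_PATH = {model_dir} ({state})")
```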
Future Improvements
- Optional Translation: Polyglot? Coming soon!
- Optimized Synthesis: Reducing the synthesis tries for faster results.
- Voice Cloning from YouTube: Imagine cloning voices directly from other videos!
License
TurnVoice is proudly licensed under the Coqui Public Model License 1.0.0.
Let's Make It Fun! 🎉
Share your funniest or most creative TurnVoice creations with me!
And if you've got a cool feature idea or just want to say hi, drop me a line on
Don't forget to leave a star.
Hashes for TurnVoice-0.0.20-py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | 102443a740ca4ac79f5181328629eb3b0bc6b69b323f3e6e6c6f56a4bc2c5cba
MD5 | b8c0182351bc6afd198d5bfe36d3b963
BLAKE2b-256 | b4a3e9103cfac13318c2e4bfee2cd54ab1419ff0b76ba40af01028cb80030ad5