Skip to main content

Replace voices in youtube videos

Project description

TurnVoice 🎬

A command-line tool to replace voices in youtube videos.

Installation

pip install turnvoice

For a speedier experience, prepare your CUDA environment:

CUDA 11.8:

pip install torch==2.1.1+cu118 torchaudio==2.1.1+cu118 --index-url https://download.pytorch.org/whl/cu118

CUDA v12.1:

pip install torch==2.1.1+cu118 torchaudio==2.1.1+cu211 --index-url https://download.pytorch.org/whl/cu211

Usage 🎮

turnvoice -u <YouTube Video URL> -rw <Reference WAV File> -ov <Output Video Filename>

Parameters Explained:

  • -u, --url: (required) The YouTube video URL you want to transform.
  • -l, --language: The language for transcription and synthesis (default: 'en').
  • -dd, --download_directory: Where to save the video downloads (default: 'downloads').
  • -sd, --synthesis_directory: Secret lab for tts synthesis audio files (default: 'synthesis').
  • -e, --extract: To extract or not to extract audio directly from the video? That is the question. False can lead to better quality but also increase likelihood of errors.
  • -rw, --reference_wav: Your chosen voice in wav format (24000 Hz, 16 bit)
  • -ov, --output_video: The grand finale video file name (default: 'final_cut.mp4').

Example Command:

Ever wanted Arthur Morgan to narrate a cooking tutorial? Here's how:

turnvoice -u https://www.youtube.com/watch?v=AmC9SmCBUj4 -rw arthur.wav -ov cooking_with_arthur.mp4

This example needs a arthur.wav clone wav file in the same directory. May magically work when executed from the test directory though.

Pro Tips 🧙‍♂️

The Art of Choosing a Reference Wav:

  • A 24000, 44100 or 22050 Hz 16-bit, mono wav file of 10-30 seconds is your golden ticket.
  • Audacity is your friend for adjusting sample rates. Experiment with frame rates for best results!

Fixed TTS Model Download Folder:

Keep your models organized! Set COQUI_MODEL_PATH to your preferred folder.

Windows example:

setx COQUI_MODEL_PATH "C:\Downloads\CoquiModels"

Future Improvements 🚀

  • Optional Translation: Polyglot? Coming soon!
  • Optimized Synthesis: Reducing the synthesis tries for faster results.
  • Voice Cloning from YouTube: Imagine cloning voices directly from other videos!

License 📜

TurnVoice is proudly under the Coqui Public Model License 1.0.0.

Let's Make It Fun! 🎉

Share your funniest or most creative TurnVoice creations with me!

And if you've got a cool feature idea or just want to say hi, drop me a line on

Don't forget to leave a star.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

TurnVoice-0.0.11.tar.gz (10.2 kB view hashes)

Uploaded Source

Built Distribution

TurnVoice-0.0.11-py3-none-any.whl (12.5 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page