Replace voices in youtube videos

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Project description

TurnVoice

A YouTube Video Voice Replacement Tool

This tool provides a convenient way to download YouTube videos, transcribe their audio, and replace the original voice with another one.

Features

YouTube Video Download: Downloads a specific YouTube video.
Audio Transcription: Transcribes the audio of the downloaded video.
Voice Synthesis: Replaces the original voice in the video with a synthetic voice.

Installation

This should make you ready:

pip install turnvoice

If you like it faster, get your CUDA ready:

For CUDA 11.8:

pip install torch==2.1.1+cu118 torchaudio==2.1.1+cu118 --index-url https://download.pytorch.org/whl/cu118

For CUDA v12.1:

pip install torch==2.1.1+cu118 torchaudio==2.1.1+cu211 --index-url https://download.pytorch.org/whl/cu211

Usage

This tool is executed via the command line. Below are the parameters that can be configured:

-u, --url (required): URL of the YouTube video to process.
-l, --language: Language code for assumed transcription (default: 'en') language and synthesis language.
-dd, --download_directory: Directory for saving downloaded files (default: 'downloads').
-sd, --synthesis_directory: Directory for saving synthesized audio files (default: 'synthesis').
-e, --extract: Flag to extract audio from the video file. Set to false downloads audio seperately, can lead to better quality but also increase likelihood of errors.
-rw, --reference_wav: Reference audio file for voice synthesis. Can be wav or calculated json file.
-ov, --output_video: Filename for the output video with synthetic voice (default: 'final_cut.mp4').

Example Command

turnvoice -u https://www.youtube.com/watch?v=JfV6UcF18ts -rw arthur_morgan.wav -ov rdr2.mp4

Exchanges all voices in the given youtube vid with the voice in "arthur_morgan.wav" and writes the final cut into rdr2.mp4.
Requires a wave file "arthur_morgan.wav" in the working directory.

Hints

Fixed TTS model download folder

Create a fixed folder (for example C:\Downloads\CoquiModels) for your coqui xtts model downloads and set the environment variable COQUI_MODEL_PATH to this folder.

Windows (example folder):

setx COQUI_MODEL_PATH "C:\Downloads\CoquiModels"

Reference Wav

I recommend using a 24000 Hz, 16 bit, mono wav file of 10-30 second length. According to the config.json of the synthesis model it should be 22050Hz but it works better with 24000 in my experience. You can use Audacity to drop any input audio, set the sample rate at the down left bottom to 24000 and then export as 16 bit PCM.

Future Improvements

Optional Translation: coming soon
Optimized synthesis. Currently does too many synthesis tries per sentence fragment. Can be done better.
Grab clone voice from another youtube video.

License

The project is under Coqui Public Model License 1.0.0.

Make sure to comply with both YouTube's terms of service and copyright laws as well as Coqui's Public Model License 1.0.0 when using this tool.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Release history Release notifications | RSS feed

0.0.65

Dec 20, 2023

0.0.60

Dec 18, 2023

0.0.50

Dec 15, 2023

0.0.46

Dec 13, 2023

0.0.45

Dec 12, 2023

0.0.41

Dec 12, 2023

0.0.40

Dec 12, 2023

0.0.33

Dec 8, 2023

0.0.32

Dec 8, 2023

0.0.31

Dec 8, 2023

0.0.30

Dec 8, 2023

0.0.22

Dec 5, 2023

0.0.21

Dec 5, 2023

0.0.20

Dec 5, 2023

0.0.13

Dec 5, 2023

0.0.12

Dec 5, 2023

0.0.11

Dec 5, 2023

0.0.2 yanked

Dec 5, 2023

Reason this release was yanked:

wrong version number

This version

0.0.1

Dec 4, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

TurnVoice-0.0.1.tar.gz (10.2 kB view hashes)

Uploaded Dec 4, 2023 Source

Built Distribution

TurnVoice-0.0.1-py3-none-any.whl (12.2 kB view hashes)

Uploaded Dec 4, 2023 Python 3

Hashes for TurnVoice-0.0.1.tar.gz

Hashes for TurnVoice-0.0.1.tar.gz
Algorithm	Hash digest
SHA256	`bf7aece42221dab9501010b890f2fc6944ebf395c8caeb5aff71763cfa25b9a1`
MD5	`7cb62977e09ba2ad1f960d72385dcc38`
BLAKE2b-256	`2d003d9005cea534fbdb1810021b2de611d2ca1e3f10e52d7d8bf44854fe950d`

Hashes for TurnVoice-0.0.1-py3-none-any.whl

Hashes for TurnVoice-0.0.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6d957d18efe8bb37075db5a22d7b67b7186d97218ea0f86c6388c1495177afae`
MD5	`96d3955f77382d02913a5631941150dc`
BLAKE2b-256	`29d019e3972899916529ecc6ff08b73a22478edbe7b3b9546777ac61ea5df2fd`