Generate audio or speech from any subtitle file
Project description
Subtitle to Audio
Generate audio from any subtitle file using coqui-ai TTS.
Dependencies
ffmpeg, pydub, librosa, coqui-ai TTS
Installation
pip install git+https://github.com/bnsantoso/sub-to-audio
pip install subtoaudio
ffmpeg on linux
apt-get install ffmpeg
Example usage
Basic use is very similiar to Coqui-ai TTS, you can check their documentation.
from subtoaudio import SubToAudio
#Using the Fairseq English speaker model as the default, the code will output 'output.wav' in the current directory.
sub = SubToAudio(gpu=True)
subtitle = sub.subtitle("yoursubtitle.srt")
sub.convert_to_audio(data=subtitle)
#you can choose 1100 different language using fairseq model
sub = SubToAudio(language='<lang-iso_code>')
subtitle = sub.subtitle("yoursubtitle.ass")
sub.convert_to_audio(data=subtitle)
#specify model name
sub = SubToAudio(model_name="tts_models/multilingual/multi-dataset/your_tts")
subtitle = sub.subtitle("yoursubtitle.srt")
sub.convert_to_audio(data=subtitle, output_path="subtitle.wav")
#specify model and config path
sub = SubToAudio(model_path="path/to/your/model.pth" config_path="config/path.json")
subtitle = sub.subtitle("yoursubtitle.srt")
sub.convert_to_audio(data=subtitle)
#By default, it is using "speaker=tts.speakers[0]/None, language=tts.languages[0]/None, speaker_wav=None"
sub = SubToAudio(model_name="tts_models/multilingual/multi-dataset/your_tts")
subtitle = sub.subtitle("yoursubtitle.srt")
sub.convert_to_audio(data=subtitle, language="en", speaker="speakername", speaker_wav="your/path/speaker.wav", output_path="subtitle.wav")
Citation
Eren, G., & The Coqui TTS Team. (2021). Coqui TTS (Version 1.4) [Computer software]. https://doi.org/10.5281/zenodo.6334862
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
subtoaudio-0.1.1.tar.gz
(9.2 kB
view hashes)
Built Distribution
Close
Hashes for subtoaudio-0.1.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 7051c2b2209e6ce2525b5781bf9b9b29aedf53875aa2eb76b30fcbc19989656b |
|
MD5 | 0b6f67d8a1c83861886fd702aafa32b2 |
|
BLAKE2b-256 | 543e50a8a814c4743fd6c495300379e6ef1b03b9dbea53aa7950b9b914190116 |