A pakage for crawling and processing audio, caption from Youtube
Project description
Audio, Caption Crawler and Processor -TTS Data Generator-
Downloads and processes the audios and captions(subtitles) from Youtube videos for Speech AI
Generates audio datas from Youtube for TTS
Requirements
- Currently requires python >= 3.6
- FFmpeg
To Use
from vctube import VCtube
playlist_name=""
playlist_url = ""
lang = "" #ex) ko, en, fr, de...
vc = VCtube(playlist_name, playlist_url, lang)
vc.download_audio() #download audios from youtube
vc.download_captions() #download captions from youtube
vc.audio_split() #split audio with captions
Results
datasets
|- playlist name
|- metadata.csv
|- alignment.json
|- wavs
├── 1.wav
├── 2.wav
├── 3.wav
└── ...
and metadata.csv
should look like:
{
"0001.wav|그래서 사람들도 날 핍이라고 불렀다.",
"0002.wav|크리스마스 덕분에 부엌에 먹을게 가득했다.",
"0003.wav|조가 자신이 그 사람이라고 나섰다.",
...
}
and alignment.json
should look like:
{
"./datasets/playlist name/wavs/0001.wav": "그래서 사람들도 날 핍이라고 불렀다.",
"./datasets/playlist name/wavs/0002.wav": "크리스마스 덕분에 부엌에 먹을게 가득했다.",
"./datasets/playlist name/wavs/0003.wav": "조가 자신이 그 사람이라고 나섰다.",
...
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
vctube-1.1-py3-none-any.whl
(7.6 kB
view hashes)