download youtube subtitles(closed caption, cc) as txt or json
Project description
Download Youtube Subtitle
Download youtube subtitles(closed caption, cc) or srt as txt or json.
Features
- Support exportting translation at the same time which is useful for language study.
- Support proxy for youtube, follow the step at Using Anaconda behind a company proxy — Anaconda documentation.
- Full test with traivis to make sure things are on rail.
python version of algolia/youtube-captions-scraper: Fetch youtube user submitted or fallback to auto-generated captions
Example
dl-youtube-cc 5tKOV0KqPlg --translation zh-Hans
will saved as HowSouthAfricaCouldPreparetheU.S.forPresidentTrumpTheDailyShow.txt
https://youtube.com/get_video_info?video_id=5tKOV0KqPlg
---------00:01----------
All right, well, uh,
let's get back to real life
好吧,恩,让我们回到现实生活中
---------00:04----------
or whatever we're calling
this thing now.
或我们现在所说的这个东西。
// continue
dl-youtube-cc 5tKOV0KqPlg --translation False --to_json=True
will saved as HowSouthAfricaCouldPreparetheU.S.forPresidentTrumpTheDailyShow.json
[
{
"start": "1.367",
"dur": "3.137",
"text": "All right, well, uh,\nlet's get back to real life"
},
]
// continue
Install and Run
Install vai download-youtube-subtitle · PyPI
pip install download-youtube-subtitle
orpip install download-youtube-subtitle --user
dl-youtube-cc -h
run in cli
SYNOPSIS
dl-youtube-cc VIDEOID <flags>
DESCRIPTION
Examples:
dl-youtube-cc 5tKOV0KqPlg --save_to_file=False # print stuff in console
dl-youtube-cc 5tKOV0KqPlg --output_file='test.txt' # print stuff in named file
dl-youtube-cc 5tKOV0KqPlg --to_json=True # print stuff in json
dl-youtube-cc 5tKOV0KqPlg --translation 'ja' # use japanese translation, see ./lang_code for full list
dl-youtube-cc 5tKOV0KqPlg --translation False # without translation
POSITIONAL ARGUMENTS
VIDEOID
string, the id of youtube video, the string after 'v=' in a youtube video link
FLAGS
--output_file=OUTPUT_FILE
string, default to vidio title
--save_to_file=SAVE_TO_FILE
bool, default to True, True or False
--translation=TRANSLATION
bool or string, default to 'zh-Hans' for simplified Chinese, False or lang code, see ./lang_code.json for full list
--to_json=TO_JSON
bool, default to False, export caption to json
Use in Code
import download_youtube_subtitle.common as common
import download_youtube_subtitle.main as download_youtube_subtitle
# ...
Development
Environment Setup
pip install 'fire' 'requests' 'IPython'
Usage
python main.py -h
python main.py VIDEOID
Tests
cd tests
./run.sh
./test_cli.sh
Ref
Packaging Python Projects — Python Packaging User Guide
./nb/notebook2script.py
from course-v3/nbs/dl2 at master · fastai/course-v3
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Hashes for download-youtube-subtitle-0.0.7.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8b4b568b46b1fd69c4d09c261dc354a38b9153f18e6efce0e7aae1c3fb9345f2 |
|
MD5 | 871b18f408e4f53d5097428a07e76a01 |
|
BLAKE2b-256 | 8ac7a5a9d179d1deec24edfad2ea1c26dd0717ad4592054e62d72306c0ed1786 |