Generate text captions for audio files & youtube video using OpenAI Whisper. Multiple languages support.
Project description
I. INTRODUCTION
ur_audio_sub
made generating captions easy for any audio files & youtube video using OpenAI Whisper. Multiple languages support.
II. REFERENCES
2.1. First thing first
You will need to install OpenAI whisper package from source using pip:
Let's runt this command in the terminal fist: !pip install git+https://github.com/openai/whisper.git -q
You also need ffmpeg be installed to start generating captions:
- on Ubuntu or Debian, or Google Colab:
sudo apt update && sudo apt install ffmpeg
- on MacOS using Homebrew (https://brew.sh/):
brew install ffmpeg
- on Windows using Chocolatey (https://chocolatey.org/):
choco install ffmpeg
- on Windows using Scoop (https://scoop.sh/):
scoop install ffmpeg
2.2. How to install this package?
- Using pip to installed pre-builded package on Pypip
pip install ur_audio_sub
- If you want to use the latest pydata_master version instead of the stable one, you can install it from source with the following command:
pip install git+https://github.com/thinh-vu/ur_audio_sub.git@main
III. APENDICES
- This package has been built on top of pytube and OpenAI Whisper:
- pytube is a genuine, lightweight, dependency-free Python library (and command-line utility) for downloading YouTube videos.
- whisper: Whisper is a general-purpose speech recognition model. It is trained on a large dataset of diverse audio and is also a multi-task model that can perform multilingual speech recognition as well as speech translation and language identification.
IV. 🙋♂️ CONTACT INFORMATION
You can contact me at one of my social network profiles:
- 💼 LinkedIn: https://linkedin.com/in/thinh-vu
- :octocat: GitHub: https://github.com/thinh-vu
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
ur_audio_sub-0.0.1.tar.gz
(4.4 kB
view hashes)
Built Distribution
Close
Hashes for ur_audio_sub-0.0.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | d139e93e888f3fcad579a81d159554d41d046369935f8ae2db2e85813b1b28b5 |
|
MD5 | b8b3f0f2e4565dce0b5296302d61a1e7 |
|
BLAKE2b-256 | e3354df4c15cff3611d6613dadbe6f40a36beb72457d736f8b64ee824079ea7e |