Input a local file or url and this service will transcribe it using Mozilla Deepspeech
Project description
transcribe-anything
Input a local file or url and this service will transcribe it using Mozilla Deepspeech 0.9.3.
- Example (cmd):
transcribe_anything <YOUTUBE_URL> out_subtitles.txt
transcribe_anything <LOCAL.MP4/WAV> out_subtitles.txt
- Example (api):
from transcribe_anything.transcribe_anything import bulk_fetch_subtitles urls = ['https://www.youtube.com/watch?v=Erk4_jFDjzQ'] def onresolve(url, sub): print(url, sub) bulk_fetch_subtitles(urls, onresolve=onresolve)
Quick start
Optional: Create a virtual python package
- Works for Ubuntu/MacOS bash or win32 git-bash
mkdir transcribe_anything
cd transcribe_anything
- Download and install virtual env:
curl -X GET https://raw.githubusercontent.com/zackees/make_venv/main/make_venv.py -o make_env.py
python make_env.py
- Enter the environment:
source activate.sh
The environment is now active and the next step will only install to the local python. If the terminal
is closed then to get back into the environment cd transcribe_anything
and execute source activate.sh
Required: Install to current python environment
pip install transcribe-anything
- The command
transcribe_anything
will magically become available.
- The command
transcribe_anything <YOUTUBE_URL> out_subtitles.txt
- -or-
transcribe_anything <MY_LOCAL.MP4/WAV> out_subtitles.txt
Tech Stack
- Mozilla DeepSpeech: https://github.com/mozilla/DeepSpeech
- pydeepspeech: https://github.com/zackees/pydeepspeech
- mic_vad_streaming: https://github.com/hadran9/DeepSpeech-examples/tree/r0.9/mic_vad_streaming
- youtube-dl:
- static-ffmpeg
Testing
- All tests are run by
tox
, simply go to the project directory root and run it.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Close
Hashes for transcribe-anything-1.1.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2266d0bce468ba983472589c933b6954385f294a1f3827cf3bf1962d9ee1ac58 |
|
MD5 | 23d1d8a1312fec34add241f91450b898 |
|
BLAKE2b-256 | 4777cbb1464a7290225ec500076f1dd83dee77a9dee1582d8f0cc37069047266 |
Close
Hashes for transcribe_anything-1.1.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 1ac5e631ecd0d2bd7ef0a2f7b3509385a084abbe40a9e96bfad14174fb90e354 |
|
MD5 | 2238dd9f3aaf3f7863a03499aa767dce |
|
BLAKE2b-256 | ea5f2b69746a1dfbdf0475be633df22c9b1f78a4774664800f07200ddffa1e5a |