Mozilla's DeepSpeech transcriber in a pip installable package.
Project description
pydeepspeech
- The simpliest way to use AI to generate transcriptions from a wav file.
- This project uses the Mozilla DeepSpeech engine built from the included demo:
Quick start
Install to current python environment
Console api:
$ pip install pydeepspeech
$ pydeepspeech --wav_file <WAVE_FILE> --out_file <TEXT_FILE>
-or-
$ pydeepspeech --wav_file <WAVE_FILE> --out_file <TEXT_FILE> --model_dir <MY_PBMM_AND_SCORER_FILES>
-or-
$ pydeepspeech_installmodels --pbmm <PBMM_FILE_OR_URL> --scorer <SCORER_FILE_OR_URL>
$ pydeepspeech --wav_file <WAVE_FILE> --out_file <TEXT_FILE>
Or in python
from pydeepspeech.transcribe import transcribe
transcribe(...)
Optional: Create a virtual python package
Download and install virtual env:
# Download
curl -X GET https://raw.githubusercontent.com/zackees/make_venv/main/make_venv.py -o make_env.py
python make_env.p # Make the environment
source activate.sh # Enter environment
To get back into the environment execute source activate.sh
(if windows, you must be using git-bash)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pydeepspeech-1.1.4.tar.gz
(10.5 kB
view hashes)
Built Distribution
Close
Hashes for pydeepspeech-1.1.4-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3a9810df8b0e743894079bcf5775f0751f1d8c6b8cb89a816d161e7044f6edbd |
|
MD5 | 382c06282f9398b583ebc385cfba354d |
|
BLAKE2b-256 | 2ce69cb53181374701a560760d7c65867ef5163b4d5c5dd307b5d413a3804e80 |