Mozilla's DeepSpeech transcriber in a pip installable package.
Project description
pydeepspeech
- The simpliest way to use AI to generate transcriptions from a wav file.
- This project uses the Mozilla DeepSpeech engine built from the included demo:
Quick start
Install to current python environment
Console api:
$ pip install pydeepspeech
$ pydeepspeech --wav_file <WAVE_FILE> --out_file <TEXT_FILE>
-or-
$ pydeepspeech --wav_file <WAVE_FILE> --out_file <TEXT_FILE> --model_dir <MY_PBMM_AND_SCORER_FILES>
-or-
$ pydeepspeech_installmodels --pbmm <PBMM_FILE_OR_URL> --scorer <SCORER_FILE_OR_URL>
$ pydeepspeech --wav_file <WAVE_FILE> --out_file <TEXT_FILE>
Or in python
from pydeepspeech.transcribe import transcribe
transcribe(...)
Optional: Create a virtual python package
Download and install virtual env:
# Download
curl -X GET https://raw.githubusercontent.com/zackees/make_venv/main/make_venv.py -o make_env.py
python make_env.p # Make the environment
source activate.sh # Enter environment
To get back into the environment execute source activate.sh (if windows, you must be using git-bash)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
pydeepspeech-1.1.4.tar.gz
(10.5 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pydeepspeech-1.1.4.tar.gz.
File metadata
- Download URL: pydeepspeech-1.1.4.tar.gz
- Upload date:
- Size: 10.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.9.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
738ffe2d088e5fb1bbe4eb2852cd303a7a0b44d6486bfbe9fdd7d5484510bb89
|
|
| MD5 |
27287fe2e5595a8852e3d7b5679dd67e
|
|
| BLAKE2b-256 |
bb7169c98d1940cf7cd6283732e7a5c1aaacd9af9fc39a200bbf734b8adbe4e7
|
File details
Details for the file pydeepspeech-1.1.4-py2.py3-none-any.whl.
File metadata
- Download URL: pydeepspeech-1.1.4-py2.py3-none-any.whl
- Upload date:
- Size: 11.6 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.9.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3a9810df8b0e743894079bcf5775f0751f1d8c6b8cb89a816d161e7044f6edbd
|
|
| MD5 |
382c06282f9398b583ebc385cfba354d
|
|
| BLAKE2b-256 |
2ce69cb53181374701a560760d7c65867ef5163b4d5c5dd307b5d413a3804e80
|