Mozilla's DeepSpeech transcriber in a pip-installable package.
Project description
pydeepspeech
- The simplest way to use AI to generate transcriptions from a WAV file.
- This project uses the Mozilla DeepSpeech engine built from the included demo:
Quick start
Install into the current Python environment
Console API:
$ pip install pydeepspeech
$ pydeepspeech --wav_file <WAVE_FILE> --aggressive 1 --out_file <TEXT_FILE>
-or-
$ pydeepspeech --wav_file <WAVE_FILE> --out_file <TEXT_FILE> --model_dir <MY_PBMM_AND_SCORER_FILES>
-or-
$ pydeepspeech_installmodels --pbmm <PBMM_FILE_OR_URL> --scorer <SCORER_FILE_OR_URL>
$ pydeepspeech --wav_file <WAVE_FILE> --out_file <TEXT_FILE>
Or in Python:
from pydeepspeech.transcribe import transcribe
transcribe(...)
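The arguments to `transcribe` are not listed here, so the following is a minimal sketch that drives the console script from Python instead, using only the flags shown in the commands above. The helper names `build_command` and `transcribe_via_cli` are hypothetical. It also assumes that `--model_dir` and `--aggressive` can be combined and that the output file is UTF-8 text; neither is confirmed by this page.

```python
import shutil
import subprocess
from pathlib import Path


def build_command(wav_file, out_file, model_dir=None, aggressive=None):
    """Assemble a pydeepspeech command line (hypothetical helper).

    Only flags that appear in the quick start are used. Combining
    --model_dir with --aggressive is an assumption.
    """
    cmd = ["pydeepspeech", "--wav_file", str(wav_file), "--out_file", str(out_file)]
    if model_dir is not None:
        cmd += ["--model_dir", str(model_dir)]
    if aggressive is not None:
        cmd += ["--aggressive", str(aggressive)]
    return cmd


def transcribe_via_cli(wav_file, out_file, **options):
    """Run the console script and return the transcript text (hypothetical helper)."""
    if shutil.which("pydeepspeech") is None:
        raise RuntimeError("pydeepspeech is not on PATH; run 'pip install pydeepspeech' first")
    # check=True raises CalledProcessError if the command exits with an error.
    subprocess.run(build_command(wav_file, out_file, **options), check=True)
    # Assumption: the transcript is written as UTF-8 text.
    return Path(out_file).read_text(encoding="utf-8")
```

With the package installed, `transcribe_via_cli("speech.wav", "speech.txt", aggressive=1)` would mirror the first console command above.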
Optional: Create a virtual Python environment
Download and install the virtual environment:
# Download
curl -X GET https://raw.githubusercontent.com/zackees/make_venv/main/make_venv.py -o make_env.py
python make_env.py # Make the environment
source activate.sh # Enter environment
pip install -e .
To get back into the environment, run source activate.sh (on Windows, you must use git-bash).
Testing
Testing and linting are simple: just run tox.
$ tox
Project details
Download files
Source Distribution
Built Distribution
File details
Details for the file pydeepspeech-1.1.6.tar.gz.
File metadata
- Download URL: pydeepspeech-1.1.6.tar.gz
- Upload date:
- Size: 10.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.9.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.0
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 8b0ea565a31f76bfb81f213c52d63c5a6a2cd3c3aff52abece6fb28a0773d53e |
| MD5 | 71a713db360da45ab70eee5d123b9e5c |
| BLAKE2b-256 | 7088f705bfdf8d2d1757e5ee3979bfe28919ecbfc8e8e5259e13731ee4f5c485 |
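A downloaded archive can be checked against the SHA256 digest listed for pydeepspeech-1.1.6.tar.gz. The sketch below is one way to do that with the standard library; the helper names `sha256_of` and `matches_expected` are hypothetical and not part of pydeepspeech.

```python
import hashlib

# SHA256 digest listed for pydeepspeech-1.1.6.tar.gz
EXPECTED_SHA256 = "8b0ea565a31f76bfb81f213c52d63c5a6a2cd3c3aff52abece6fb28a0773d53e"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops once read() returns empty bytes.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Compare a file's digest with the expected hex string, ignoring case."""
    return sha256_of(path) == expected.lower()
```

For example, `matches_expected("pydeepspeech-1.1.6.tar.gz")` should return True for an intact download of the source distribution.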
File details
Details for the file pydeepspeech-1.1.6-py2.py3-none-any.whl.
File metadata
- Download URL: pydeepspeech-1.1.6-py2.py3-none-any.whl
- Upload date:
- Size: 15.0 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.7.1 importlib_metadata/4.9.0 pkginfo/1.8.2 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.3 CPython/3.8.0
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 0366a2c41e9866ded96cf1f8a4952e828bfc05b9f58d612c0143057d1a71693a |
| MD5 | d8ea2140fa3be74168b4d2c1373166ce |
| BLAKE2b-256 | ec3c710d7d5816a5f0b88fa5b7452e037513fc806e8ce74bc03e9ef9ee389586 |