Skip to main content

A library for running inference on a DeepSpeech model

Project description

Documentation Task Status

DeepSpeech is an open source Speech-To-Text engine, using a model trained by machine learning techniques based on Baidu’s Deep Speech research paper. Project DeepSpeech uses Google’s TensorFlow to make the implementation easier.

To install and use deepspeech all you have to do is:

# Create and activate a virtualenv
virtualenv -p python3 $HOME/tmp/deepspeech-venv/
source $HOME/tmp/deepspeech-venv/bin/activate

# Install DeepSpeech
pip3 install deepspeech

# Download pre-trained English model and extract
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.5.1/deepspeech-0.5.1-models.tar.gz
tar xvf deepspeech-0.5.1-models.tar.gz

# Download example audio files
curl -LO https://github.com/mozilla/DeepSpeech/releases/download/v0.5.1/audio-0.5.1.tar.gz
tar xvf audio-0.5.1.tar.gz

# Transcribe an audio file
deepspeech --model deepspeech-0.5.1-models/output_graph.pbmm --alphabet deepspeech-0.5.1-models/alphabet.txt --lm deepspeech-0.5.1-models/lm.binary --trie deepspeech-0.5.1-models/trie --audio audio/2830-3980-0043.wav

A pre-trained English model is available for use and can be downloaded using the instructions below. Currently, only 16-bit, 16 kHz, mono-channel WAVE audio files are supported in the Python client. A package with some example audio files is available for download in our release notes.

Quicker inference can be performed using a supported NVIDIA GPU on Linux. See the release notes to find which GPUs are supported. To run deepspeech on a GPU, install the GPU specific package:

# Create and activate a virtualenv
virtualenv -p python3 $HOME/tmp/deepspeech-gpu-venv/
source $HOME/tmp/deepspeech-gpu-venv/bin/activate

# Install DeepSpeech CUDA enabled package
pip3 install deepspeech-gpu

# Transcribe an audio file.
deepspeech --model deepspeech-0.5.1-models/output_graph.pbmm --alphabet deepspeech-0.5.1-models/alphabet.txt --lm deepspeech-0.5.1-models/lm.binary --trie deepspeech-0.5.1-models/trie --audio audio/2830-3980-0043.wav

Please ensure you have the required CUDA dependencies.

See the output of deepspeech -h for more information on the use of deepspeech. (If you experience problems running deepspeech, please check required runtime dependencies).


Table of Contents

Project details


Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

deepspeech_gpu-0.6.0a10-cp37-cp37m-win_amd64.whl (17.3 MB view details)

Uploaded CPython 3.7mWindows x86-64

deepspeech_gpu-0.6.0a10-cp37-cp37m-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 3.7m

deepspeech_gpu-0.6.0a10-cp36-cp36m-win_amd64.whl (17.3 MB view details)

Uploaded CPython 3.6mWindows x86-64

deepspeech_gpu-0.6.0a10-cp36-cp36m-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 3.6m

deepspeech_gpu-0.6.0a10-cp35-cp35m-win_amd64.whl (17.3 MB view details)

Uploaded CPython 3.5mWindows x86-64

deepspeech_gpu-0.6.0a10-cp35-cp35m-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 3.5m

deepspeech_gpu-0.6.0a10-cp34-cp34m-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 3.4m

deepspeech_gpu-0.6.0a10-cp27-cp27mu-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 2.7mu

deepspeech_gpu-0.6.0a10-cp27-cp27m-manylinux1_x86_64.whl (18.7 MB view details)

Uploaded CPython 2.7m

File details

Details for the file deepspeech_gpu-0.6.0a10-cp37-cp37m-win_amd64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a10-cp37-cp37m-win_amd64.whl
  • Upload date:
  • Size: 17.3 MB
  • Tags: CPython 3.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a10-cp37-cp37m-win_amd64.whl
Algorithm Hash digest
SHA256 f0f62c07b131c154d60722bd1b761a595a6a41140b5e2cc81eaeb0ccdeffca10
MD5 362e83d7bd4417a1aef43bcb4c9f3e95
BLAKE2b-256 7786dbf42419a6901945899deb7295c47a79c350ea058fbd07ce1ca596d07550

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a10-cp37-cp37m-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a10-cp37-cp37m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 3.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a10-cp37-cp37m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 2b30dda9f819d7bbb2f6e27951e7f42953c54b34344380922da6a0a20d56ef79
MD5 a3018b8a0f5707a84ace9d0e15ba489b
BLAKE2b-256 00c3355371de79b1a7a2609487f3bc8b00b0172d010bc675e3d0fcd064682d3f

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a10-cp36-cp36m-win_amd64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a10-cp36-cp36m-win_amd64.whl
  • Upload date:
  • Size: 17.3 MB
  • Tags: CPython 3.6m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a10-cp36-cp36m-win_amd64.whl
Algorithm Hash digest
SHA256 fa17ee814f02b9edfba7d5ca776d14e6f1faf78f5879bfbcd8a9630623bedf1a
MD5 a470ea9303e3f281e5f7d588b2daa1c6
BLAKE2b-256 67f1cd9a0ad39da1f9c30b827598db297b8b369087aec85a25d119838400306d

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a10-cp36-cp36m-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a10-cp36-cp36m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 3.6m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a10-cp36-cp36m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 4fa4b671deaa5eb78cb349c8aee70ac728f5d8d1a56d333d5eebdb6269ea42c8
MD5 4f42f487533b0b2d440f85121accc8e3
BLAKE2b-256 ab5624db861ff7a87d32e09d49ab2812182f557a2c2f2b7679ce3a0406df1465

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a10-cp35-cp35m-win_amd64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a10-cp35-cp35m-win_amd64.whl
  • Upload date:
  • Size: 17.3 MB
  • Tags: CPython 3.5m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a10-cp35-cp35m-win_amd64.whl
Algorithm Hash digest
SHA256 d8f2c239c6e9eec9654a6228565e8eebe49008e6001f1fa0ab8c1033d04f103a
MD5 7e13fdd17cfcab1dc1d5797e48c8638a
BLAKE2b-256 843e4e834cb12e707643ce541b14a7d03873f22e28c04cb4a02e1885b3609cdd

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a10-cp35-cp35m-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a10-cp35-cp35m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 3.5m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a10-cp35-cp35m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 f1fb0a4ac700b97c93011f2447b4c1459879ad5b66697dbd49f731f29ea8aa94
MD5 ba9a0671c97584c7b9ab9db4a855ae1d
BLAKE2b-256 a93c36c3767f640798152f31a28ae962709cd69abaef0e741f0d4f1ec2fca02e

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a10-cp34-cp34m-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a10-cp34-cp34m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 3.4m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a10-cp34-cp34m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 940873b03962aa751af2fb6e7e97fe0fcd87f0a9b007aa302729594a223512be
MD5 a69db0e00efd3d789b1c3cc705a6aaa2
BLAKE2b-256 70f6a199e24a71a6593d1e5bf00fcb8361325a5ae8370daeccaf755eaa327304

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a10-cp27-cp27mu-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a10-cp27-cp27mu-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 2.7mu
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a10-cp27-cp27mu-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 a1dfd02732e5401aff5e10852d3220c729a92ab840e2b0ba8aa6ac06c4ab7255
MD5 e5ed54e1e449432859b488bee3dfcd2e
BLAKE2b-256 bd00736b579fa476978b5173be80f2b0c1694782422f9e4296967bc458065f8e

See more details on using hashes here.

File details

Details for the file deepspeech_gpu-0.6.0a10-cp27-cp27m-manylinux1_x86_64.whl.

File metadata

  • Download URL: deepspeech_gpu-0.6.0a10-cp27-cp27m-manylinux1_x86_64.whl
  • Upload date:
  • Size: 18.7 MB
  • Tags: CPython 2.7m
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.19.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.26.0 CPython/3.6.2

File hashes

Hashes for deepspeech_gpu-0.6.0a10-cp27-cp27m-manylinux1_x86_64.whl
Algorithm Hash digest
SHA256 4f2750c76019e1c124ec6963d2857b3fe5c8321c8b2586f36e7ade1707c71748
MD5 57e18f71fb673994e29402ef47975ae2
BLAKE2b-256 d15e2532f2fb16fbaca5ec2d13d3880ce9458179e2c33fb7f5d1c7f361e067de

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page