Skip to main content

Python interface to the Google WebRTC Voice Activity Detector (VAD)

Project description

https://travis-ci.org/wiseman/py-webrtcvad.svg?branch=master

py-webrtcvad

This is a python interface to the WebRTC Voice Activity Detector (VAD). It is compatible with Python 2 and Python 3.

A VAD classifies a piece of audio data as being voiced or unvoiced. It can be useful for telephony and speech recognition.

The VAD that Google developed for the WebRTC project is reportedly one of the best available, being fast, modern and free.

How to use it

  1. Install the webrtcvad module:

    pip install webrtcvad
  2. Create a Vad object:

    import webrtcvad
    vad = webrtcvad.Vad()
  3. Optionally, set its aggressiveness mode, which is an integer between 0 and 3. 0 is the least aggressive about filtering out non-speech, 3 is the most aggressive. (You can also set the mode when you create the VAD, e.g. vad = webrtcvad.Vad(3)):

    vad.set_mode(1)
  4. Give it a short segment (“frame”) of audio. The WebRTC VAD only accepts 16-bit mono PCM audio, sampled at 8000, 16000, or 32000 Hz. A frame must be either 10, 20, or 30 ms in duration:

    # Run the VAD on 10 ms of silence. The result should be False.
    sample_rate = 16000
    frame_duration = 10  # ms
    frame = b'\x00\x00' * (sample_rate * frame_duration / 1000)
    print 'Contains speech: %s' % (vad.is_speech(frame, sample_rate)

See example.py for a more detailed example that will process a .wav file, find the voiced segments, and write each one as a separate .wav.

How to run unit tests

To run unit tests:

pip install -e ".[dev]"
python setup.py test

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distributions

webrtcvad_wheels-2.0.10-cp27-cp27m-win_amd64.whl (23.7 kB view details)

Uploaded CPython 2.7mWindows x86-64

webrtcvad_wheels-2.0.10-cp27-cp27m-win32.whl (20.4 kB view details)

Uploaded CPython 2.7mWindows x86

File details

Details for the file webrtcvad_wheels-2.0.10-cp27-cp27m-win_amd64.whl.

File metadata

  • Download URL: webrtcvad_wheels-2.0.10-cp27-cp27m-win_amd64.whl
  • Upload date:
  • Size: 23.7 kB
  • Tags: CPython 2.7m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/2.7.15

File hashes

Hashes for webrtcvad_wheels-2.0.10-cp27-cp27m-win_amd64.whl
Algorithm Hash digest
SHA256 62dff503ac6119a8197b344b2e31cce946078ffe0a8b1aeba58acf22e1e2df09
MD5 b1a421aff1ada4b6b6ddb5f2ee029932
BLAKE2b-256 73cc95526b7bc03da7b7d64fc19208693e1ec7f999f761efa56f0d9c7e8f418c

See more details on using hashes here.

File details

Details for the file webrtcvad_wheels-2.0.10-cp27-cp27m-win32.whl.

File metadata

  • Download URL: webrtcvad_wheels-2.0.10-cp27-cp27m-win32.whl
  • Upload date:
  • Size: 20.4 kB
  • Tags: CPython 2.7m, Windows x86
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/42.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/2.7.15

File hashes

Hashes for webrtcvad_wheels-2.0.10-cp27-cp27m-win32.whl
Algorithm Hash digest
SHA256 b5fb1c6564690101b67b2c06f742e3ad1d6909d170e5cb401a5887cde01beb24
MD5 4219c0e9542464883f9aba30f6408242
BLAKE2b-256 adb4d979d85c5c5c8692590f2b325b9b524a5147c427bc56e0514ea3dd6082fb

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page