Skip to main content

Voice engine to build voice enabled applications

Project description

Voice Engine

Build Status

The library is used to create voice interface applications. It includes building blocks such as KWS (keyword spotting), DOA (Direction Of Arrival). There are also elements to measure RMS (dBFS or dB(A)).

Requirements

  • pyaudio
  • numpy
  • snowboy

Installation

Install pyaudio, numpy and snowboy, use virtualenv a virtual python environment.

sudo apt install python-pyaudio python-numpy python-virtualenv
sudo apt-get install swig python-dev libatlas-base-dev build-essential make
git clone --depth 1 https://github.com/Kitt-AI/snowboy.git
cd snowboy
virtualenv --system-site-packages env
source env/bin/activate
python setup.py build
python setup.py bdist_wheel
pip install dist/snowboy*.whl
cd ..
git clone https://github.com/voice-engine/voice-engine.git
cd voice-engine
python setup.py bdist_wheel
pip install dist/*.whl

Get started

To record audio and search keyword "snowboy", see also kws_snowboy.py

import time
from voice_engine.kws import KWS
from voice_engine.source import Source

src = Source()
kws = KWS()
src.link(kws)

def on_detected(keyword):
    print('found {}'.format(keyword))
kws.on_detected = on_detected

kws.start()
src.start()
while True:
    try:
        time.sleep(1)
    except KeyboardInterrupt:
        break
kws.stop()
src.stop()

Building blocks

The library uses gstreamer-like elements which can be linked together as an audio pipeline. One element can connect to more than one other elements.

The topology can be:

Source --> ChannelPicker --> KWS          Source --> ChannelPicker --> KWS --> Alexa
  |                          /\
  V                        /   \
 DOA                   Alexa   Google Asissitant 

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

voice-engine-0.1.3.tar.gz (443.4 kB view details)

Uploaded Source

Built Distribution

voice_engine-0.1.3-py2.py3-none-any.whl (18.7 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file voice-engine-0.1.3.tar.gz.

File metadata

File hashes

Hashes for voice-engine-0.1.3.tar.gz
Algorithm Hash digest
SHA256 01cf84e21457452b7ab9df40b377333cc74d144d5069165cbfed2c77f75eab3f
MD5 d6998f3649842fc4a5d776602d755ed2
BLAKE2b-256 b52b2299457faefa003f8fe19f5e9339f47f92453cd2f5811893d6bd40e13557

See more details on using hashes here.

File details

Details for the file voice_engine-0.1.3-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for voice_engine-0.1.3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 aed60895b793aa898114f58168bcae7055ceaad8eae44ae00e085488fef75ab7
MD5 e78e6a53824cd10656b79e80004177a1
BLAKE2b-256 0a204588f8cc40000a2bab3c51edf5efe705c766a008617304b1a87dbd8c165d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page