Voice engine to build voice enabled applications
Project description
Voice Engine
The library is used to create voice interface applications. It includes building blocks such as KWS (keyword spotting), DOA (Direction Of Arrival). There are also elements to measure RMS (dBFS or dB(A)).
Requirements
- pyaudio
- numpy
- snowboy
Installation
Install pyaudio, numpy and snowboy, use virtualenv a virtual python environment.
sudo apt install python-pyaudio python-numpy python-virtualenv
sudo apt-get install swig python-dev libatlas-base-dev build-essential make
git clone --depth 1 https://github.com/Kitt-AI/snowboy.git
cd snowboy
virtualenv --system-site-packages env
source env/bin/activate
python setup.py build
python setup.py bdist_wheel
pip install dist/snowboy*.whl
cd ..
git clone https://github.com/voice-engine/voice-engine.git
cd voice-engine
python setup.py bdist_wheel
pip install dist/*.whl
Get started
To record audio and search keyword "snowboy", see also kws_snowboy.py
import time
from voice_engine.kws import KWS
from voice_engine.source import Source
src = Source()
kws = KWS()
src.link(kws)
def on_detected(keyword):
print('found {}'.format(keyword))
kws.on_detected = on_detected
kws.start()
src.start()
while True:
try:
time.sleep(1)
except KeyboardInterrupt:
break
kws.stop()
src.stop()
Building blocks
The library uses gstreamer-like elements which can be linked together as an audio pipeline. One element can connect to more than one other elements.
The topology can be:
Source --> ChannelPicker --> KWS Source --> ChannelPicker --> KWS --> Alexa
| /\
V / \
DOA Alexa Google Asissitant
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file voice-engine-0.1.3.tar.gz
.
File metadata
- Download URL: voice-engine-0.1.3.tar.gz
- Upload date:
- Size: 443.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 01cf84e21457452b7ab9df40b377333cc74d144d5069165cbfed2c77f75eab3f |
|
MD5 | d6998f3649842fc4a5d776602d755ed2 |
|
BLAKE2b-256 | b52b2299457faefa003f8fe19f5e9339f47f92453cd2f5811893d6bd40e13557 |
File details
Details for the file voice_engine-0.1.3-py2.py3-none-any.whl
.
File metadata
- Download URL: voice_engine-0.1.3-py2.py3-none-any.whl
- Upload date:
- Size: 18.7 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | aed60895b793aa898114f58168bcae7055ceaad8eae44ae00e085488fef75ab7 |
|
MD5 | e78e6a53824cd10656b79e80004177a1 |
|
BLAKE2b-256 | 0a204588f8cc40000a2bab3c51edf5efe705c766a008617304b1a87dbd8c165d |