
Voice Assistant

Open-source customizable voice assistant

Features

  1. The trigger word can be said anywhere in a sentence, for example "play music, Jarvis". The last few seconds of audio input are continuously recorded into memory, so when the trigger word is detected, the pre-recorded audio is processed as well (a conceptual sketch of this rolling buffer follows the list)
  2. Speech is processed continuously while the user is speaking, which, for certain requests, results in a very fast action/response
  3. Integration with the Home Assistant REST API allows custom skills to be defined and HASS entities to be linked via a simple config
  4. Text-to-Speech relies on Amazon Polly and allows picking a custom voice from the available Polly voices
  5. The Voice Assistant REST API enables remote control. For example, it can be used to make Voice Assistant say something when a Home Assistant automation is triggered
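
To make feature 1 concrete, here is a minimal conceptual sketch of a rolling pre-trigger buffer, assuming a 16 kHz mono stream. It is illustrative only and not the package's actual implementation; names like on_audio_frame are hypothetical.

    # Conceptual sketch of the "pre-recorded audio" idea from feature 1:
    # keep the last few seconds of microphone audio in a rolling buffer so
    # speech spoken *before* the trigger word can still be sent to STT.
    import collections

    RATE = 16000     # samples per second (assumed)
    CHUNK = 1024     # samples per frame read from the microphone (assumed)
    PRE_SECONDS = 3  # how much audio to keep before the trigger word

    # a deque with a fixed maxlen drops the oldest frames automatically
    pre_buffer = collections.deque(maxlen=int(RATE / CHUNK * PRE_SECONDS))

    def on_audio_frame(frame: bytes, trigger_detected: bool) -> None:
        """Feed every microphone frame here; flush the buffer on trigger."""
        pre_buffer.append(frame)
        if trigger_detected:
            # everything said just before the trigger word is still here
            audio = b"".join(pre_buffer)
            pre_buffer.clear()
            # ...pass `audio` plus the frames that follow to speech-to-text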

Running in a virtual environment (Python 3.6+)

  1. Install dependencies:
    sudo apt-get update
    sudo apt-get install -y libasound-dev portaudio19-dev libportaudio2 libportaudiocpp0\
    python3-pyaudio python3-venv mpg123
    
  2. Create virtual environment and install with pip:
    python3 -m venv vass-venv
    source vass-venv/bin/activate
    pip install voiceassistant
    
  3. Start with command: vass

Running on Docker

Follow the Docker installation guide. Note that Raspbian only supports installation with the convenience script.

  1. Download and build the Voice Assistant Docker image:

    git clone https://github.com/vadimtitov/voice-assistant
    cd voice-assistant
    docker build -t voice-assistant .
    
  2. Create a folder to store configuration files, e.g.:

    mkdir -p ~/.config/voice-assistant
    
  3. Run with Docker Compose:

  • Create docker-compose.yaml:
    version: '3'
    services:
      voice-assistant:
        container_name: voice-assistant
        image: voice-assistant
        volumes:
          # volume to share config and credentials from host to container
          - <YOUR/CONFIGURATION/FOLDER>:/usr/src/config
        devices:
          # sound devices need to be exposed to the container
          - /dev/snd:/dev/snd
        restart: unless-stopped
        network_mode: host
    
  • Start with: docker-compose up

First-time configuration

On the first start, configuration.yaml and google_credentials.json files will be auto-generated inside the voice assistant configuration folder. If running on Docker, the configuration folder path is chosen manually as per the instructions above. If running in a virtual environment, the folder path defaults to /home/<username>/.config/voiceassistant/

As Voice Assistant relies on external engines for Speech-To-Text and Text-to-Speech, the following needs to be set up:

  1. Google Cloud Speech-To-Text
  • Create a Google Cloud project with the Speech-To-Text API enabled, download service-account credentials in JSON format and save them into the configuration folder as google_credentials.json (replacing the auto-generated file)
  2. AWS for Text-to-Speech
  • Create an AWS account and note down the Access Key ID and Secret Access Key
  • Insert the above into configuration.yaml (a quick way to verify these credentials is sketched below):
    tts:
      aws:
        access_key_id: "YOUR_ACCESS_KEY_ID"
        secret_access_key: "YOUR_SECRET_ACCESS_KEY"
        ...
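
Optionally, you can sanity-check the AWS credentials independently of Voice Assistant with a few lines of boto3. This is only a convenience check, not part of the package; the region and voice below are arbitrary examples.

    # Verify that the keys from configuration.yaml can reach Amazon Polly.
    # Requires `pip install boto3`.
    import boto3

    polly = boto3.client(
        "polly",
        region_name="eu-west-2",            # pick the region closest to you
        aws_access_key_id="YOUR_ACCESS_KEY_ID",
        aws_secret_access_key="YOUR_SECRET_ACCESS_KEY",
    )

    response = polly.synthesize_speech(
        Text="Voice assistant is ready.",
        OutputFormat="mp3",
        VoiceId="Brian",                    # any available Polly voice
    )

    with open("check.mp3", "wb") as f:
        f.write(response["AudioStream"].read())
    # play it back, e.g. with: mpg123 check.mp3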
    

Customization

Currently the following customization options (dotted YAML paths, i.e. nested keys in configuration.yaml) are available; a snippet illustrating how these paths map onto the file follows the list:

  • triggerword.picovoice.word - trigger word to use, the following words are available in Picovoice porcupine: alexa, americano, blueberry, bumblebee, computer, grapefruit, grasshopper, hey google, hey siri, jarvis, ok google, pico clock, picovoice, porcupine, smart mirror, snowboy, terminator, view glass
  • triggerword.picovoice.sensitivity - trigger word sensitivity, a number within [0, 1]
  • stt.google_cloud.language_code - one of the supported language codes for STT
  • tts.aws.region_name - one of the AWS region names; set the one closest to your location
  • tts.aws.voice_id - one of the voices listed in Polly Voice Samples
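
The dotted paths above are just shorthand for nested keys in configuration.yaml. The package reads the file itself; the helper below (get_option is a hypothetical name) only illustrates how a dotted path resolves against the loaded YAML.

    # Illustrative only: resolve a dotted option path against configuration.yaml.
    # Requires PyYAML (`pip install pyyaml`).
    from functools import reduce
    import yaml

    with open("configuration.yaml") as f:   # use your configuration folder path
        config = yaml.safe_load(f)

    def get_option(dotted_path: str) -> object:
        """Walk the nested dicts key by key, e.g. 'tts.aws.voice_id'."""
        return reduce(lambda node, key: node[key], dotted_path.split("."), config)

    print(get_option("triggerword.picovoice.word"))
    print(get_option("tts.aws.voice_id"))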

Integrating with Home Assistant

If you haven't already, enable the Home Assistant API and generate an access token: in the HASS web UI click on your profile, scroll down to "Long-Lived Access Tokens" and click "create token".

Add the following record to configuration.yaml:

hass:
  url: http://<your.hass.address>:8123
  token: <your_access_token>
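
Before starting the assistant, you can optionally confirm that the URL and token are accepted via Home Assistant's standard REST API (this check is independent of the voiceassistant package):

    # A successful call returns HTTP 200 and {"message": "API running."}.
    import requests

    HASS_URL = "http://<your.hass.address>:8123"
    TOKEN = "<your_access_token>"

    resp = requests.get(
        f"{HASS_URL}/api/",
        headers={"Authorization": f"Bearer {TOKEN}"},
    )
    print(resp.status_code, resp.json())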

There are 2 types of integration with HASS:

  1. Exposing HASS entities, for example:

      hass:
        ...
        entities:
          - ids:
            - switch.bedroom_light_1
            - switch.bedroom_light_2
            names:
            - lights
          - ids:
            - cover.curtain_bedroom
            names:
            - curtain
            - blinds
    

    The above sample configuration allows the user to control the lights and blinds with the following phrases:

    • turn on lights, lights on, enable lights, start lights, lights off, disable lights, etc
    • open blinds, blinds up, open curtains, close curtains, etc

    Note that this only supports "binary" types of entities, i.e. those that have turn_on/turn_off or open_cover/close_cover services.

  2. Creating custom skills

    Custom skills can be defined by providing three things:

    1. Arbitrary skill name
    2. List of speech expressions that trigger the skill (can be regex)
    3. List of actions to execute when skill is triggered

    Currently available actions:

    Action             Arguments
    say_from_template  template
    call_service       service, entity_id, any additional data
    fire_event         event_type, event_data
    set_state          entity_id, state
    run_script         script_id

    Consider an example:

      hass:
        ...
        skills:
          - name: good_morning
            expressions:
              - good morning
              - (wake&&time)
            actions:
            - action: run_script
              script_id: good_morning
            - action: say_from_template
              template: Good Morning. The temperature outside is {{ state_attr('weather.my_home','temperature') }} degrees.
    

    When the user says "good morning" or "time to wake up", Voice Assistant will execute two actions (a sketch of the corresponding Home Assistant REST calls follows below):

    • run_script - execute the HASS script with id good_morning
    • say_from_template - say a phrase rendered by a HASS Jinja2 template
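
    For reference, the two actions above roughly correspond to the following standard Home Assistant REST API calls. This is an illustrative sketch, not Voice Assistant's internal code; the URL and token are placeholders.

      # run_script: trigger the HASS script with id "good_morning"
      import requests

      HASS_URL = "http://<your.hass.address>:8123"
      HEADERS = {"Authorization": "Bearer <your_access_token>"}

      requests.post(
          f"{HASS_URL}/api/services/script/turn_on",
          headers=HEADERS,
          json={"entity_id": "script.good_morning"},
      )

      # say_from_template: HASS renders the Jinja2 template and returns plain
      # text, which the assistant then speaks via Polly Text-to-Speech
      rendered = requests.post(
          f"{HASS_URL}/api/template",
          headers=HEADERS,
          json={"template": "The temperature outside is "
                            "{{ state_attr('weather.my_home','temperature') }} degrees."},
      ).text
      print(rendered)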
