ovos-core listener daemon client

These details have not been verified by PyPI

Project links

Homepage

Project description

OpenVoiceOS Dinkum Listener

Documentation can be found in the technical manual

Install

pip install ovos-dinkum-listener[extras] to install this package and the default plugins. Note that by default, either tensorflow or tflite_runtime will need to be installed separately for wakeword detection.

If unable to install tflite_runtime in your platform, you can find wheels here https://whl.smartgic.io/. eg, for pyhon 3.11 in x86 pip install https://whl.smartgic.io/tflite_runtime-2.13.0-cp311-cp311-linux_x86_64.whl

Without extras, wakeword and STT audio upload will be disabled unless you install ovos-backend-client separately. You will also need to manually install, and possibly configure STT, WW, and VAD modules as described below.

Using ovos-vad-plugin-silero is strongly recommended

Configuration

you can set the Wakeword, VAD, STT and Microphone plugins

eg, to run under MacOS you should use https://github.com/OpenVoiceOS/ovos-microphone-plugin-sounddevice

non exhaustive list of config options

{
  "stt": {
    "module": "ovos-stt-plugin-server",
    "fallback_module": "",
    "ovos-stt-plugin-server": {"url": "https://stt.openvoiceos.com/stt"}
  },
  "listener": {
    // NOTE, multiple hotwords are supported, these fields define the main wake_word,
    // this is equivalent to setting "active": true in the "hotwords" section
    // see "hotwords" section at https://github.com/OpenVoiceOS/ovos-config/blob/dev/ovos_config/mycroft.conf
    "wake_word": "hey_mycroft",
    "stand_up_word": "wake_up",
    "microphone": {
      "module": "ovos-microphone-plugin-alsa"
    },
    VAD": {
     // recommended plugin: "ovos-vad-plugin-silero"
     "module": "ovos-vad-plugin-silero",
     "ovos-vad-plugin-silero": {"threshold": 0.2},
     "ovos-vad-plugin-webrtcvad": {"vad_mode": 3}
    },
    // Seconds of speech before voice command has begun
    "speech_begin": 0.1,
    // Seconds of silence before a voice command has finished
    "silence_end": 0.5,
    // Settings used by microphone to set recording timeout with and without speech detected
    "recording_timeout": 10.0,
    // Settings used by microphone to set recording timeout without speech detected.
    "recording_timeout_with_silence": 3.0,
    // max time allowed without user speaking before exiting RECORDING mode
    "recording_mode_max_silence_seconds": 30.0,
    // Setting to remove all silence/noise from start and end of recorded speech (only non-streaming)
    "remove_silence": true,
    // continuous listen is an experimental setting, it removes the need for
    // wake words and uses VAD only, a streaming STT is strongly recommended
    // NOTE: depending on hardware this may cause mycroft to hear its own TTS responses as questions
    "continuous_listen": false,

    // hybrid listen is an experimental setting,
    // it will not require a wake word for X seconds after a user interaction
    // this means you dont need to say "hey mycroft" for follow up questions
    "hybrid_listen": false,
    // number of seconds to wait for an interaction before requiring wake word again
    "listen_timeout": 45
  }
}

Tips and tricks

Saving Transcriptions

You can enable saving of recordings to file, this should be your first step to diagnose problems, is the audio inteligible? is it being cropped? too noisy? low volume?

set "save_utterances": true in your listener config, recordings will be saved to ~/.local/share/mycroft/listener/utterances

If the recorded audio looks good to you, maybe you need to use a different STT plugin, maybe the one you are using does not like your microphone, or just isn't very good for your language

Wrong Transcriptions

If you consistently get specific words or utterances transcribed wrong, you can remedy around this to some extent by using the ovos-utterance-corrections-plugin

You can define replacements at word level ~/.local/share/mycroft/word_corrections.json

for example whisper STT often gets artist names wrong, this allows you to correct them

{
    "Jimmy Hendricks": "Jimi Hendrix",
    "Eric Klapptern": "Eric Clapton",
    "Eric Klappton": "Eric Clapton"
}

Silence Removal

By default OVOS applies VAD (Voice Activity Detection) to crop silence from the audio sent to STT, this helps in performance and in accuracy (reduces hallucinations in plugins like FasterWhisper)

Depending on your microphone/VAD plugin, this might be removing too much audio

set "remove_silence": false in your listener config, this will send the full audio recording to STT

Listen Sound

does your listen sound contain speech? some users replace the "ding" sound with words such as "yes?"

In this case the listen sound will be sent to STT and might negatively affect the transcription

set "instant_listen": false in your listener config, this will drop the listen sound audio from the STT audio buffer. You will need to wait for the listen sound to finish before speaking your command in this case

Credits

Voice Loop state machine implementation by @Synesthesiam for mycroft-dinkum

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.3.4

Nov 20, 2024

This version

0.3.4a1 pre-release

Nov 20, 2024

0.3.3

Nov 19, 2024

0.3.3a1 pre-release

Nov 19, 2024

0.3.2

Nov 5, 2024

0.3.2a1 pre-release

Nov 5, 2024

0.3.1

Oct 24, 2024

0.3.1a1 pre-release

Oct 24, 2024

0.3.0

Oct 23, 2024

0.3.0a1 pre-release

Oct 23, 2024

0.2.4

Oct 21, 2024

0.2.4a1 pre-release

Oct 21, 2024

0.2.3

Oct 19, 2024

0.2.3a1 pre-release

Oct 19, 2024

0.2.2

Oct 19, 2024

0.2.1

Sep 15, 2024

0.2.1a1 pre-release

Sep 15, 2024

0.2.0a1 pre-release

Sep 11, 2024

0.1.3

Sep 11, 2024

0.1.3a1 pre-release

Sep 11, 2024

0.1.2

Sep 11, 2024

0.1.2a1 pre-release

Sep 11, 2024

0.1.1

Sep 10, 2024

0.1.0

Sep 10, 2024

0.1.0a2 pre-release

Sep 10, 2024

0.1.0a1 pre-release

Sep 10, 2024

0.0.3a47 pre-release

Jul 5, 2024

0.0.3a46 pre-release

Jun 20, 2024

0.0.3a45 pre-release

Jun 20, 2024

0.0.3a44 pre-release

Jun 11, 2024

0.0.3a43 pre-release

Jun 9, 2024

0.0.3a42 pre-release

Jun 6, 2024

0.0.3a41 pre-release

Jun 6, 2024

0.0.3a40 pre-release

Jun 3, 2024

0.0.3a39 pre-release

May 30, 2024

0.0.3a38 pre-release

Apr 16, 2024

0.0.3a37 pre-release

Apr 15, 2024

0.0.3a36 pre-release

Apr 13, 2024

0.0.3a35 pre-release

Apr 10, 2024

0.0.3a34 pre-release

Mar 28, 2024

0.0.3a33 pre-release

Mar 21, 2024

0.0.3a32 pre-release

Mar 16, 2024

0.0.3a31 pre-release

Mar 16, 2024

0.0.3a30 pre-release

Mar 11, 2024

0.0.3a29 pre-release

Feb 25, 2024

0.0.3a28 pre-release

Feb 24, 2024

0.0.3a27 pre-release

Jan 13, 2024

0.0.3a26 pre-release

Jan 8, 2024

0.0.3a25 pre-release

Dec 29, 2023

0.0.3a24 pre-release

Dec 29, 2023

0.0.3a23 pre-release

Dec 29, 2023

0.0.3a22 pre-release

Dec 7, 2023

0.0.3a21 pre-release

Dec 6, 2023

0.0.3a20 pre-release

Dec 1, 2023

0.0.3a19 pre-release

Nov 24, 2023

0.0.3a18 pre-release

Nov 24, 2023

0.0.3a17 pre-release

Nov 23, 2023

0.0.3a16 pre-release

Oct 26, 2023

0.0.3a15 pre-release

Oct 25, 2023

0.0.3a14 pre-release

Oct 23, 2023

0.0.3a13 pre-release

Oct 13, 2023

0.0.3a12 pre-release

Oct 12, 2023

0.0.3a11 pre-release

Sep 18, 2023

0.0.3a9 pre-release

Sep 16, 2023

0.0.3a8 pre-release

Sep 14, 2023

0.0.3a7 pre-release

Sep 8, 2023

0.0.3a6 pre-release

Sep 4, 2023

0.0.3a5 pre-release

Aug 18, 2023

0.0.3a4 pre-release

Aug 17, 2023

0.0.3a3 pre-release

Jul 29, 2023

0.0.3a2 pre-release

Jul 27, 2023

0.0.2

Jun 5, 2023

0.0.2a35 pre-release

Jun 3, 2023

0.0.2a34 pre-release

Jun 2, 2023

0.0.2a33 pre-release

Jun 1, 2023

0.0.2a32 pre-release

May 31, 2023

0.0.2a31 pre-release

May 31, 2023

0.0.2a30 pre-release

May 31, 2023

0.0.2a29 pre-release

May 31, 2023

0.0.2a28 pre-release

May 30, 2023

0.0.2a27 pre-release

May 30, 2023

0.0.2a26 pre-release

May 30, 2023

0.0.2a25 pre-release

May 30, 2023

0.0.2a24 pre-release

May 26, 2023

0.0.2a23 pre-release

May 26, 2023

0.0.2a22 pre-release

May 25, 2023

0.0.2a21 pre-release

May 24, 2023

0.0.2a20 pre-release

May 24, 2023

0.0.2a18 pre-release

May 22, 2023

0.0.2a17 pre-release

May 7, 2023

0.0.2a16 pre-release

May 7, 2023

0.0.2a15 pre-release

May 6, 2023

0.0.2a14 pre-release

May 6, 2023

0.0.2a13 pre-release

May 6, 2023

0.0.2a12 pre-release

May 5, 2023

0.0.2a11 pre-release

May 4, 2023

0.0.2a10 pre-release

May 4, 2023

0.0.2a9 pre-release

May 4, 2023

0.0.2a8 pre-release

Apr 26, 2023

0.0.2a7 pre-release

Apr 23, 2023

0.0.2a6 pre-release

Apr 23, 2023

0.0.2a5 pre-release

Apr 22, 2023

0.0.2a4 pre-release

Apr 22, 2023

0.0.2a3 pre-release

Apr 20, 2023

0.0.2a2 pre-release

Apr 10, 2023

0.0.2a1 pre-release

Apr 10, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

ovos-dinkum-listener-0.3.4a1.tar.gz (101.7 kB view details)

Uploaded Nov 20, 2024 Source

Built Distribution

ovos_dinkum_listener-0.3.4a1-py3-none-any.whl (110.1 kB view details)

Uploaded Nov 20, 2024 Python 3

File details

Details for the file ovos-dinkum-listener-0.3.4a1.tar.gz.

File metadata

Download URL: ovos-dinkum-listener-0.3.4a1.tar.gz
Upload date: Nov 20, 2024
Size: 101.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for ovos-dinkum-listener-0.3.4a1.tar.gz
Algorithm	Hash digest
SHA256	`8652da82091ab3134b6608737081b1d61721bf76a0bcd396ece6b0ffcf31d29c`
MD5	`4591fc228369bd493f8b7f3c9ba52b73`
BLAKE2b-256	`8407063acc49c14cfb3e43a626ecf9099400ee04aac0f8cbdc4e0ad3eec8e758`

See more details on using hashes here.

File details

Details for the file ovos_dinkum_listener-0.3.4a1-py3-none-any.whl.

File metadata

Download URL: ovos_dinkum_listener-0.3.4a1-py3-none-any.whl
Upload date: Nov 20, 2024
Size: 110.1 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.9.20

File hashes

Hashes for ovos_dinkum_listener-0.3.4a1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`3485b35983639fa3bb58c00565bd2abbc9fcac033cc0789f3e00ce8c7ce7e374`
MD5	`112dd83df3a4d48fe7e9f19bcab39f66`
BLAKE2b-256	`d72c201a0f7d5fdeeb60fa135d2ea256c478214c5517baf3afc3fe3c1c71b00d`