Rhino Speech-to-Intent engine.

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Multimedia :: Sound/Audio :: Speech

Project description

Rhino Speech-to-Intent Engine

Made in Vancouver, Canada by Picovoice

Rhino is Picovoice's Speech-to-Intent engine. It directly infers intent from spoken commands within a given context of interest, in real-time. For example, given a spoken command "Can I have a small double-shot espresso with a lot of sugar and some milk", Rhino infers that the user wants to order a drink with these specifications:

{
  "type": "espresso",
  "size": "small",
  "numberOfShots": "2",
  "sugar": "a lot",
  "milk": "some"
}

Rhino is:

using deep neural networks trained in real-world environments.
compact and computationally-efficient, making it perfect for IoT.
self-service. Developers and designers can train custom models using Picovoice Console.

Compatibility

Python 3
Runs on Linux (x86_64), Mac (x86_64), Windows (x86_64), Raspberry Pi (all variants), and BeagleBone.

Installation

pip3 install pvrhino

Usage

Create an instance of the engine:

import pvrhino

handle = pvrhino.create(context_path='/absolute/path/to/context')

Where context_path is the absolute path to Speech-to-Intent context created either using Picovoice Console or one of the default contexts available on Rhino's GitHub repository.

The sensitivity of the engine can be tuned using the sensitivity parameter. It is a floating point number within [0, 1]. A higher sensitivity value results in fewer misses at the cost of (potentially) increasing the erroneous inference rate.

import pvrhino

handle = pvrhino.create(context_path='/absolute/path/to/context', sensitivity=0.25)

When initialized, the valid sample rate is given by handle.sample_rate. Expected frame length (number of audio samples in an input array) is handle.frame_length. The engine accepts 16-bit linearly-encoded PCM and operates on single-channel audio.

def get_next_audio_frame():
    pass

while True:
    is_finalized = rhino.process(get_next_audio_frame())

    if is_finalized:
        inference = rhino.get_inference()
        if not inference.is_understood:
            # add code to handle unsupported commands
            pass
        else:
            intent = inference.intent
            slots = inference.slots
            # add code to take action based on inferred intent and slot values

When done resources have to be released explicitly:

handle.delete()

Demos

pvrhinodemo provides command-line utilities for processing real-time audio (i.e. microphone) and files using Rhino.

Project details

These details have not been verified by PyPI

Project links

Homepage

GitHub Statistics

View statistics for this project via Libraries.io, or by using our public dataset on Google BigQuery

Development Status
- 5 - Production/Stable
Intended Audience
- Developers
License
- OSI Approved :: Apache Software License
Operating System
- OS Independent
Programming Language
- Python :: 3
Topic
- Multimedia :: Sound/Audio :: Speech

Release history Release notifications | RSS feed

3.0.2

Jan 31, 2024

3.0.1

Nov 16, 2023

3.0.0

Oct 25, 2023

2.2.1

Jun 7, 2023

2.2.0

Apr 12, 2023

2.1.7

Jun 29, 2022

2.1.6

Jun 10, 2022

2.1.5

Jun 10, 2022

2.1.4

May 31, 2022

2.1.3

May 12, 2022

2.1.2

Mar 11, 2022

2.1.1

Feb 4, 2022

2.1.0

Jan 19, 2022

2.0.2

Dec 21, 2021

2.0.1

Nov 30, 2021

2.0.0

Nov 23, 2021

1.6.3

Apr 14, 2021

This version

1.6.2

Mar 5, 2021

1.6.1

Feb 26, 2021

1.6.0

Dec 2, 2020

1.5.0

Oct 2, 2020

1.3.6

Oct 2, 2020

1.3.5

Oct 2, 2020

1.3.1

Sep 25, 2020

1.3.0

Sep 24, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pvrhino-1.6.2.tar.gz (2.5 MB view hashes)

Uploaded Mar 5, 2021 Source

Built Distribution

pvrhino-1.6.2-py3-none-any.whl (2.5 MB view hashes)

Uploaded Mar 5, 2021 Python 3

Hashes for pvrhino-1.6.2.tar.gz

Hashes for pvrhino-1.6.2.tar.gz
Algorithm	Hash digest
SHA256	`c0fd14c17fb8069bca80709ac6ab3ab59dfec03558f485c10f864e8998ce9046`
MD5	`c82d21369109a9ade9244a71c776210c`
BLAKE2b-256	`2fd7ad058074d4c0c9bd2958142cb48201f2379fc9151aeb8b87faa45b9a0a1f`

Hashes for pvrhino-1.6.2-py3-none-any.whl

Hashes for pvrhino-1.6.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c9f523aae992588165bd2758cd8ca2c869e2958f5b0ab4dcb96641cb08518b82`
MD5	`66538e94fc07624b161823dd56349775`
BLAKE2b-256	`9a86d8b1806589e520eb93b3528c7926d45e4884099b4afa07c8a1d02dfaee79`