Skip to main content

Picovoice is an end-to-end platform for building voice products on your terms.

This project has been archived.

The maintainers of this project have marked this project as archived. No new releases are expected.

Project description

Picovoice

Made in Vancouver, Canada by Picovoice

Picovoice is an end-to-end platform for building voice products on your terms. It enables creating voice experiences similar to Alexa and Google. But it entirely runs 100% on-device. Picovoice is

  • Private: Everything is processed offline. Intrinsically HIPAA and GDPR-compliant.
  • Reliable: Runs without needing constant connectivity.
  • Zero Latency: Edge-first architecture eliminates unpredictable network delay.
  • Accurate: Resilient to noise and reverberation. It outperforms cloud-based alternatives by wide margins *.
  • Cross-Platform: Design once, deploy anywhere. Build using familiar languages and frameworks.

Compatibility

  • Python 3.8+
  • Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), and Raspberry Pi (Zero, 3, 4, 5).

Installation

pip3 install picovoice

AccessKey

Picovoice requires a valid Picovoice AccessKey at initialization. AccessKey acts as your credentials when using Picovoice SDKs. You can get your AccessKey for free. Make sure to keep your AccessKey secret. Signup or Login to Picovoice Console to get your AccessKey.

Usage

Create a new instance of Picovoice runtime engine

from picovoice import Picovoice

access_key = "${ACCESS_KEY}" # AccessKey obtained from Picovoice Console (https://console.picovoice.ai/)

keyword_path = ...

def wake_word_callback():
    pass

context_path = ...

def inference_callback(inference):
    # `inference` exposes three immutable fields:
    # (1) `is_understood`
    # (2) `intent`
    # (3) `slots`
    pass

picovoice = Picovoice(
        access_key=access_key,
        keyword_path=keyword_path,
        wake_word_callback=wake_word_callback,
        context_path=context_path,
        inference_callback=inference_callback)

picovoice is an instance of Picovoice runtime engine that detects utterances of wake phrase defined in the file located at keyword_path. Upon detection of wake word it starts inferring user's intent from the follow-on voice command within the context defined by the file located at context_path. keyword_path is the absolute path to Porcupine wake word engine keyword file (with .ppn suffix). context_path is the absolute path to Rhino Speech-to-Intent engine context file (with .rhn suffix). wake_word_callback is invoked upon the detection of wake phrase and inference_callback is invoked upon completion of follow-on voice command inference.

When instantiated, valid sample rate can be obtained via .sample_rate. Expected number of audio samples per frame is .frame_length. The engine accepts 16-bit linearly-encoded PCM and operates on single-channel audio.

def get_next_audio_frame():
    pass

while True:
    picovoice.process(get_next_audio_frame())

When done resources have to be released explicitly

picovoice.delete()

Non-English Models

In order to detect wake words and run inference in other languages you need to use the corresponding model file. The model files for all supported languages are available here and here.

Demos

picovoicedemo provides command-line utilities for processing real-time audio (i.e. microphone) and files using Picovoice platform.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

picovoice-3.0.3.tar.gz (12.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

picovoice-3.0.3-py3-none-any.whl (11.0 kB view details)

Uploaded Python 3

File details

Details for the file picovoice-3.0.3.tar.gz.

File metadata

  • Download URL: picovoice-3.0.3.tar.gz
  • Upload date:
  • Size: 12.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.5

File hashes

Hashes for picovoice-3.0.3.tar.gz
Algorithm Hash digest
SHA256 f8958050d847e91f4e571e251b670d4f65ec768991c97a4f735be039f239e006
MD5 608b4d62239e923f2f13471365f85a5a
BLAKE2b-256 e3ef6ce30e00222e5193029eb5f183ffdf122f67cdd012b13213e8fed53ecc89

See more details on using hashes here.

File details

Details for the file picovoice-3.0.3-py3-none-any.whl.

File metadata

  • Download URL: picovoice-3.0.3-py3-none-any.whl
  • Upload date:
  • Size: 11.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.5

File hashes

Hashes for picovoice-3.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 7d9cfebe26e33dd3c851656b180224de8107de0aec9935af32eeea7e9b64260b
MD5 71c0508de1e092a1ee89b9d234b5a36c
BLAKE2b-256 22c22d8bc0b5e3cfb3c0c523e4c95d66d300a9fd2872d796a7413c58220050ff

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page