Skip to main content

Eagle Speaker Recognition Engine demos

Project description

Eagle Speaker Recognition Demos

Made in Vancouver, Canada by Picovoice

Eagle

Eagle is an on-device speaker recognition engine. Eagle is:

  • Private; All voice processing runs locally.
  • Cross-Platform:
    • Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64)
    • Android and iOS
    • Chrome, Safari, Firefox, and Edge
    • Raspberry Pi (3, 4, 5)

Compatibility

  • Python 3.9+
  • Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64, arm64), and Raspberry Pi (3, 4, 5).

Installation

pip3 install pveagledemo

AccessKey

Eagle requires a valid Picovoice AccessKey at initialization. AccessKey acts as your credentials when using Eagle SDKs. You can get your AccessKey for free. Make sure to keep your AccessKey secret. Signup or Login to Picovoice Console to get your AccessKey.

Overview

Eagle consists of two distinct steps: Enrollment and Recognition. In the enrollment step, Eagle analyzes a series of utterances from a particular speaker to learn their unique voiceprint. This step results in an EagleProfile object, which can be stored and utilized during inference. During the Recognition step, Eagle compares the incoming frames of audio to the voiceprints of all enrolled speakers in real-time to determine the similarity between them.

Microphone Demo

The microphone demo captures audio input from a microphone that is connected. To run the demo, use the following command in the terminal:

eagle_demo_mic {enroll, test} --access_key ${ACCESS_KEY} ...

Replace ${ACCESS_KEY} with yours obtained from Picovoice Console.

The commands enroll and test are used to create a speaker profile and perform speaker recognition, respectively. Detailed explanations of these commands will be provided in their respective sections.

Furthermore, the demo offers optional arguments, which can be accessed by utilizing the --help argument. By doing so, you will receive a comprehensive listing of the available arguments along with their corresponding descriptions.

eagle_demo_mic --help

Speaker Enrollment

If the demo is executed in the enrollment mode by using the enroll command, it will initiate the enrollment process using the audio captured from the microphone. It will display the progress percentage in the terminal until it reaches 100%. Once completed, it will save the profile of the enrolled speaker to the disk.

eagle_demo_mic enroll --access_key ${ACCESS_KEY} --output_profile_path ${OUTPUT_PROFILE_PATH}

Replace ${OUTPUT_PROFILE_PATH} with the absolute path where the generated profile should be written.

Speaker Recognition

Once the speaker profile for all speakers are created, the demo can be run in the test mode by running the following command:

eagle_demo_mic test --access_key ${ACCESS_KEY} --input_profile_paths ${INPUT_PROFILE_PATH_1 ...}

In this mode, you can include multiple speaker profiles by specifying them with the --input_profile_paths option. Eagle will assess and provide a distinct score for each profile, which will be displayed in the terminal.

Note: the demo uses the default input audio device on your machine. To choose a different device, use the --show_audio_devices option to see a list of available devices and their indexes, then run the above commands with the --audio_device_index option:

eagle_demo_mic enroll --access_key ${ACCESS_KEY} --output_profile_path ${OUTPUT_PROFILE_PATH} --audio_device_index 2

File Demo

Similar to the mic demo, the file demo can be run in two modes: enroll and test

eagle_demo_file {enroll,test} --access_key ${ACCESS_KEY} ...

Replace ${ACCESS_KEY} with yours obtained from Picovoice Console.

The commands enroll and test are used to create a speaker profile and perform speaker recognition, respectively, and will be discussed in detail in their respective sections.

To view the optional arguments for the demo, use the --help argument. This will display a list of available arguments and their descriptions.

eagle_demo_file --help

Speaker Enrollment

To run the demo in enroll mode, you need two additional input arguments along with the AccessKey.

eagle_demo_file enroll --access_key ${ACCESS_KEY} \
  --output_profile_path ${OUTPUT_PROFILE_PATH} --enroll_audio_paths ${ENROLL_AUDIO_PATH_1 ...} 

In this command, {ENROLL_AUDIO_PATH_1 ...} represents the absolute paths to the enroll audio files. If multiple files are provided, Eagle will process all of them. Once the specified files are processed, the demo will generate a profile at ${OUTPUT_PROFILE_PATH}.

Speaker Recognition

The file demo requires a test audio and one or more speaker profiles that were created during the enrollment step.

To run the demo, use the following command in the console:

eagle_demo_file test --access_key ${ACCESS_KEY} \
  --input_profile_paths ${INPUT_PROFILE_PATH_1 ...} --test_audio_path ${TEST_AUDIO_PATH}

The demo will display the result for each enrolled speaker in the terminal.

Optionally, you can also generate a .csv file for further analysis by including the --csv_output_path parameter:

eagle_demo_file test --access_key ${ACCESS_KEY} \
  --input_profile_paths ${INPUT_PROFILE_PATH_1 ...} --test_audio_path ${TEST_AUDIO_PATH} \
  --csv_output_path ${CSV_OUTPUT_PATH}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pveagledemo-2.0.0.tar.gz (13.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pveagledemo-2.0.0-py3-none-any.whl (13.6 kB view details)

Uploaded Python 3

File details

Details for the file pveagledemo-2.0.0.tar.gz.

File metadata

  • Download URL: pveagledemo-2.0.0.tar.gz
  • Upload date:
  • Size: 13.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for pveagledemo-2.0.0.tar.gz
Algorithm Hash digest
SHA256 de640a8a3e1b14883b299b7bb4a6e8ea61cdffc3f870f960b636f5fe0a5defec
MD5 4aa5cd4a6a03d115205c0629d199f921
BLAKE2b-256 82b58636ae53461d65588f8f09daabb4ee6dc7521b42ec44533b81c0ccd86acd

See more details on using hashes here.

File details

Details for the file pveagledemo-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: pveagledemo-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 13.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.3

File hashes

Hashes for pveagledemo-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a446493d027522987326c952d334e23793a7d31a31a6ff91c7bd92e089426ee2
MD5 f8378e3643e62b1f0d8fd62d78f9ac1d
BLAKE2b-256 95e77802e4380f2453e085bf8b2eb3be4cdc24c318ede796723480fae90c699f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page