Eagle Speaker Recognition Engine demos
Project description
Eagle Speaker Recognition Demos
Made in Vancouver, Canada by Picovoice
Eagle
Eagle is an on-device speaker recognition engine. Eagle is:
- Private; All voice processing runs locally.
- Cross-Platform:
- Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64)
- Android and iOS
- Chrome, Safari, Firefox, and Edge
- Raspberry Pi (4, 3) and NVIDIA Jetson Nano
Compatibility
- Python 3.5+
- Runs on Linux (x86_64), macOS (x86_64, arm64), Windows (x86_64), Raspberry Pi (4, 3), and NVIDIA Jetson Nano.
Installation
pip3 install pveagledemo
AccessKey
Eagle requires a valid Picovoice AccessKey
at initialization. AccessKey
acts as your credentials when using Eagle
SDKs. You can get your AccessKey
for free. Make sure to keep your AccessKey
secret.
Signup or Login to Picovoice Console to get your AccessKey
.
Overview
Eagle consists of two distinct steps: Enrollment and Recognition. In the enrollment step, Eagle analyzes a series of
utterances from a particular speaker to learn their unique voiceprint. This step results in an EagleProfile
object,
which can be stored and utilized during inference. During the Recognition step, Eagle compares the incoming frames of
audio to the voiceprints of all enrolled speakers in real-time to determine the similarity between them.
Microphone Demo
The microphone demo captures audio input from a microphone that is connected. To run the demo, use the following command in the terminal:
eagle_demo_mic {enroll, test} --access_key ${ACCESS_KEY} ...
Replace ${ACCESS_KEY}
with yours obtained from Picovoice Console.
The commands enroll
and test
are used to create a speaker profile and perform speaker recognition, respectively.
Detailed explanations of these commands will be provided in their respective sections.
Furthermore, the demo offers optional arguments, which can be accessed by utilizing the --help
argument. By doing so,
you will receive a comprehensive listing of the available arguments along with their corresponding descriptions.
eagle_demo_mic --help
Speaker Enrollment
If the demo is executed in the enrollment mode by using the enroll
command, it will initiate the enrollment process
using the audio captured from the microphone. It will display the progress percentage in the terminal until it reaches
100%. Once completed, it will save the profile of the enrolled speaker to the disk.
eagle_demo_mic enroll --access_key ${ACCESS_KEY} --output_profile_path ${OUTPUT_PROFILE_PATH}
Replace ${OUTPUT_PROFILE_PATH}
with the absolute path where the generated profile should be written.
Speaker Recognition
Once the speaker profile for all speakers are created, the demo can be run in the test
mode by running the following
command:
eagle_demo_mic test --access_key ${ACCESS_KEY} --input_profile_paths ${INPUT_PROFILE_PATH_1 ...}
In this mode, you can include multiple speaker profiles by specifying them with the --input_profile_paths
option.
Eagle will assess and provide a distinct score for each profile, which will be displayed in the terminal.
File Demo
Similar to the mic demo, the file demo can be run in two modes: enroll
and test
eagle_demo_file {enroll,test} --access_key ${ACCESS_KEY} ...
Replace ${ACCESS_KEY}
with yours obtained from Picovoice Console.
The commands enroll
and test
are used to create a speaker profile and perform speaker recognition, respectively, and
will be discussed in detail in their respective sections.
To view the optional arguments for the demo, use the --help
argument. This will display a list of available arguments
and their descriptions.
eagle_demo_file --help
Speaker Enrollment
To run the demo in enroll
mode, you need two additional input arguments along with the AccessKey.
eagle_demo_file enroll --access_key ${ACCESS_KEY} \
--output_profile_path ${OUTPUT_PROFILE_PATH} --enroll_audio_paths ${ENROLL_AUDIO_PATH_1 ...}
In this command, {ENROLL_AUDIO_PATH_1 ...}
represents the absolute paths to the enroll audio files. If multiple files
are provided, Eagle will process all of them. Once the specified files are processed, the demo will generate a profile at
${OUTPUT_PROFILE_PATH}
.
Speaker Recognition
The file demo requires a test audio and one or more speaker profiles that were created during the enrollment step.
To run the demo, use the following command in the console:
eagle_demo_file test --access_key ${ACCESS_KEY} \
--input_profile_paths ${INPUT_PROFILE_PATH_1 ...} --test_audio_path ${TEST_AUDIO_PATH}
The demo will display the result for each enrolled speaker in the terminal.
Optionally, you can also generate a .csv
file for further analysis by including the --csv_output_path
parameter:
eagle_demo_file test --access_key ${ACCESS_KEY} \
--input_profile_paths ${INPUT_PROFILE_PATH_1 ...} --test_audio_path ${TEST_AUDIO_PATH} \
--csv_output_path ${CSV_OUTPUT_PATH}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file pveagledemo-0.1.2.tar.gz
.
File metadata
- Download URL: pveagledemo-0.1.2.tar.gz
- Upload date:
- Size: 13.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bac1d8777e2d5c818dd2e15ddfc9c0125d516ab787a05f00347ebbd93145193c |
|
MD5 | e7009d21285d19e2dd6e09163da0a0a7 |
|
BLAKE2b-256 | 14995e0880e569a0115706a669a4485e46ff29a9e488514b76febfa2adc34ea4 |
File details
Details for the file pveagledemo-0.1.2-py3-none-any.whl
.
File metadata
- Download URL: pveagledemo-0.1.2-py3-none-any.whl
- Upload date:
- Size: 13.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5cdfd3c3769c74e744dd07ffd26599a300ad629e5b044963e1895ae324e74525 |
|
MD5 | b6580f393c134e51f6b71b43231465bd |
|
BLAKE2b-256 | 1b5d11c5f8c2c363509d98a505150e25ce15acef81dc82339d44dd2d1e8bd9d8 |