Provides Phoneme Error Rate & Visualisation Assessment
Author: Jonathan Lim Wei Siang

Email: Jonathanlimws@gmail.com
# Main Function

This Python package lets you assess the phoneme error rate of an ASR model's output and visualise the errors.
# Workflow of package

1. A TIMIT file is loaded.
2. The ASR model converts the audio.wav file into a phoneme string.
3. The ASR phoneme string is standardized to the TIMIT standard to allow for comparison.
4. The phoneme error rate is generated.
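Step 4 can be illustrated with a minimal sketch. It assumes the phoneme error rate is the Levenshtein (edit) distance between the reference and predicted phoneme sequences, divided by the reference length, which is the usual definition; the package's own implementation is not shown here and may differ. The function name `phoneme_error_rate` and the example phonemes are hypothetical, and both sequences are assumed to already use the same symbol set (step 3).

```python
def phoneme_error_rate(reference, hypothesis):
    """Edit distance between two phoneme lists, divided by the reference length."""
    if not reference:
        raise ValueError("reference must contain at least one phoneme")
    # previous[j] is the edit distance between the reference prefix
    # processed so far and hypothesis[:j].
    previous = list(range(len(hypothesis) + 1))
    for i, ref_ph in enumerate(reference, start=1):
        current = [i]
        for j, hyp_ph in enumerate(hypothesis, start=1):
            substitution = previous[j - 1] + (ref_ph != hyp_ph)
            deletion = previous[j] + 1
            insertion = current[j - 1] + 1
            current.append(min(substitution, deletion, insertion))
        previous = current
    return previous[-1] / len(reference)


# Illustrative sequences (not taken from the package): one deletion ("hh")
# and one substitution ("d" -> "t") against a five-phoneme reference.
reference = "sh iy hh ae d".split()
hypothesis = "sh iy ae t".split()
per = phoneme_error_rate(reference, hypothesis)
print(per)  # 0.4
```

Because insertions also count as errors, a rate above 1.0 is possible when the prediction is much longer than the reference.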
## Additional Noise

Noise can be added to the audio.wav file to test the ASR model, after which steps 2-4 are repeated.
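The README does not say what kind of noise the package adds or how its level is controlled. As one possible approach, the sketch below adds Gaussian white noise scaled to a target signal-to-noise ratio in decibels. The function name `add_white_noise`, the `snr_db` parameter, and the synthetic tone standing in for decoded audio are all assumptions made for illustration.

```python
import math
import random


def add_white_noise(samples, snr_db, seed=None):
    """Return a copy of `samples` with Gaussian white noise at the requested SNR (dB)."""
    if not samples:
        raise ValueError("samples must not be empty")
    rng = random.Random(seed)
    signal_power = sum(s * s for s in samples) / len(samples)
    # SNR(dB) = 10 * log10(signal_power / noise_power), solved for noise_power.
    noise_power = signal_power / (10 ** (snr_db / 10))
    sigma = math.sqrt(noise_power)
    return [s + rng.gauss(0.0, sigma) for s in samples]


# Stand-in for decoded audio: one second of a 440 Hz tone at 16 kHz.
sample_rate = 16000
clean = [math.sin(2 * math.pi * 440 * n / sample_rate) for n in range(sample_rate)]
noisy = add_white_noise(clean, snr_db=10, seed=0)
```

Repeating steps 2-4 over several `snr_db` values would give one error rate per noise level, which is the kind of data a plot across varying noise needs.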
# Types of Plots

1. Boxplot of the accuracy rate for each phoneme across the selected TIMIT files.
2. Stacked boxplot of the accuracy rate across varying levels of added noise.
3. Time/frequency plot of any given TIMIT audio file, showing the timing and phoneme of each incorrect prediction (substitutions and deletions only).

![alt text](/Users/jonathanlim/Desktop/Code_2022/ASR_Assessment/Screenshot 2022-08-12 at 16.57.12.png?raw=True)
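The third plot needs to know which reference phonemes were substituted or deleted and when they occur. How the package aligns the two sequences is not documented here, so the sketch below uses `difflib.SequenceMatcher` from the standard library as a stand-in aligner; it produces a reasonable alignment but not necessarily the minimum-edit-distance one. The function name `flag_reference_errors` and the segment timings are hypothetical.

```python
from difflib import SequenceMatcher


def flag_reference_errors(reference_segments, hypothesis):
    """Label each (start, end, phoneme) reference segment as correct, substitution, or deletion."""
    reference = [phoneme for _, _, phoneme in reference_segments]
    matcher = SequenceMatcher(None, reference, hypothesis, autojunk=False)
    labels = ["correct"] * len(reference)
    for tag, i1, i2, _j1, _j2 in matcher.get_opcodes():
        if tag == "replace":
            labels[i1:i2] = ["substitution"] * (i2 - i1)
        elif tag == "delete":
            labels[i1:i2] = ["deletion"] * (i2 - i1)
        # "insert" opcodes have no reference segment, so nothing is marked,
        # matching the "substitutions and deletions only" rule of the plot.
    return [
        (start, end, phoneme, label)
        for (start, end, phoneme), label in zip(reference_segments, labels)
    ]


# Illustrative timings in seconds (made up for this example).
segments = [
    (0.00, 0.09, "sh"),
    (0.09, 0.17, "iy"),
    (0.17, 0.22, "hh"),
    (0.22, 0.35, "ae"),
    (0.35, 0.40, "d"),
]
hypothesis = "sh iy ae t".split()
errors = [row for row in flag_reference_errors(segments, hypothesis) if row[3] != "correct"]
```

Each entry in `errors` carries a start time, end time, phoneme, and error type, which is enough to shade the matching time span on a time/frequency plot.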
# Requirements

TIMIT file: TIMIT is a corpus of phonemically and lexically transcribed speech of American English speakers of different sexes and dialects.

Link: https://en.wikipedia.org/wiki/TIMIT
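For readers unfamiliar with the corpus, the phonetic transcription of each TIMIT utterance is stored in a `.PHN` file with one segment per line: start sample, end sample, and phoneme label, with audio sampled at 16 kHz. The sketch below parses that layout from a string. The function name `parse_phn` is hypothetical and is not part of this package's documented API, and the example lines are illustrative rather than copied from a specific corpus file.

```python
TIMIT_SAMPLE_RATE = 16000  # TIMIT audio is sampled at 16 kHz


def parse_phn(text, sample_rate=TIMIT_SAMPLE_RATE):
    """Parse .PHN-style text into (start_seconds, end_seconds, phoneme) tuples."""
    segments = []
    for line in text.splitlines():
        parts = line.split()
        if len(parts) != 3:
            continue  # skip blank or malformed lines
        start, end, phoneme = int(parts[0]), int(parts[1]), parts[2]
        segments.append((start / sample_rate, end / sample_rate, phoneme))
    return segments


# Illustrative content in the .PHN layout.
example_phn = """\
0 3050 h#
3050 4559 sh
4559 5723 ix
"""
segments = parse_phn(example_phn)
reference_phonemes = [phoneme for _, _, phoneme in segments]
```

To read a real file, pass the result of `open(path).read()` to `parse_phn`.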
Change Log
0.0.1 (12/08/2022)
First Release
Source Distribution
File details
Details for the file asr_phoneme_assessment-0.0.1.tar.gz.
File metadata
- Download URL: asr_phoneme_assessment-0.0.1.tar.gz
- Upload date:
- Size: 13.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.6
File hashes
Algorithm | Hash digest
---|---
SHA256 | 2ab7804b441d3fd8929170cd3ace1fd5e2b0a577e31be2c1f82b2c140851187f
MD5 | 6015be77177b24f5a4e3fbc147038b94
BLAKE2b-256 | 4fe6bc31dbb2454169945f03dac53ebf34c370eed1da27856e7ee43bfe319ff0