Vocal Fry and other Metrics
Resemble Speech Tools
This package contains collated tools for quantifying issues in speech clips.
The repo currently supports detecting vocal fry (also called creak or creaky voice) using the H2H1 metric proposed by Kane and Drugman.
Usage:
from speechtools.features import get_creak_features
get_creak_features(waveform, fs, f0_min=20, f0_max=500, window=1600, hop=160, use_fixed_windows=False, return_frames=False)
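To get a feel for the default window and hop arguments, the sketch below converts them from samples to milliseconds. The 16 kHz sampling rate is an assumption made only for illustration (the description does not state one), and samples_to_ms is a hypothetical helper, not part of speechtools.

```python
def samples_to_ms(n_samples, fs):
    """Convert a length in samples to milliseconds at sampling rate fs (Hz)."""
    return 1000.0 * n_samples / fs


fs = 16000  # assumed sampling rate, not stated in the package description

window_ms = samples_to_ms(1600, fs)  # default `window` argument
hop_ms = samples_to_ms(160, fs)      # default `hop` argument
longest_period_ms = 1000.0 / 20      # pitch period at the default f0_min (20 Hz)

print(window_ms, hop_ms, longest_period_ms)
```

Under that assumption the default window spans 100 ms with a 10 ms hop, so one window covers two periods of the lowest default pitch.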
Special Parameters
- use_fixed_windows: Set this to True to override the internally used window and hop sizes with the window and hop arguments. If False, the default window and hop sizes specified in the Kane-Drugman papers are used for analysis.
- return_frames: When False, the resulting metric is interpolated to be of waveform length.
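The interpolation to waveform length can be pictured with the minimal sketch below. The description does not say which interpolation the package uses, so linear interpolation is an assumption here, as is placing frame i at sample i * hop. interpolate_to_length is a hypothetical name, not a speechtools function.

```python
def interpolate_to_length(frame_values, hop, n_samples):
    """Linearly interpolate one value per frame to one value per sample.

    Assumption: frame i sits at sample i * hop. Samples at or past the
    last frame position hold the last frame's value.
    """
    if not frame_values:
        raise ValueError("frame_values must not be empty")
    last = len(frame_values) - 1
    out = []
    for n in range(n_samples):
        pos = n / hop          # fractional frame index of sample n
        i = int(pos)
        if i >= last:
            out.append(float(frame_values[last]))
        else:
            frac = pos - i
            out.append((1.0 - frac) * frame_values[i]
                       + frac * frame_values[i + 1])
    return out


frames = [0.0, 1.0, 0.0]  # toy frame-level metric
per_sample = interpolate_to_length(frames, hop=4, n_samples=12)
print(len(per_sample))
```

The result has one value per sample, which is the shape return_frames=False is described as producing.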
Miscellaneous
There are also a few private functions bundled in for signal processing under speechtools.audio_utils:
- Splitting up the signal into chunks of win_length strided by hop_size
- Obtaining frame-level LPC residuals
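The two operations listed above can be sketched in plain Python as follows. This is an illustration under stated assumptions, not the code in speechtools.audio_utils: the function names are hypothetical, the LPC order is chosen arbitrarily, and the autocorrelation method with the Levinson-Durbin recursion is one common way to get LPC coefficients, which may differ from what the package does.

```python
def split_frames(signal, win_length, hop_size):
    """Return chunks of win_length samples taken every hop_size samples.

    Trailing samples that do not fill a whole frame are dropped.
    """
    return [signal[s:s + win_length]
            for s in range(0, len(signal) - win_length + 1, hop_size)]


def lpc_coefficients(frame, order):
    """Autocorrelation-method LPC solved with Levinson-Durbin.

    Returns a with a[0] == 1, so the residual is
    e[n] = sum over k of a[k] * x[n - k].
    """
    n = len(frame)
    r = [sum(frame[i] * frame[i + lag] for i in range(n - lag))
         for lag in range(order + 1)]
    a = [1.0] + [0.0] * order
    err = r[0]
    for m in range(1, order + 1):
        if err <= 0.0:  # silent or perfectly predicted frame
            break
        acc = sum(a[k] * r[m - k] for k in range(m))
        k_m = -acc / err  # reflection coefficient
        prev = a[:]
        for k in range(1, m + 1):
            a[k] = prev[k] + k_m * prev[m - k]
        err *= 1.0 - k_m * k_m
    return a


def lpc_residual(frame, order):
    """Inverse-filter a frame with its own LPC coefficients."""
    a = lpc_coefficients(frame, order)
    return [sum(a[k] * frame[n - k] for k in range(order + 1) if n - k >= 0)
            for n in range(len(frame))]


# Toy input: a decaying exponential, which a first-order predictor fits well.
signal = [0.5 ** n for n in range(32)]
frames = split_frames(signal, win_length=16, hop_size=8)
residuals = [lpc_residual(f, order=1) for f in frames]
print(len(frames), len(residuals[0]))
```

For this toy signal the residual is close to zero everywhere except the first sample of each frame, where no past samples are available to predict from.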
Papers:
- Kane, J., Drugman, T., Gobl, C. (2013) "Improved automatic detection of creak", Computer Speech and Language, 27(4), pp. 1028-1047.
- Drugman, T., Kane, J., Gobl, C. (2012) "Resonator-based creaky voice detection", Proceedings of Interspeech.