Vocal Fry and other Metrics
Project description
Resemble Speech Tools
This package contains collated tools for quantifying issues in speech clips.
The repo currently has support for detecting vocal fry (H2H1 metric proposed by Kane-Drugman):
Usage:
from speechtools.features import get_creak_features
get_creak_features(waveform, fs, f0_min=20, f0_max=500, window=1600, hop=160, use_fixed_windows=False, return_frames=False
Special Parameters
use_fixed_frames
: This needs to beTrue
if user wants to override internally usedwindow
andhop
sizes. IfFalse
, the default window and hop sizes specified by the KaneDrugman papers are used for analysis.return_frames
: WhenFalse
the resulting metric is interpolated to be ofwaveform
length.
Miscellaneous
There are also a few private functions bundled in for signal processing under speechtools.audio_utils
:
- Splitting up the signal into chunks of
win_length
strided byhop_size
- Obtaining frame level LPC residuals
Papers:
- Kane, J., Drugman, T., Gobl, C., (2013) "Improved automatic detection of creak", 27(4), pp. 1028-1047, Computer Speech and Language.
- Drugman, T., Kane, J., Gobl, C. (2012) "Resonator-based creaky voice detection", Proceedings of Interspeech.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Close
Hashes for speech-analysis-tools-0.2.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | b9c326fd17678d32fcef4e3df852f9db44297db0f0f0cd3de793feba868e833f |
|
MD5 | 292610211ba54bb22c4d4707665c46a6 |
|
BLAKE2b-256 | 0ffe7447830335682361f4eef7fefe86e343ccc0938b4d0dece6ba03a562d2fa |