Vocal Fry and other Metrics
Project description
#Resemble Speech Tools
This package contains collated tools for quantifying issues in speech clips.
The repo currently has support for detecting vocal fry (H2H1 metric proposed by Kane-Drugman):
Usage:
from speechtools.metrics import get_creak_features
get_creak_features(waveform, fs, f0_min=20, f0_max=500, window=1600, hop=160, use_fixed_windows=False, return_frames=False
Special Parameters
use_fixed_frames
: This needs to beTrue
if user wants to override internally usedwindow
andhop
sizes. IfFalse
, the default window and hop sizes specified by the KaneDrugman papers are used for analysis.return_frames
: WhenFalse
the resulting metric is interpolated to be ofwaveform
length.
Miscellaneous
There are also a few private functions bundled in for signal processing under speechtools.audio_utils
:
- Splitting up the signal into chunks of
win_length
strided byhop_size
- Obtaining frame level LPC residuals
Papers:
- Kane, J., Drugman, T., Gobl, C., (2013) "Improved automatic detection of creak", 27(4), pp. 1028-1047, Computer Speech and Language.
- Drugman, T., Kane, J., Gobl, C. (2012) "Resonator-based creaky voice detection", Proceedings of Interspeech.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Close
Hashes for speech-analysis-tools-0.1.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | e65eb2de76087b820f83b14bacf498012f0bd9ada21c3b8be5494e3b420979f3 |
|
MD5 | 655c58674e61c59bc56be1a8dd32921b |
|
BLAKE2b-256 | b622e0b774d1227dd4e6e05a8a40f07f774963c07e35eafcac0fd60d3e21ab47 |