Vocal Fry and other Metrics

These details have not been verified by PyPI

Project links

Project description

Resemble Speech Tools

This package contains collated tools for quantifying issues in speech clips.

The repo currently has support for detecting vocal fry (H2H1 metric proposed by Kane-Drugman):

Usage:

from speechtools.features import get_creak_features

get_creak_features(waveform, fs, f0_min=20, f0_max=500, window=1600, hop=160, use_fixed_windows=False, return_frames=False

Special Parameters

use_fixed_frames: This needs to be True if user wants to override internally used window and hop sizes. If False, the default window and hop sizes specified by the KaneDrugman papers are used for analysis.
return_frames: When False the resulting metric is interpolated to be of waveform length.

Miscellaneous

There are also a few private functions bundled in for signal processing under speechtools.audio_utils:

Splitting up the signal into chunks of win_length strided by hop_size
Obtaining frame level LPC residuals

Papers:

Kane, J., Drugman, T., Gobl, C., (2013) "Improved automatic detection of creak", 27(4), pp. 1028-1047, Computer Speech and Language.
Drugman, T., Kane, J., Gobl, C. (2012) "Resonator-based creaky voice detection", Proceedings of Interspeech.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

0.2

Aug 5, 2021

0.1.2

Aug 5, 2021

0.1

Aug 5, 2021

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

speech-analysis-tools-0.2.tar.gz (5.4 kB view details)

Uploaded Aug 5, 2021 Source

File details

Details for the file speech-analysis-tools-0.2.tar.gz.

File metadata

Download URL: speech-analysis-tools-0.2.tar.gz
Upload date: Aug 5, 2021
Size: 5.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.2 importlib_metadata/4.6.3 pkginfo/1.7.1 requests/2.24.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.7.3

File hashes

Hashes for speech-analysis-tools-0.2.tar.gz
Algorithm	Hash digest
SHA256	`b9c326fd17678d32fcef4e3df852f9db44297db0f0f0cd3de793feba868e833f`
MD5	`292610211ba54bb22c4d4707665c46a6`
BLAKE2b-256	`0ffe7447830335682361f4eef7fefe86e343ccc0938b4d0dece6ba03a562d2fa`

See more details on using hashes here.

speech-analysis-tools 0.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Project description

Resemble Speech Tools

Usage:

Special Parameters

Miscellaneous

Papers:

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes