Vocal Fry and other Metrics

Resemble Speech Tools

This package contains collated tools for quantifying issues in speech clips.

The repo currently supports detecting vocal fry using the H2H1 metric proposed by Kane and Drugman (see Papers below).

Usage:

from speechtools.features import get_creak_features

get_creak_features(waveform, fs, f0_min=20, f0_max=500, window=1600, hop=160, use_fixed_windows=False, return_frames=False)

Special Parameters
  • use_fixed_windows: Set this to True to override the internally used window and hop sizes. If False, the default window and hop sizes specified in the Kane and Drugman papers are used for analysis.
  • return_frames: When False, the resulting metric is interpolated to the length of the waveform.
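
To illustrate what interpolating a frame-level metric to waveform length can look like, here is a minimal sketch. It is not the package's implementation: the helper name interpolate_to_waveform, the use of frame centres as time stamps, and linear interpolation via np.interp are all assumptions made for this example.

```python
import numpy as np


def interpolate_to_waveform(frame_values, n_samples, window=1600, hop=160):
    """Hypothetical helper: stretch per-frame values to one value per sample."""
    frame_values = np.asarray(frame_values, dtype=float)
    # Assumption: frame i covers samples [i*hop, i*hop + window), so its
    # value is placed at the centre of that span.
    centres = np.arange(len(frame_values)) * hop + window / 2.0
    sample_idx = np.arange(n_samples)
    # np.interp is linear between centres and holds the first/last value
    # constant for samples outside the range of the centres.
    return np.interp(sample_idx, centres, frame_values)


# Three frames of a toy metric, stretched over the 1920 samples they cover.
frame_metric = [0.0, 1.0, 0.0]
per_sample = interpolate_to_waveform(frame_metric, n_samples=1920)
print(per_sample.shape)
```

With return_frames=True one would presumably work with the shorter per-frame sequence directly and skip a step like this.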

Miscellaneous

There are also a few private functions bundled in for signal processing under speechtools.audio_utils:

  • Splitting up the signal into chunks of win_length strided by hop_size
  • Obtaining frame level LPC residuals
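
The two operations above can be sketched with NumPy as follows. This is an illustration under stated assumptions, not the code in speechtools.audio_utils: the function names frame_signal and lpc_residual, the LPC order of 12, the autocorrelation method, the small diagonal loading term, and the 16 kHz sample rate are all hypothetical choices made for this example.

```python
import numpy as np


def frame_signal(signal, win_length, hop_size):
    """Hypothetical helper: split a 1-D signal into overlapping frames."""
    signal = np.asarray(signal, dtype=float)
    if len(signal) < win_length:
        return np.empty((0, win_length))
    # Only full frames are kept; trailing samples that do not fill a
    # whole frame are dropped.
    n_frames = 1 + (len(signal) - win_length) // hop_size
    starts = hop_size * np.arange(n_frames)[:, None]
    offsets = np.arange(win_length)[None, :]
    return signal[starts + offsets]


def lpc_residual(frame, order=12):
    """Hypothetical helper: LPC residual of one frame (autocorrelation method)."""
    frame = np.asarray(frame, dtype=float)
    n = len(frame)
    # Autocorrelation at lags 0..order.
    r = np.correlate(frame, frame, mode="full")[n - 1:n + order]
    # Toeplitz normal equations R a = r[1:], with light diagonal loading
    # (an assumption of this sketch) to keep the system well conditioned.
    lags = np.abs(np.arange(order)[:, None] - np.arange(order)[None, :])
    R = r[lags] + 1e-9 * (r[0] + 1.0) * np.eye(order)
    a = np.linalg.solve(R, r[1:order + 1])
    # Apply the inverse filter A(z) = 1 - sum_k a_k z^-k to the frame.
    inverse_filter = np.concatenate(([1.0], -a))
    return np.convolve(frame, inverse_filter)[:n]


# Toy check of the framing: 10 samples, frames of 4 with a hop of 2.
toy_frames = frame_signal(np.arange(10), win_length=4, hop_size=2)
print(toy_frames.shape)

# Frame-level residuals for a noisy 120 Hz tone (assumed 16 kHz sample rate).
fs = 16000
rng = np.random.default_rng(0)
t = np.arange(fs // 10) / fs
signal = np.sin(2 * np.pi * 120 * t) + 0.01 * rng.standard_normal(t.size)
frames = frame_signal(signal, win_length=400, hop_size=160)
residuals = np.array([lpc_residual(f) for f in frames])
print(frames.shape, residuals.shape)
```

Because the tone is highly predictable, the residual frames carry much less energy than the input frames, which is the property that makes LPC residuals useful for exposing glottal excitation events.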

Papers:

  1. Kane, J., Drugman, T., Gobl, C. (2013) "Improved automatic detection of creak", Computer Speech and Language, 27(4), pp. 1028-1047.
  2. Drugman, T., Kane, J., Gobl, C. (2012) "Resonator-based creaky voice detection", Proceedings of Interspeech.
