A pip installable version of the glottal function from jcvazquezc's DisVoice library

Glottal source features

glottal.py

Compute phonation features derived from the glottal source reconstruction from sustained vowels.

Nine descriptors are computed:

  1. Variability of time between consecutive glottal closure instants (GCI)
  2. Average opening quotient (OQ) over consecutive glottal cycles: ratio of the opening phase duration to the duration of the glottal cycle
  3. Variability of the OQ over consecutive glottal cycles
  4. Average normalized amplitude quotient (NAQ) over consecutive glottal cycles: ratio of the amplitude quotient to the duration of the glottal cycle
  5. Variability of the NAQ over consecutive glottal cycles
  6. Average H1H2: difference between the first two harmonics of the glottal flow signal
  7. Variability of H1H2
  8. Average harmonic richness factor (HRF): ratio of the sum of the harmonic amplitudes to the amplitude of the fundamental frequency
  9. Variability of HRF
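
As a rough illustration of the definitions above, the following sketch computes a few of the descriptors from per-cycle measurements. It is not the code used in glottal.py: the function names, the use of the standard deviation as the "variability" measure, the dB form of H1H2, and the choice to sum only the harmonics above the fundamental in HRF are all assumptions made for this example.

```python
import numpy as np

def gci_variability(gci_times):
    """Std of the time between consecutive GCIs (assumed variability measure)."""
    periods = np.diff(np.asarray(gci_times, dtype=float))
    return float(np.std(periods))

def opening_quotient(opening_durations, gci_times):
    """OQ per cycle: opening phase duration / glottal cycle duration."""
    periods = np.diff(np.asarray(gci_times, dtype=float))
    return np.asarray(opening_durations, dtype=float) / periods

def h1h2_db(h1, h2):
    """Difference between the first two harmonic amplitudes, in dB (assumed scale)."""
    return 20.0 * np.log10(h1 / h2)

def harmonic_richness_factor(harmonic_amplitudes):
    """Sum of the harmonics above the fundamental over the fundamental (assumed form)."""
    amps = np.asarray(harmonic_amplitudes, dtype=float)
    return float(np.sum(amps[1:]) / amps[0])

# Hypothetical measurements: four GCIs (seconds) give three glottal cycles.
gci = [0.000, 0.010, 0.020, 0.030]
oq = opening_quotient([0.004, 0.005, 0.006], gci)
oq_mean, oq_std = float(np.mean(oq)), float(np.std(oq))
```

The average and variability descriptors (items 2 to 9) are then the mean and spread of such per-cycle values.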

Static or dynamic matrices can be computed:

The static matrix contains 36 features: (9 descriptors) x (4 functionals: mean, standard deviation, skewness, kurtosis).
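
A minimal sketch of how four functionals can turn a (frames x 9) descriptor matrix into a 36-dimensional static vector is shown here. The function name, the ordering of the functionals in the output, the population (biased) estimators, and the use of excess kurtosis are assumptions for this example, not details confirmed by the library.

```python
import numpy as np

def static_features(dynamic):
    """Collapse a (n_frames, n_descriptors) matrix into 4 * n_descriptors values."""
    x = np.asarray(dynamic, dtype=float)
    mean = x.mean(axis=0)
    std = x.std(axis=0)                      # population standard deviation
    safe_std = np.where(std > 0, std, 1.0)   # avoid dividing by zero on constant columns
    z = (x - mean) / safe_std
    skewness = (z ** 3).mean(axis=0)
    kurtosis = (z ** 4).mean(axis=0) - 3.0   # excess kurtosis (assumed convention)
    return np.hstack([mean, std, skewness, kurtosis])

# Hypothetical dynamic matrix: 50 frames x 9 descriptors of random values.
rng = np.random.default_rng(0)
dyn = rng.normal(size=(50, 9))
feats = static_features(dyn)   # one row of 36 static features
```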

The dynamic matrix contains the 9 descriptors computed for each 200 ms frame of the signal.
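
The framing step behind the dynamic matrix can be sketched as follows. Only the 200 ms frame length comes from the description above; the 50 ms hop, the function name, and the handling of the final partial frame (dropped) are assumptions for illustration.

```python
import numpy as np

def frame_signal(signal, fs, frame_ms=200.0, hop_ms=50.0):
    """Split a 1-D signal into frames of frame_ms, advancing by hop_ms (hop is assumed)."""
    x = np.asarray(signal, dtype=float)
    frame_len = int(round(fs * frame_ms / 1000.0))
    hop_len = int(round(fs * hop_ms / 1000.0))
    if len(x) < frame_len:
        return np.empty((0, frame_len))
    n_frames = 1 + (len(x) - frame_len) // hop_len
    starts = hop_len * np.arange(n_frames)
    return np.stack([x[s:s + frame_len] for s in starts])

# One second of a hypothetical signal sampled at 16 kHz.
fs = 16000
signal = np.arange(fs)
frames = frame_signal(signal, fs)
```

Each row of `frames` would then yield one row of 9 descriptors in the dynamic matrix.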

Notes:

  1. The fundamental frequency is computed using the RAPT algorithm.

The script is called as follows:

python glottal.py <file_or_folder_audio> <file_features> <static (true or false)> <plots (true or false)> <format (txt, csv, torch, kaldi)>

Examples:

python glottal.py "../audios/001_a1_PCGITA.wav" "glottalfeaturesAst.txt" "true" "true" "txt"
python glottal.py "../audios/098_u1_PCGITA.wav" "glottalfeaturesUst.csv" "true" "true" "csv"
python glottal.py "../audios/098_u1_PCGITA.wav" "glottalfeaturesUdyn.pt" "false" "true" "torch"

python glottal.py "../audios/" "glottalfeaturesst.txt" "true" "false" "txt"
python glottal.py "../audios/" "glottalfeaturesst.csv" "true" "false" "csv"
python glottal.py "../audios/" "glottalfeaturesdyn.pt" "false" "false" "torch"

KALDI_ROOT=/home/camilo/Camilo/codes/kaldi-master2
export PATH=$PATH:$KALDI_ROOT/src/featbin/
python glottal.py "../audios/098_u1_PCGITA.wav" "glottalfeaturesUdyn" "false" "false" "kaldi"

python glottal.py "../audios/" "glottalfeaturesdyn" "false" "false" "kaldi"
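
When the script has to be called from other Python code, the argument order used in the examples above can be wrapped in a small helper. This is only a sketch: `build_glottal_command` and `run_glottal` are hypothetical names, the list of formats is limited to those appearing in the examples, and the script is assumed to be in the current working directory.

```python
import subprocess
import sys

ALLOWED_FORMATS = {"txt", "csv", "torch", "kaldi"}  # formats seen in the examples

def build_glottal_command(audio, output, static=True, plots=False, fmt="txt"):
    """Build the argument list in the order used by the command-line examples."""
    if fmt not in ALLOWED_FORMATS:
        raise ValueError(f"unsupported format: {fmt}")
    return [
        sys.executable, "glottal.py", audio, output,
        "true" if static else "false",
        "true" if plots else "false",
        fmt,
    ]

def run_glottal(*args, **kwargs):
    """Run glottal.py with the built command; raises if the script fails."""
    return subprocess.run(build_glottal_command(*args, **kwargs), check=True)

# Static features for a whole folder, written as CSV, without plots.
cmd = build_glottal_command("../audios/", "glottalfeaturesst.csv", static=True, fmt="csv")
```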

Jupyter notebook

Results:

Figure: Glottal analysis from a sustained vowel.

References

[1] Belalcázar-Bolaños, E. A., Orozco-Arroyave, J. R., Vargas-Bonilla, J. F., Haderlein, T., & Nöth, E. (2016, September). Glottal Flow Patterns Analyses for Parkinson’s Disease Detection: Acoustic and Nonlinear Approaches. In International Conference on Text, Speech, and Dialogue (pp. 400-407). Springer.
