Skip to main content

Convenient wrappers for audio signal processing in Python

Project description

Voice Toolbox

The place to solve all your audio signal processing needs.

The current repo is under construction. Goal is to create a repository that contains all voice signal processing functions available from different open source projects and libraries, such as parsel mouth and librosa.

Files

To start: Setup a conda environment and run 'pip3 install -r requirements.txt' before running the available scripts.

Important: if you get an error with parselmouth make sure the installation is 'pip3 install praat-parselmouth'


The script for extracting features is parsel_process.py.

  • To run: "python3 feature_extraction.py [sampling rate] [filepath] [output filepath] --[feature flag]"

feature flags: formants, ZCR, harmonics, rate_of_speech, loudness, pitch_features, spectral_features, energy

Features currently availabe:

  1. Spectral Features:
  • pitch
  • pitch range
  • spectral slope
  • mel-frequency cepstral coefficients (MFCC)
  • mean spectral roll-off
  • median F0 (fundamental frequency)
  1. Rate of Speech and loudness:
  • max intensity
  • mean intensity
  • syllables per second
  • pause rate
  • energy
  1. Harmonics
  • harmonics to noise (HNR)
  • Formants: f1,f2, f3, f4
  • number of zero crossings (ZCR)

Extra Scripts for processed features

For visualization:

  1. visualize_voice.py for all scatter plots along with other plotting features from praat.
  • To run: 'python3 visualize_voice.py'
  1. radar_plot.py for all radar plots
  • To run: 'python3 radar_plot.py'

For PCA analysis of voice data: voice_pca.py is for PCA, RFE and Correlation plot:

    • To run: 'voice_pca.py'

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

voice_toolbox-1.0.0.tar.gz (8.9 kB view details)

Uploaded Source

Built Distribution

voice_toolbox-1.0.0-py3-none-any.whl (8.6 kB view details)

Uploaded Python 3

File details

Details for the file voice_toolbox-1.0.0.tar.gz.

File metadata

  • Download URL: voice_toolbox-1.0.0.tar.gz
  • Upload date:
  • Size: 8.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.0.0 CPython/3.11.5

File hashes

Hashes for voice_toolbox-1.0.0.tar.gz
Algorithm Hash digest
SHA256 4ffabb7357fe6d2b34337cff28024b48b60c8ba0be68c6958ea8a476c2d21f79
MD5 5a4ccc1a64a1bc25aaf0b455e5a85816
BLAKE2b-256 43a407965a1e06f197c857ce1a3ffc3cfee8d2371367bc71b4adb8500db965b5

See more details on using hashes here.

File details

Details for the file voice_toolbox-1.0.0-py3-none-any.whl.

File metadata

File hashes

Hashes for voice_toolbox-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 78bd431dfc6ff815cde486dbdfee404bd73c94eaa5d5527995d58eb46afdd134
MD5 1d896fd13e248bcc66e8bf6dc68f0c60
BLAKE2b-256 42a69e1cf8cc454d4c40359865057739cd8490ec0dde22260ba6964191079288

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page