Convenient wrappers for audio signal processing in Python
Project description
Voice Toolbox
The place to solve all your audio signal processing needs.
The current repo is under construction. Goal is to create a repository that contains all voice signal processing functions available from different open source projects and libraries, such as parsel mouth and librosa.
Files
To start: Setup a conda environment and run 'pip3 install -r requirements.txt' before running the available scripts.
Important: if you get an error with parselmouth make sure the installation is 'pip3 install praat-parselmouth'
The script for extracting features is parsel_process.py.
- To run: "python3 feature_extraction.py [sampling rate] [filepath] [output filepath] --[feature flag]"
feature flags: formants, ZCR, harmonics, rate_of_speech, loudness, pitch_features, spectral_features, energy
Features currently availabe:
- Spectral Features:
- pitch
- pitch range
- spectral slope
- mel-frequency cepstral coefficients (MFCC)
- mean spectral roll-off
- median F0 (fundamental frequency)
- Rate of Speech and loudness:
- max intensity
- mean intensity
- syllables per second
- pause rate
- energy
- Harmonics
- harmonics to noise (HNR)
- Formants: f1,f2, f3, f4
- number of zero crossings (ZCR)
Extra Scripts for processed features
For visualization:
- visualize_voice.py for all scatter plots along with other plotting features from praat.
- To run: 'python3 visualize_voice.py'
- radar_plot.py for all radar plots
- To run: 'python3 radar_plot.py'
For PCA analysis of voice data: voice_pca.py is for PCA, RFE and Correlation plot:
-
- To run: 'voice_pca.py'
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file voice_toolbox-1.0.0.tar.gz
.
File metadata
- Download URL: voice_toolbox-1.0.0.tar.gz
- Upload date:
- Size: 8.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4ffabb7357fe6d2b34337cff28024b48b60c8ba0be68c6958ea8a476c2d21f79 |
|
MD5 | 5a4ccc1a64a1bc25aaf0b455e5a85816 |
|
BLAKE2b-256 | 43a407965a1e06f197c857ce1a3ffc3cfee8d2371367bc71b4adb8500db965b5 |
File details
Details for the file voice_toolbox-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: voice_toolbox-1.0.0-py3-none-any.whl
- Upload date:
- Size: 8.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.11.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 78bd431dfc6ff815cde486dbdfee404bd73c94eaa5d5527995d58eb46afdd134 |
|
MD5 | 1d896fd13e248bcc66e8bf6dc68f0c60 |
|
BLAKE2b-256 | 42a69e1cf8cc454d4c40359865057739cd8490ec0dde22260ba6964191079288 |