Skip to main content

A module for audio features extraction from Techmo

Project description

pypi Supported Python versions example workflow

Techmo Sp. z o.o. module for audio features extraction

How to use

:warning: Add ! character if you install the module in a jupyter notebook

pip install techmo-wavelet 

#import functions for feature extraction
from techmo.feature_extraction import calculate_wavelet_fft, calculate_fft_wavelet

# install numpy first in case is not installed in your environment
import numpy as np 

# signal must be 1d array read from wav file, e.x by using Soundfile. Here we generate random signal
signal = np.random.uniform(-1.0, 1.0, 16000)

# Here's an example of how to use `calculate_wavelet_fft` function
features = calculate_wavelet_fft(signal)

# Here's an example of how to use `calculate_fft_wavelet` function
features = calculate_fft_wavelet(signal)

The code implements 2 functions to extract features:

The calculate_wavelet_fft function implements an algorithm consisting of the following stages:

  1. If the number of samples N is greater than or equal to 4800, the signal is divided into int(N/2400) segments to compute finally 60 features for each segment containing int(N/int(N/2400)) samples, i.e. the feature vector will have 60*int(N/2400) elements,
  2. Segments are processed by the Hann window,
  3. Segments are normalized separately,
  4. Each segment is processed by the Wavelet Transform (WT),
  5. Each WT subband is subjected to the Fast Fourier Transform (FFT),
  6. FFT spectra are inputs of the triangular filtration to obtain the feature sub-vectors of length 60 for each segment,
  7. The logarithms of filter outputs are computed to obtain the feature sub-vectors of length 60 for each segment.
  8. Sub-vectors are concatenated to obtain a final feature matrix as numpy ndarray of shape int(N/2400), 60.

The calculate_fft_wavelet function implements an algorithm consisting of the following stages:

  1. If the number of samples N is greater than or equal to 9600, the signal is divided into int(N/4800) segments to compute finally 60 features for each segment containing int(N/int(N/4800)) samples, i.e. the feature vector will have 60*int(N/4800) elements,
  2. Segments are processed by the Hann window,
  3. Segments are normalized separately,
  4. Speech segments are processed by the the Fast Fourier Transform,
  5. The complex spectra are subjected to Wavelet Transform (WT),
  6. Absolute values of WT are calculated,
  7. The computed modules are inputs of the triangular filtration,
  8. The logarithms of filter outputs are computed to obtain the feature sub-vectors of length 60 for each segment.
  9. Sub-vectors are concatenated to obtain a final feature matrix as numpy ndarray of shape int(N/4800), 60.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

techmo-wavelet-0.3.2.tar.gz (5.4 kB view details)

Uploaded Source

Built Distribution

techmo_wavelet-0.3.2-py3-none-any.whl (5.8 kB view details)

Uploaded Python 3

File details

Details for the file techmo-wavelet-0.3.2.tar.gz.

File metadata

  • Download URL: techmo-wavelet-0.3.2.tar.gz
  • Upload date:
  • Size: 5.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.5.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.9.5

File hashes

Hashes for techmo-wavelet-0.3.2.tar.gz
Algorithm Hash digest
SHA256 21932c65ef53c22d66824965e32e2c86521a92faec50351435fcad6be5d2039f
MD5 a331a616ecb0f29677b5d711fb0f447b
BLAKE2b-256 f026f0bbd666a02d93e22620a5275b8933bb9180676b83b4f1831a24de5ddf53

See more details on using hashes here.

File details

Details for the file techmo_wavelet-0.3.2-py3-none-any.whl.

File metadata

  • Download URL: techmo_wavelet-0.3.2-py3-none-any.whl
  • Upload date:
  • Size: 5.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.5.0 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.9.5

File hashes

Hashes for techmo_wavelet-0.3.2-py3-none-any.whl
Algorithm Hash digest
SHA256 3814aaab3cba758417625e57f63f4738b2378d1c161f8ebe6c011f10b20c78d8
MD5 066b73b123f32d67c154cb26444e7a2a
BLAKE2b-256 a84cf62342870b02baf09739854f5f0f23f28924517cccccbe333bc5136eb399

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page