Skip to main content

Spectra Extraction based on PyTorch

Project description

spectra_torch

This library provides common spectra features from an audio signal including MFCCs and filter bank energies. This library mimics the library python_speech_features but PyTorch-style.

This library provides voice activity detection (VAD) based on energy. This library mimics the library VAD-python but PyTorch-style.

Use: Rui Wang. (2020, february 26). mechanicalsea/spectra: release v0.2.2 (Version 0.2.2).

Installation

This library is avaliable on pypi.org

To install from Pypi:

pip install --upgrade spectra-torch

Require:

  • python: 3.7.3
  • torch: 1.4.0
  • torchaudio: 0.4.0

Usage

Supported features:

  • Mel Frequency Cepstral Coefficients (MFCC)
  • Filterbank Energies
  • Log Filterbank Energies
  • Voice Activity Detection (VAD)

Here are examples.

Easy demo:

import spectra_torch as mm
import torchaudio as ta

sig, sr = ta.load_wav('mywav.wav')
sig = sig[0]
mfcc = mm.mfcc(sig, sr) # MFCC
starts, detection = is_speech(sig, sr, speechlen=1) # VAD

Performance

The difference between spectra_torch and python_speech_features:

  • Precision bais: 1e-4
  • Speed up: 0.1s/mfcc

MFCC

def mfcc(signal, samplerate=16000, winlen=0.025, hoplen=0.01, 
         numcep=13, nfilt=26, nfft=None, lowfreq=0, highfreq=None, 
         preemph=0.97, ceplifter=22, plusEnergy=True)

Filterbank

def fbank(signal, samplerate=16000, winlen=0.025, hoplen=0.01, 
          nfilt=26, nfft=512, lowfreq=0, highfreq=None, preemph=0.97)

VAD

def is_speech(signal, samplerate=16000, winlen=0.02, hoplen=0.01, 
              thresEnergy=0.6, speechlen=0.5, lowfreq=300, highfreq=3000, 
              preemph=0.97)

Reference

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

spectra-torch-0.3.0.tar.gz (5.4 kB view hashes)

Uploaded Source

Built Distribution

spectra_torch-0.3.0-py2.py3-none-any.whl (5.8 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page