A Conv-STFT/iSTFT implement based on Torch

Project description

Conv-STFT/iSTFT in PyTorch

Author: Shimin Zhang

The code refers to the following repo:

An STFT/iSTFT written up in PyTorch(py3) using 1D Convolutions. There are two window logic, break and continue.

break - a kaldi-like framing method

When the parameters win_len and fft_len are different, padding fft_len-win_len zero points after each frame( len(frame) = win_len ), and the window ( len(window) = win_len ) always wise-multiply with frame before padding.

continue - a librosa-like framing method.

When the parameters win_len and fft_len are different, framing the signal using win_len=fft_len, and zero padding on both sides of window ( len(window) = win_len ), which is len(center_pad(window))=fft_len

Installation

Install easily with pip:pip install conv_stft or download this repo, python setup.py install.

Usage

import torch
from conv_stft import STFT
import numpy as np
import librosa 
import matplotlib.pyplot as plt

audio = librosa.load(librosa.util.example_audio_file(), duration=10.0, offset=30)[0]
device = 'cpu'
fft_len = 1024
win_hop = 256
win_len = 1024
window = 'hann'

audio = torch.FloatTensor(audio)
audio = audio.unsqueeze(0)
audio = audio.to(device)

stft = STFT(
    fft_len=fft_len, 
    win_hop=win_hop, 
    win_len=win_len,
    win_type=window,
).to(device)

magnitude, phase = stft.transform(audio, return_type='magphase') # 'magphase' or 'realimag'
output = stft.inverse(magnitude, phase, input_type='magphase') # 'magphase' or 'realimag'
output = output.cpu().data.numpy()[..., :]
audio = audio.cpu().data.numpy()[..., :]
print(np.mean((output - audio) ** 2)) # on order of 1e-15

Output of compare_stft.py:

images/stft.png

Tests

Test it by just cloning this repo and running

pip install -r requirements.txt
python -m pytest .

Project details

Release history Release notifications | RSS feed

0.2.0

Jun 15, 2021

This version

0.1.2

Aug 19, 2020

0.1.1

Aug 19, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

conv_stft-0.1.2.tar.gz (4.3 kB view hashes)

Uploaded Aug 19, 2020 Source

Built Distribution

conv_stft-0.1.2-py2.py3-none-any.whl (5.4 kB view hashes)

Uploaded Aug 19, 2020 Python 2 Python 3

Hashes for conv_stft-0.1.2.tar.gz

Hashes for conv_stft-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`48970837ba165fa7c07b30008561602e1488e55cbac20a21930131089848b26f`
MD5	`080e0aca0864ea47de044c3bfc5d8be8`
BLAKE2b-256	`6828174a52171f60783c49163ebbe4c66b48bc60393483c16a635724f775f4e5`

Hashes for conv_stft-0.1.2-py2.py3-none-any.whl

Hashes for conv_stft-0.1.2-py2.py3-none-any.whl
Algorithm	Hash digest
SHA256	`fa89c253cc5dd86c51f379e4bff59d46d3d8d22a81f6eff69920cc29795e8bc4`
MD5	`1c2d57d1b3a1d109c07db152229c47b0`
BLAKE2b-256	`6535213ae9b78a4ce5abb9fbe081f48b715fa2b4270f3b89d3c8f20ae368bdbc`