A package for training audio denoisers
Project description
Denoisers
Denoisers is a denoising library for audio with a focus on simplicity and ease of use. There are two major types of architectures available. WaveUNet for waveform denoising and UNet for spectrogram denoising.
Usage/Examples
import torch
import torchaudio
from denoisers import WaveUNetModel
from tqdm import tqdm
model = WaveUNetModel.from_pretrained("wrice/waveunet-vctk-24khz")
audio, sr = torchaudio.load("noisy_audio.wav")
if sr != model.config.sample_rate:
audio = torchaudio.functional.resample(audio, sr, model.config.sample_rate)
chunk_size = model.config.max_length
padding = abs(audio.size(-1) % chunk_size - chunk_size)
padded = torch.nn.functional.pad(audio, (0, padding))
clean = []
for i in tqdm(range(0, padded.shape[-1], chunk_size)):
audio_chunk = padded[:, i:i+chunk_size].cuda()
with torch.no_grad():
clean_chunk = model(audio_chunk[None])
clean.append(clean_chunk.squeeze(0).cpu())
denoised = torch.concat(clean)[:, :audio.shape[-1]]
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
denoisers-0.1.3.tar.gz
(18.3 kB
view hashes)
Built Distribution
denoisers-0.1.3-py3-none-any.whl
(23.1 kB
view hashes)
Close
Hashes for denoisers-0.1.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | c96c10485f600d38b056b7834cc84d881b0a6f4d8fbf9cf82c8b5e03b4fa3914 |
|
MD5 | 651491aeaadccf2edfed7457ed7a01f9 |
|
BLAKE2b-256 | d2450dc78b1ba0990d8084d63f92692ce4de5bca7bf6fec3a886e56b1195ca33 |