PyTorch implementation of the Perceptual Evaluation of Speech Quality

Development Status
- 4 - Beta
Intended Audience
- Science/Research
License
- OSI Approved :: MIT License
Programming Language
- Python :: 3

Project description

Loss function inspired by the PESQ score

Implementation of the widely used Perceptual Evaluation of Speech Quality (PESQ) score as a torch loss function. On its own, the PESQ loss does not perform well for noise suppression; combine it with scale-invariant SDR instead. For more information, see [1, 2].

Installation

To install the package just run:

$ pip install torch-pesq

Usage

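import torch
from torch_pesq import PesqLoss

pesq = PesqLoss(0.5, sample_rate=44100)

# `reference` and `degraded` are assumed to be the clean and processed
# waveform tensors; they must be defined before this point (not shown here).
mos = pesq.mos(reference, degraded)  # MOS estimate, e.g. for monitoring
loss = pesq(reference, degraded)     # differentiable loss for training

print(mos, loss)
loss.backward()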
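
The description recommends combining the PESQ loss with scale-invariant SDR but does not show how. The following is a minimal sketch under stated assumptions, not the author's training code: `si_sdr_loss`, `combined_loss`, and the `weight` parameter are hypothetical names introduced here, and inputs are assumed to have shape (batch, samples). The perceptual term is passed in as a callable, so a `PesqLoss` instance or any other loss with the same call signature can be used.

```python
import torch


def si_sdr_loss(reference: torch.Tensor, estimate: torch.Tensor,
                eps: float = 1e-8) -> torch.Tensor:
    """Negative scale-invariant SDR in dB, one value per batch item.

    Both inputs are assumed to have shape (batch, samples).
    """
    # Remove the mean so a constant offset does not affect the score.
    reference = reference - reference.mean(dim=-1, keepdim=True)
    estimate = estimate - estimate.mean(dim=-1, keepdim=True)

    # Project the estimate onto the reference to obtain the scaled target.
    scale = (estimate * reference).sum(dim=-1, keepdim=True) / (
        reference.pow(2).sum(dim=-1, keepdim=True) + eps
    )
    target = scale * reference
    residual = estimate - target

    ratio = target.pow(2).sum(dim=-1) / (residual.pow(2).sum(dim=-1) + eps)
    # Higher SI-SDR is better, so the loss is its negative.
    return -10.0 * torch.log10(ratio + eps)


def combined_loss(reference, estimate, perceptual_loss, weight=1.0):
    """Hypothetical combination: mean SI-SDR loss plus a weighted perceptual term.

    `perceptual_loss` is any callable taking (reference, estimate), for
    example a PesqLoss instance; `weight` is an illustrative hyperparameter.
    """
    perceptual = perceptual_loss(reference, estimate).mean()
    return si_sdr_loss(reference, estimate).mean() + weight * perceptual
```

With the objects from the usage example, a call would look like `combined_loss(reference, degraded, pesq)`. The relative weighting of the two terms is not given in the description and would need tuning.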

Comparison to reference implementation

The following figures use samples from the VCTK [1] speech dataset and the DEMAND [2] noise dataset, mixed with varying mixing factors. They illustrate the correlation and maximum error between the reference and torch implementations:

[Figure: Correlation]

The differences result from the missing time-alignment implementation and from level alignment done with IIR filtering instead of frequency weighting. They are minor and should not be significant when the score is used as a loss function. Two outliers may degrade results, and further investigation is needed to find the source of the difference.
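
The two quantities named in this section, correlation and maximum error, can be computed from paired scores as sketched below. This is an illustration only: the function names are hypothetical, and the example scores are made-up placeholders, not measurements from either implementation.

```python
import math


def pearson_correlation(xs, ys):
    """Pearson correlation coefficient between two equally long sequences."""
    if len(xs) != len(ys) or len(xs) < 2:
        raise ValueError("need two sequences of equal length >= 2")
    mean_x = sum(xs) / len(xs)
    mean_y = sum(ys) / len(ys)
    cov = sum((x - mean_x) * (y - mean_y) for x, y in zip(xs, ys))
    var_x = sum((x - mean_x) ** 2 for x in xs)
    var_y = sum((y - mean_y) ** 2 for y in ys)
    if var_x == 0 or var_y == 0:
        raise ValueError("correlation is undefined for constant input")
    return cov / math.sqrt(var_x * var_y)


def max_abs_error(xs, ys):
    """Largest absolute difference between paired scores."""
    return max(abs(x - y) for x, y in zip(xs, ys))


# Placeholder MOS values for illustration only (not real results).
reference_scores = [1.2, 2.0, 2.9, 3.5, 4.1]
torch_scores = [1.3, 2.1, 2.8, 3.6, 4.0]

print(pearson_correlation(reference_scores, torch_scores))
print(max_abs_error(reference_scores, torch_scores))
```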

Validation improvements when used as loss function

Validation results for fullband noise suppression:

- Noise estimator: recurrent SRU with soft masking; 8 layers with a width of 512 result in ~1586k parameters for the unpruned model
- STFT for signal coding: window length of 512, 50% overlap, Hamming window
- Mel filterbank with 32 Mel features
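
The STFT settings listed above (window length 512, 50% overlap, Hamming window) translate into a hop length of 256 samples and 257 frequency bins per frame. The sketch below shows one way to express them with `torch.stft`. It is an assumption-laden illustration, not the author's front end: the FFT size is assumed equal to the window length, the periodic Hamming variant is assumed, the helper name `stft_frames` is hypothetical, and the Mel filterbank step is left out.

```python
import torch

WIN_LENGTH = 512              # window length from the setup above
HOP_LENGTH = WIN_LENGTH // 2  # 50% overlap gives a hop of 256 samples


def stft_frames(signal: torch.Tensor) -> torch.Tensor:
    """Complex STFT of a 1-D signal using the settings listed above."""
    # torch.hamming_window returns the periodic variant by default (assumption
    # that this matches the original setup).
    window = torch.hamming_window(WIN_LENGTH)
    return torch.stft(
        signal,
        n_fft=WIN_LENGTH,  # assumed equal to the window length
        hop_length=HOP_LENGTH,
        win_length=WIN_LENGTH,
        window=window,
        return_complex=True,
    )


signal = torch.randn(16000)  # random placeholder signal
spec = stft_frames(signal)
print(spec.shape)  # (frequency bins, frames)
```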

The baseline system uses an L1 time-domain loss. Combining the PESQ loss function with scale-invariant SDR gives an improvement of ~0.1 MOS for PESQ and slight improvements in speech distortion, as well as a more stable training progression. Horizontal lines indicate the score of the noisy speech.

[Figure: Validation comparison]

Relevant references

Project details

This version

0.1.2

Nov 16, 2022

Download files

Source Distribution

torch-pesq-0.1.2.tar.gz (44.0 kB)

Uploaded Nov 16, 2022 Source

Built Distribution

torch_pesq-0.1.2-py3-none-any.whl (14.7 kB)

Uploaded Nov 16, 2022 Python 3

File details

Details for the file torch-pesq-0.1.2.tar.gz.

File metadata

Download URL: torch-pesq-0.1.2.tar.gz
Upload date: Nov 16, 2022
Size: 44.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.8.10

File hashes

Hashes for torch-pesq-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`4c5ecc0660eaa8bee3840efa1adcc84d7b53fdc51554c9b1affae4903218acbb`
MD5	`d48fa3b53c7d0c65b562366236eece94`
BLAKE2b-256	`7bc29bb24c373f5468c1c433e5703b7857af85a341cbe69e558ef858aa57ea0a`
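
A downloaded file can be checked against the SHA256 digest listed above. The following is a minimal sketch using only the standard library; the helper names and the local file path are assumptions, and the check is skipped when the file is not present.

```python
import hashlib
from pathlib import Path

# SHA256 digest of the source distribution, copied from the table above.
EXPECTED_SDIST_SHA256 = (
    "4c5ecc0660eaa8bee3840efa1adcc84d7b53fdc51554c9b1affae4903218acbb"
)


def sha256_of_file(path, chunk_size=65536):
    """Hex SHA256 digest of a file, read in chunks to limit memory use."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Hypothetical download location; adjust to where the file was saved.
    sdist = Path("torch-pesq-0.1.2.tar.gz")
    if sdist.exists():
        print(matches_expected(sdist, EXPECTED_SDIST_SHA256))
```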

File details

Details for the file torch_pesq-0.1.2-py3-none-any.whl.

File metadata

Download URL: torch_pesq-0.1.2-py3-none-any.whl
Upload date: Nov 16, 2022
Size: 14.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.1 CPython/3.8.10

File hashes

Hashes for torch_pesq-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`6f3fa836f6517f86652332c67b653164c16a95867beb3095dd0392b814efda45`
MD5	`009f0bb5cce57a1d2a79b762ae90ae56`
BLAKE2b-256	`5099c07d07829b7f6e934c4d00d77c2bdc8e5f012cb0037a09c75aa1c78a55f9`
