
UT-Sarulab MOS prediction system using SSL models

Project description

NOTE: Not affiliated with original authors of UTMOS!

Unofficial UTMOS Python Package

This is an unofficial Python package for UTMOS (UTokyo-SaruLab MOS Prediction System). This repository is based on the original code. The paper is available here.

What is UTMOS?

UTMOS predicts the mean opinion score (MOS) for a given voice sample. It can be used to estimate audio quality across a dataset.

Note

The score is on a scale of 1 to 5. If you'd like a score on a 100-point scale, multiply the score by 20 (score * 20), so that 1 maps to 20 and 5 maps to 100.

Example: new_score = round(score * 20, 2)
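
As a minimal sketch of the rescaling described in the note, the helper below multiplies a 1 to 5 score by 20 and rounds to two decimals. The function name `to_hundred_scale` is hypothetical and not part of the utmos package.

```python
def to_hundred_scale(score: float) -> float:
    """Rescale a MOS value from the 1-5 scale to a 100-point scale.

    Hypothetical helper, not part of the utmos package.
    Multiplying by 20 maps 1 -> 20 and 5 -> 100.
    """
    return round(score * 20, 2)


if __name__ == "__main__":
    # Any float on the 1-5 scale works here; 4.5 is just an illustrative value.
    print(to_hundred_scale(4.5))
```

The input would normally be the float returned by the scoring call shown in the Usage section.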

Support

This implementation supports CPU, CUDA, and MPS, as well as ROCm if PyTorch is configured properly. It automatically uses the GPU if one is available.
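
The package selects the device on its own, so no code is needed for this. For readers curious about what such a selection typically looks like, here is a rough sketch of one common preference order (CUDA, then MPS, then CPU). The function `pick_device` is hypothetical and is an assumption about the general pattern, not a description of how this package actually chooses.

```python
def pick_device(cuda_available: bool, mps_available: bool) -> str:
    """Return a device name using a CUDA > MPS > CPU preference order.

    Hypothetical helper illustrating a common pattern; the utmos
    package may select its device differently.
    """
    if cuda_available:
        return "cuda"
    if mps_available:
        return "mps"
    return "cpu"


if __name__ == "__main__":
    # With PyTorch installed, the two flags would typically come from
    # torch.cuda.is_available() and torch.backends.mps.is_available().
    print(pick_device(cuda_available=False, mps_available=False))
```

ROCm builds of PyTorch report their GPUs through the same `torch.cuda` interface, which is why ROCm needs no separate branch in a sketch like this.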

Installation

pip install utmos

Usage

CLI (Command Line Interface)

utmos audio.wav

Python API

import utmos

# The model is downloaded automatically and uses the GPU if one is available.
model = utmos.Score()
score = model.calculate_wav_file('audio_file.wav')  # returns a float
# Alternatively: model.calculate_wav(wav, sample_rate)
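
Since the package can be used to score whole datasets, the sketch below shows one way to collect per-file scores and average them. The helpers `score_files` and `average_score` are hypothetical names introduced for illustration. The scoring function is passed in as an argument so the example does not depend on the model being downloaded.

```python
from statistics import mean
from typing import Callable, Dict, Iterable


def score_files(paths: Iterable[str], score_fn: Callable[[str], float]) -> Dict[str, float]:
    """Apply score_fn to every path and return a path -> score mapping.

    Hypothetical helper; score_fn is any callable that takes a file
    path and returns a numeric score.
    """
    return {path: float(score_fn(path)) for path in paths}


def average_score(scores: Dict[str, float]) -> float:
    """Return the mean of the per-file scores."""
    if not scores:
        raise ValueError("no scores to average")
    return mean(scores.values())
```

With the package installed, `score_fn` could be the `calculate_wav_file` method of a `utmos.Score()` instance, for example `score_files(paths, model.calculate_wav_file)`.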

License

This software is licensed under the MIT license.

