This package is written for MOS score prediction based on fine-tuned wav2vec2.0 model
Project description
WV-MOS
MOS score prediction by fine-tuned wav2vec2.0 model
Keywords: MOS-Net, MB-Net, PESQ, STOI, speech quality
Getting started
The package installation was tested with python3.9
pip install git+https://github.com/AndreevP/wvmos
Inference
from wvmos import get_wvmos
model = get_wvmos(cuda=True)
mos = model.calculate_one("path/to/wav/file") # infer MOS score for one audio
mos = model.calculate_dir("path/to/dir/with/wav/files", mean=True) # infer average MOS score across .wav files in directory
Citation and Acknowledgment
This work was done for the deep learning course in Skolteh university by Pavel Andreev, Nikolay Patakin, Oleg Desheulin, Alexander Kagan and Arthur Bulanbaev. More details are described in paper https://arxiv.org/abs/2203.13086
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file wvmos-1.0.tar.gz.
File metadata
- Download URL: wvmos-1.0.tar.gz
- Upload date:
- Size: 3.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
44245b85ac88acc594d46319e8920085874cfcfa584113a96b0c1628c8ef2c00
|
|
| MD5 |
4fa31db5abed3e58c6bead00821e6314
|
|
| BLAKE2b-256 |
8bb4b2613bb0346a3cdaf998d290981bf0487daa16d8cd9549e8eee0ff677dc8
|
File details
Details for the file wvmos-1.0-py3-none-any.whl.
File metadata
- Download URL: wvmos-1.0-py3-none-any.whl
- Upload date:
- Size: 3.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.9
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
68ce4c13cfa693a81f14d53f3dc5bfe341ce751bb10a2e06d5fc8c148014c500
|
|
| MD5 |
6a5b4bd2bdb889adf2c1bebe4d8f4665
|
|
| BLAKE2b-256 |
09d65db98dda8bfe381cc471c81590bafa53b2bc7d2c11cc8300fc8ef8162ee7
|