This package is written for the restoration of degraded speech.

Project description

VoiceFixer

This package provides:

A pretrained 44.1 kHz universal, speaker-independent neural vocoder.
A pretrained VoiceFixer, which is built on the neural vocoder.

VoiceFixer aims to restore human speech regardless of how severely it is degraded. A single model handles noise, reverberation, low resolution (2 kHz to 44.1 kHz), and clipping (0.1 to 1.0 threshold).
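To make the clipping range concrete, here is a minimal sketch of hard clipping applied to a synthetic tone. It assumes that "threshold" means the absolute amplitude limit on a waveform normalized to [-1, 1]; the text above does not define it, so treat that reading as an assumption. The helper names `hard_clip` and `sine_wave` are hypothetical and are not part of the voicefixer package.

```python
import math


def hard_clip(samples, threshold):
    """Clamp every sample to the range [-threshold, threshold]."""
    if not 0.0 < threshold <= 1.0:
        raise ValueError("threshold must be in (0, 1]")
    return [max(-threshold, min(threshold, s)) for s in samples]


def sine_wave(freq_hz, duration_s, sample_rate=44100):
    """Generate a full-scale sine wave as a list of floats in [-1, 1]."""
    n = int(duration_s * sample_rate)
    return [math.sin(2.0 * math.pi * freq_hz * i / sample_rate) for i in range(n)]


# Assumed interpretation: a threshold of 0.25 flattens every peak above 25% of full scale.
clean = sine_wave(440.0, 0.01)
clipped = hard_clip(clean, threshold=0.25)
```

Under this reading, a lower threshold means heavier distortion, and a threshold of 1.0 leaves a normalized signal unchanged.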

Demo

Please visit the demo page (https://haoheliu.github.io/demopage-voicefixer/) to hear what VoiceFixer can do.

Usage

Basic example:

# Will automatically download model parameters.
from voicefixer import VoiceFixer
from voicefixer import Vocoder

# Initialize model
voicefixer = VoiceFixer()
# Speech restoration

# Mode 0
voicefixer.restore(input="", # input wav file path
                   output="", # output wav file path
                   cuda=False, # whether to use gpu acceleration
                   mode = 0) # You can try out mode 0, 1, 2 to find out the best result
# Mode 1
voicefixer.restore(input="", # input wav file path
                   output="", # output wav file path
                   cuda=False, # whether to use gpu acceleration
                   mode = 1) # You can try out mode 0, 1, 2 to find out the best result
# Mode 2
voicefixer.restore(input="", # input wav file path
                   output="", # output wav file path
                   cuda=False, # whether to use gpu acceleration
                   mode = 2) # You can try out mode 0, 1, 2 to find out the best result

# Universal speaker independent vocoder
vocoder = Vocoder(sample_rate=44100) # Only 44100 sampling rate is supported.

# Convert mel spectrogram to waveform
# mel_spec is a mel spectrogram supplied by the caller; it is not defined in this snippet.
wave = vocoder.forward(mel=mel_spec) # This forward function is also used by the oracle function below.

# Test vocoder using the mel spectrogram of 'fpath', save output to file out_path
vocoder.oracle(fpath="", # input wav file path
               out_path="") # output wav file path
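Since the example above suggests trying modes 0, 1 and 2 to find the best result, the three calls can be wrapped in a small loop. The sketch below is one possible way to do that; `restore_all_modes` and the `<name>_mode<N>.wav` naming scheme are hypothetical and not part of the voicefixer package. The only thing assumed about the library is the `restore(input=..., output=..., cuda=..., mode=...)` call shown above.

```python
import os


def restore_all_modes(restore_fn, input_path, output_dir, modes=(0, 1, 2), cuda=False):
    """Call restore_fn once per mode and return a dict of mode -> output path."""
    # Derive the output file names from the input file name (hypothetical naming scheme).
    stem = os.path.splitext(os.path.basename(input_path))[0]
    os.makedirs(output_dir, exist_ok=True)
    outputs = {}
    for mode in modes:
        out_path = os.path.join(output_dir, f"{stem}_mode{mode}.wav")
        # Same keyword arguments as the voicefixer.restore calls in the basic example.
        restore_fn(input=input_path, output=out_path, cuda=cuda, mode=mode)
        outputs[mode] = out_path
    return outputs
```

With the package installed, this could be called as `restore_all_modes(VoiceFixer().restore, "degraded.wav", "restored")`, where both paths are placeholders. Passing the bound method in keeps the helper independent of how the model is constructed.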

Materials

Voicefixer training: https://github.com/haoheliu/voicefixer_main.git
Demo page: https://haoheliu.github.io/demopage-voicefixer/
If you find this repo helpful, please consider citing:

@misc{liu2021voicefixer,
title={VoiceFixer: Toward General Speech Restoration With Neural Vocoder},
author={Haohe Liu and Qiuqiang Kong and Qiao Tian and Yan Zhao and DeLiang Wang and Chuanzeng Huang and Yuxuan Wang},
year={2021},
eprint={2109.13731},
archivePrefix={arXiv},
primaryClass={cs.SD}
}

Release history

Version	Release date
0.1.3	Nov 12, 2023
0.1.2	Sep 3, 2022
0.1.1	Aug 19, 2022
0.1.0	Aug 19, 2022
0.0.18	Aug 4, 2022
0.0.17	Nov 6, 2021
0.0.16	Oct 26, 2021
0.0.15	Oct 26, 2021
0.0.14	Oct 26, 2021
0.0.13	Oct 26, 2021
0.0.12	Oct 7, 2021
0.0.11	Oct 7, 2021
0.0.10	Oct 6, 2021
0.0.9	Oct 1, 2021
0.0.8	Sep 30, 2021 (this version)
0.0.7	Sep 29, 2021
0.0.6	Sep 26, 2021
0.0.5	Sep 18, 2021
0.0.4	Sep 14, 2021
0.0.3	Sep 14, 2021
0.0.2	Sep 14, 2021
0.0.1	Sep 14, 2021

Download files

Source Distribution

voicefixer-0.0.8.tar.gz (36.4 kB)

Uploaded Sep 30, 2021 Source

Built Distribution

voicefixer-0.0.8-py3-none-any.whl (42.3 kB)

Uploaded Sep 30, 2021 Python 3

File details

Details for the file voicefixer-0.0.8.tar.gz.

File metadata

Download URL: voicefixer-0.0.8.tar.gz
Upload date: Sep 30, 2021
Size: 36.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.7.10

File hashes

Hashes for voicefixer-0.0.8.tar.gz
Algorithm	Hash digest
SHA256	`3e0d87fa97ead9c92593cb477dba0ffc2c77ebd5ec05d94be2c2c6eb7a24aa6e`
MD5	`b228085d0c1fa40c96ae9e55abaf8483`
BLAKE2b-256	`9d7320f883fbaecc513264a33d1443471654ad2d9a026718de93848c6f722b16`
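A downloaded archive can be checked against the SHA256 digest listed above using Python's standard `hashlib` module. The following is a minimal sketch; the helper names `sha256_of_file` and `matches_expected` are hypothetical, and the local file path is whatever location the archive was saved to.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA-256 digest of the file at `path`, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # iter() with a sentinel stops when read() returns an empty bytes object.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Compare the file's digest with an expected hex string, ignoring case."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# SHA256 digest listed above for voicefixer-0.0.8.tar.gz
EXPECTED_SDIST_SHA256 = "3e0d87fa97ead9c92593cb477dba0ffc2c77ebd5ec05d94be2c2c6eb7a24aa6e"
```

Assuming the archive sits in the current directory, `matches_expected("voicefixer-0.0.8.tar.gz", EXPECTED_SDIST_SHA256)` should return `True` for an intact download. Reading in chunks keeps memory use constant regardless of file size.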

File details

Details for the file voicefixer-0.0.8-py3-none-any.whl.

File metadata

Download URL: voicefixer-0.0.8-py3-none-any.whl
Upload date: Sep 30, 2021
Size: 42.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.26.0 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.7.10

File hashes

Hashes for voicefixer-0.0.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7d3a0cb48eadc728c5cdf6972e8f9fa429516a8803739f452014e91af6aed3e2`
MD5	`0ff4100f7ac309eb917d13dd8ae425ed`
BLAKE2b-256	`28908cf9e37bd128cdca4747c5c8f7a336a6214b8ac3c17a314bedd24e2bbfac`
