Audio effects classifiers for voice

Project description

Voice Of Chunk

What do we want? Clean vocal acapellas! When do we want them? Now!

Cascaded Sorting

Flowchart

This repository hosts several models that are meant to cascade results. The output of the final classifier is audio ready to be used in training a Singing Voice Conversion (SVC) model.

State of Research

Voice vs. Non-voice classifier
Reverb vs. Dry classifier
Polyphony vs. Monophony classifier
Multi-voice unison vs. Single-voice unison

Installation

This package was developed using Python 3.11.4, and thus that version of python is recommended.

Clone this repository and make it the current directory in the terminal.

The, open a new virtual environment and install the requirements.

pip install -r requirements.txt

Model Weights

Download model weights from my dropbox.

cd voicequalityclassifier
wget https://www.dropbox.com/scl/fi/w6yqkhzpq4l8wpw53zrjk/weights.zip?rlkey=3ju9ofcn93wdklzc3sr8nlezz&dl=1 -O weights.zip
unzip weights.zip
cd ..

Usage

For some example audio download a few files with this command:

wget https://www.dropbox.com/scl/fi/sclv9z73xobw2hbagjqob/audio.zip?rlkey=3cq04ds7ymt5q7d4v1dzkfgib&dl=1 -O audio.zip
unzip audio.zip

The voicequalityclassifier is a package that can also be run as a module. To run the whole classification scheme, run

./env/bin/python voicequalityclassifier/main.py --input path/to/audio.wav

In the console, you will see the classification results. If you see:

for file path/to/audio.mp3 detected nonvoice

This means that no voice was detected (i.e. the source is an instrumental stem or stereo mix). If you see:

for file path/to/audio.mp3 detected voice
for file path/to/audio.mp3 detected reverb

This means there was reverb detected on the track.

Finally, if you see:

for file path/to/audio.mp3 detected voice
for file path/to/audio.mp3 detected dry

Then we're in business and we have a dry acapella file.

for file path/to/audio.mp3 detected voice
for file path/to/audio.mp3 detected reverb

This means there was reverb detected on the track.

Finally, if you see:

for file path/to/audio.mp3 detected voice
for file path/to/audio.mp3 detected dry

Then we're in business and we have a dry acapella file.

Project details

Release history Release notifications | RSS feed

1.0.4

Feb 10, 2024

1.0.3

Feb 9, 2024

This version

1.0.2

Feb 9, 2024

1.0.1

Feb 9, 2024

1.0.0

Feb 9, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vchunk-1.0.2-py3-none-any.whl (9.7 MB view details)

Uploaded Feb 9, 2024 Python 3

File details

Details for the file vchunk-1.0.2-py3-none-any.whl.

File metadata

Download URL: vchunk-1.0.2-py3-none-any.whl
Upload date: Feb 9, 2024
Size: 9.7 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.10.13

File hashes

Hashes for vchunk-1.0.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`d3b6fc283f1b6439b346bb2823c442bf985634a332a38d01c5d750bfd8cdb35a`
MD5	`2839beed33c300f948cddff2918f037c`
BLAKE2b-256	`947ad5c94d626d7365931159d2e4db651c2b0fb541e378e316f791dfec817e6f`

See more details on using hashes here.

vchunk 1.0.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Voice Of Chunk

Cascaded Sorting

State of Research

Installation

Model Weights

Usage

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distributions

Built Distribution

File details

File metadata

File hashes