Vocal remover.

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Project description

vocal-remover

This is a deep-learning-based tool to extract instrumental track from your songs.

Installation

Getting vocal-remover

Download the latest version from here.

Install PyTorch

See: GET STARTED

Install the other packages

cd vocal-remover
pip install -r requirements.txt

Usage

The following command separates the input into instrumental and vocal tracks. They are saved as *_Instruments.wav and *_Vocals.wav.

Run on CPU

python inference.py --input path/to/an/audio/file

Run on GPU

python inference.py --input path/to/an/audio/file --gpu 0

Advanced options

--tta option performs Test-Time-Augmentation to improve the separation quality.

python inference.py --input path/to/an/audio/file --tta --gpu 0

--postprocess option masks instrumental part based on the vocals volume to improve the separation quality.

[!WARNING] This is an experimental feature. If you get any problems with this option, please disable it.

python inference.py --input path/to/an/audio/file --postprocess --gpu 0

Train your own model

Place your dataset

path/to/dataset/
  +- instruments/
  |    +- 01_foo_inst.wav
  |    +- 02_bar_inst.mp3
  |    +- ...
  +- mixtures/
       +- 01_foo_mix.wav
       +- 02_bar_mix.mp3
       +- ...

Train a model

python train.py --dataset path/to/dataset --mixup_rate 0.5 --reduction_rate 0.5 --gpu 0

References

[1] Jansson et al., "Singing Voice Separation with Deep U-Net Convolutional Networks", https://ejhumphrey.com/assets/pdf/jansson2017singing.pdf
[2] Takahashi et al., "Multi-scale Multi-band DenseNets for Audio Source Separation", https://arxiv.org/pdf/1706.09588.pdf
[3] Takahashi et al., "MMDENSELSTM: AN EFFICIENT COMBINATION OF CONVOLUTIONAL AND RECURRENT NEURAL NETWORKS FOR AUDIO SOURCE SEPARATION", https://arxiv.org/pdf/1805.02410.pdf
[4] Choi et al., "PHASE-AWARE SPEECH ENHANCEMENT WITH DEEP COMPLEX U-NET", https://openreview.net/pdf?id=SkeRTsAcYm
[5] Jansson et al., "Learned complex masks for multi-instrument source separation", https://arxiv.org/pdf/2103.12864.pdf
[6] Liutkus et al., "The 2016 Signal Separation Evaluation Campaign", Latent Variable Analysis and Signal Separation - 12th International Conference

Project details

These details have not been verified by PyPI

Project links

Homepage

License
- OSI Approved :: MIT License
Operating System
- OS Independent
Programming Language
- Python :: 3

Release history Release notifications | RSS feed

0.0.6

Mar 9, 2025

This version

0.0.5

Mar 9, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vocal_remover-0.0.5.tar.gz (12.4 kB view details)

Uploaded Mar 9, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vocal_remover-0.0.5-py3-none-any.whl (12.8 kB view details)

Uploaded Mar 9, 2025 Python 3

File details

Details for the file vocal_remover-0.0.5.tar.gz.

File metadata

Download URL: vocal_remover-0.0.5.tar.gz
Upload date: Mar 9, 2025
Size: 12.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.0

File hashes

Hashes for vocal_remover-0.0.5.tar.gz
Algorithm	Hash digest
SHA256	`b1f91bf2385a9cba1598590c12306525529eb833dfe0c1741df6c486ccf59705`
MD5	`f83eada2f473b8543e6ac0941168064d`
BLAKE2b-256	`7d2edb0c960e3a2028b5002dbd348a8889396b964cdd08598f0c9623c7404837`

See more details on using hashes here.

File details

Details for the file vocal_remover-0.0.5-py3-none-any.whl.

File metadata

Download URL: vocal_remover-0.0.5-py3-none-any.whl
Upload date: Mar 9, 2025
Size: 12.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.0

File hashes

Hashes for vocal_remover-0.0.5-py3-none-any.whl
Algorithm	Hash digest
SHA256	`a140e40b8dd3e703fbe55831ed5e68255a81d1e9f75ae2655fdc323940588111`
MD5	`fa97af59052bdc17ff8f9be4a5034171`
BLAKE2b-256	`207633237cac1a1dbb0b3a8f8fbcee8f8b09bf74489e64998e83db8e4e97239b`

See more details on using hashes here.

vocal-remover 0.0.5

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

vocal-remover

Installation

Getting vocal-remover

Install PyTorch

Install the other packages

Usage

Run on CPU

Run on GPU

Advanced options

Train your own model

Place your dataset

Train a model

References

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes