Vocal remover.
Project description
vocal-remover
This is a deep-learning-based tool to extract instrumental track from your songs.
Installation
Getting vocal-remover
Download the latest version from here.
Install PyTorch
See: GET STARTED
Install the other packages
cd vocal-remover
pip install -r requirements.txt
Usage
The following command separates the input into instrumental and vocal tracks. They are saved as *_Instruments.wav and *_Vocals.wav.
Run on CPU
python inference.py --input path/to/an/audio/file
Run on GPU
python inference.py --input path/to/an/audio/file --gpu 0
Advanced options
--tta option performs Test-Time-Augmentation to improve the separation quality.
python inference.py --input path/to/an/audio/file --tta --gpu 0
--postprocess option masks instrumental part based on the vocals volume to improve the separation quality.
[!WARNING] This is an experimental feature. If you get any problems with this option, please disable it.
python inference.py --input path/to/an/audio/file --postprocess --gpu 0
Train your own model
Place your dataset
path/to/dataset/
+- instruments/
| +- 01_foo_inst.wav
| +- 02_bar_inst.mp3
| +- ...
+- mixtures/
+- 01_foo_mix.wav
+- 02_bar_mix.mp3
+- ...
Train a model
python train.py --dataset path/to/dataset --mixup_rate 0.5 --reduction_rate 0.5 --gpu 0
References
- [1] Jansson et al., "Singing Voice Separation with Deep U-Net Convolutional Networks", https://ejhumphrey.com/assets/pdf/jansson2017singing.pdf
- [2] Takahashi et al., "Multi-scale Multi-band DenseNets for Audio Source Separation", https://arxiv.org/pdf/1706.09588.pdf
- [3] Takahashi et al., "MMDENSELSTM: AN EFFICIENT COMBINATION OF CONVOLUTIONAL AND RECURRENT NEURAL NETWORKS FOR AUDIO SOURCE SEPARATION", https://arxiv.org/pdf/1805.02410.pdf
- [4] Choi et al., "PHASE-AWARE SPEECH ENHANCEMENT WITH DEEP COMPLEX U-NET", https://openreview.net/pdf?id=SkeRTsAcYm
- [5] Jansson et al., "Learned complex masks for multi-instrument source separation", https://arxiv.org/pdf/2103.12864.pdf
- [6] Liutkus et al., "The 2016 Signal Separation Evaluation Campaign", Latent Variable Analysis and Signal Separation - 12th International Conference
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file vocal_remover-0.0.5.tar.gz.
File metadata
- Download URL: vocal_remover-0.0.5.tar.gz
- Upload date:
- Size: 12.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b1f91bf2385a9cba1598590c12306525529eb833dfe0c1741df6c486ccf59705
|
|
| MD5 |
f83eada2f473b8543e6ac0941168064d
|
|
| BLAKE2b-256 |
7d2edb0c960e3a2028b5002dbd348a8889396b964cdd08598f0c9623c7404837
|
File details
Details for the file vocal_remover-0.0.5-py3-none-any.whl.
File metadata
- Download URL: vocal_remover-0.0.5-py3-none-any.whl
- Upload date:
- Size: 12.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
a140e40b8dd3e703fbe55831ed5e68255a81d1e9f75ae2655fdc323940588111
|
|
| MD5 |
fa97af59052bdc17ff8f9be4a5034171
|
|
| BLAKE2b-256 |
207633237cac1a1dbb0b3a8f8fbcee8f8b09bf74489e64998e83db8e4e97239b
|