An easy-to-use Voice Conversion framework based on VITS.
Project description
[!NOTE] Currently under development... Provided as a library and API in rvc
Installation and usage
Standard Setup
First, create a directory in your project. The assets
folder will contain the models needed for inference and training, and the result
folder will contain the results of the training.
rvc init
This will create an assets
folder and .env
in your working directory.
[!WARNING] The directory should be empty or without an assets folder.
Custom Setup
If you have already downloaded models or want to change these configurations, edit the .env
file.
If you do not already have a .env
file,
rvc env create
can create one.
Also, when downloading a model, you can use the
rvc dlmodel
or
rvc dlmodel {download_dir}
Finally, specify the location of the model in the env file, and you are done!
Library Usage
Inference Audio
from pathlib import Path
from dotenv import load_dotenv
from scipy.io import wavfile
from rvc.modules.vc.modules import VC
def main():
vc = VC()
vc.get_vc("{model.pth}")
tgt_sr, audio_opt, times, _ = vc.vc_single(
1, Path("{InputAudio}")
)
wavfile.write("{OutputAudio}", tgt_sr, audio_opt)
if __name__ == "__main__":
load_dotenv("{envPath}")
main()
CLI Usage
Inference Audio
rvc infer -m {model.pth} -i {input.wav} -o {output.wav}
option | flag | type | default value | description |
---|---|---|---|---|
modelPath | -m | Path | *required | Model path or filename (reads in the directory set in env) |
inputPath | -i | Path | *required | Input audio path or folder |
outputPath | -o | Path | *required | Output audio path or folder |
sid | -s | int | 0 | Speaker/Singer ID |
f0_up_key | -fu | int | 0 | Transpose (integer, number of semitones, raise by an octave: 12, lower by an octave: -12) |
f0_method | -fm | str | rmvpe | pitch extraction algorithm (pm, harvest, crepe, rmvpe |
f0_file | -ff | Path | None | None | F0 curve file (optional). One pitch per line. Replaces the default F0 and pitch modulation |
index_file | -if | Path | None | None | Path to the feature index file |
index_rate | -if | float | 0.75 | Search feature ratio (controls accent strength, too high has artifacting) |
filter_radius | -fr | int | 3 | If >=3: apply median filtering to the harvested pitch results. The value represents the filter radius and can reduce breathiness |
resample_sr | -rsr | int | 0 | Resample the output audio in post-processing to the final sample rate. Set to 0 for no resampling |
rms_mix_rate | -rmr | float | 0.25 | Adjust the volume envelope scaling. Closer to 0, the more it mimicks the volume of the original vocals. Can help mask noise and make volume sound more natural when set relatively low. Closer to 1 will be more of a consistently loud volume |
protect | -p | float | 0.33 | Protect voiceless consonants and breath sounds to prevent artifacts such as tearing in electronic music. Set to 0.5 to disable. Decrease the value to increase protection, but it may reduce indexing accuracy |
API Usage
First, start up the server.
rvc-api
or
poetry run poe rvc-api
Inference Audio
curl -X 'POST' \
'http://127.0.0.1:8000/inference' \
-H 'accept: application/json' \
-H 'Content-Type: multipart/form-data' \
-F 'modelpath={model.pth}' \
-F 'input={input audio path}' \
-o {output audio path}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file rvc-0.3.5.tar.gz
.
File metadata
- Download URL: rvc-0.3.5.tar.gz
- Upload date:
- Size: 90.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.7.1 CPython/3.11.2 Darwin/23.2.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ad59fd93c004a8542240ef3ae5561a0f03b09ce6c7679e63fd582b1478d0bfa7 |
|
MD5 | 513d51478b7983d3e64681f34e528829 |
|
BLAKE2b-256 | 2529919d3e8e8d52ecc6a152683c7884d1c4914e79db6d7668fb9e88e427bb2b |
File details
Details for the file rvc-0.3.5-py3-none-any.whl
.
File metadata
- Download URL: rvc-0.3.5-py3-none-any.whl
- Upload date:
- Size: 130.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.7.1 CPython/3.11.2 Darwin/23.2.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5e55c887be49850869a347c71a51af03b0d5e86f737c9078ca14241730490de2 |
|
MD5 | ac3507afbb2c6e18ab5a71ee29cc2ee2 |
|
BLAKE2b-256 | db77090f8f9ed42eea138042f68b4734ee79e330c679531a7e20508701e33fc6 |