Skip to main content

An easy-to-use Voice Conversion framework based on VITS.

Project description

Retrieval-based-Voice-Conversion

An easy-to-use Voice Conversion framework based on VITS.

madewithlove


Licence

Discord


[!NOTE] Currently under development... Provided as a library and API in rvc

Installation and usage

Standard Setup

First, create a directory in your project. The assets folder will contain the models needed for inference and training, and the result folder will contain the results of the training.

rvc init

This will create an assets folder and .env in your working directory.

[!WARNING] The directory should be empty or without an assets folder.

Custom Setup

If you have already downloaded models or want to change these configurations, edit the .env file. If you do not already have a .env file,

rvc env create

can create one.

Also, when downloading a model, you can use the

rvc dlmodel

or

rvc dlmodel {download_dir}

Finally, specify the location of the model in the env file, and you are done!

Library Usage

Inference Audio

from pathlib import Path

from dotenv import load_dotenv
from scipy.io import wavfile

from rvc.modules.vc.modules import VC


def main():
      vc = VC()
      vc.get_vc("{model.pth}")
      tgt_sr, audio_opt, times, _ = vc.vc_single(
            1, Path("{InputAudio}")
      )
      wavfile.write("{OutputAudio}", tgt_sr, audio_opt)


if __name__ == "__main__":
      load_dotenv("{envPath}")
      main()

CLI Usage

Inference Audio

rvc infer -m {model.pth} -i {input.wav} -o {output.wav}
option flag  type default value description
modelPath -m Path *required Model path or filename (reads in the directory set in env)
inputPath -i Path *required Input audio path or folder
outputPath -o Path *required Output audio path or folder
sid -s int 0 Speaker/Singer ID
f0_up_key -fu int 0 Transpose (integer, number of semitones, raise by an octave: 12, lower by an octave: -12)
f0_method -fm str rmvpe pitch extraction algorithm (pm, harvest, crepe, rmvpe
f0_file -ff Path | None None F0 curve file (optional). One pitch per line. Replaces the default F0 and pitch modulation
index_file -if Path | None None Path to the feature index file
index_rate -if float 0.75 Search feature ratio (controls accent strength, too high has artifacting)
filter_radius -fr int 3 If >=3: apply median filtering to the harvested pitch results. The value represents the filter radius and can reduce breathiness
resample_sr -rsr int 0 Resample the output audio in post-processing to the final sample rate. Set to 0 for no resampling
rms_mix_rate -rmr float 0.25 Adjust the volume envelope scaling. Closer to 0, the more it mimicks the volume of the original vocals. Can help mask noise and make volume sound more natural when set relatively low. Closer to 1 will be more of a consistently loud volume
protect -p float 0.33 Protect voiceless consonants and breath sounds to prevent artifacts such as tearing in electronic music. Set to 0.5 to disable. Decrease the value to increase protection, but it may reduce indexing accuracy

API Usage

First, start up the server.

rvc-api

or

poetry run poe rvc-api

Inference Audio

curl -X 'POST' \
      'http://127.0.0.1:8000/inference' \
      -H 'accept: application/json' \
      -H 'Content-Type: multipart/form-data' \
      -F 'modelpath={model.pth}' \
      -F 'input={input audio path}' \
      -o {output audio path}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rvc-0.3.5.tar.gz (90.0 kB view details)

Uploaded Source

Built Distribution

rvc-0.3.5-py3-none-any.whl (130.0 kB view details)

Uploaded Python 3

File details

Details for the file rvc-0.3.5.tar.gz.

File metadata

  • Download URL: rvc-0.3.5.tar.gz
  • Upload date:
  • Size: 90.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.2 Darwin/23.2.0

File hashes

Hashes for rvc-0.3.5.tar.gz
Algorithm Hash digest
SHA256 ad59fd93c004a8542240ef3ae5561a0f03b09ce6c7679e63fd582b1478d0bfa7
MD5 513d51478b7983d3e64681f34e528829
BLAKE2b-256 2529919d3e8e8d52ecc6a152683c7884d1c4914e79db6d7668fb9e88e427bb2b

See more details on using hashes here.

File details

Details for the file rvc-0.3.5-py3-none-any.whl.

File metadata

  • Download URL: rvc-0.3.5-py3-none-any.whl
  • Upload date:
  • Size: 130.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.7.1 CPython/3.11.2 Darwin/23.2.0

File hashes

Hashes for rvc-0.3.5-py3-none-any.whl
Algorithm Hash digest
SHA256 5e55c887be49850869a347c71a51af03b0d5e86f737c9078ca14241730490de2
MD5 ac3507afbb2c6e18ab5a71ee29cc2ee2
BLAKE2b-256 db77090f8f9ed42eea138042f68b4734ee79e330c679531a7e20508701e33fc6

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page