Inference tools for RVC
RVC INFER PY
rvc_inferpy is a Python library designed for audio inference using RVC (Retrieval-based Voice Conversion). It provides a straightforward interface to process audio files with various configurable parameters.
Installation
Install the package using pip:
pip install rvc-inferpy
Usage
Here's a simple example demonstrating how to use the library:
from rvc_inferpy import infer_audio

inferred_audio = infer_audio(
    MODEL_NAME="model_name_here",    # Name or path of the RVC model
    SOUND_PATH="path_to_audio.wav",  # Path to the input audio file
    F0_CHANGE=0,                     # Shift applied to the fundamental frequency (pitch)
    F0_METHOD="crepe",               # F0 extraction method ("crepe", "dio", etc.)
    MIN_PITCH=50,                    # Minimum pitch value
    MAX_PITCH=800,                   # Maximum pitch value
    CREPE_HOP_LENGTH=128,            # Hop length for Crepe
    INDEX_RATE=1.0,                  # Index rate for model inference
    FILTER_RADIUS=3,                 # Radius for smoothing filters
    RMS_MIX_RATE=0.75,               # Mixing rate for RMS
    PROTECT=0.33,                    # Protection for voiceless consonants and breath sounds
    SPLIT_INFER=True,                # Whether to split audio for inference
    MIN_SILENCE=0.5,                 # Minimum silence duration for splitting, in seconds
    SILENCE_THRESHOLD=-40,           # Silence threshold in dB
    SEEK_STEP=10,                    # Seek step in milliseconds
    KEEP_SILENCE=0.1,                # Silence duration to keep, in seconds
    FORMANT_SHIFT=0.0,               # Amount of formant shifting
    QUEFRENCY=0.0,                   # Cepstrum quefrency adjustment
    TIMBRE=1.0,                      # Timbre preservation level
    F0_AUTOTUNE=False,               # Enable or disable F0 autotuning
    OUTPUT_FORMAT="wav",             # Desired output format (e.g., "wav", "mp3")
)
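When the same settings are reused across many files, it can help to keep them in one dictionary and check a few obvious constraints before calling `infer_audio`. The sketch below is only illustrative: the helper name `build_infer_kwargs` and the pitch-range check are assumptions and not part of rvc_inferpy. The parameter names and default values are copied from the example above.

```python
# Hypothetical helper, not part of rvc_inferpy: collects the keyword
# arguments shown in the usage example and sanity-checks them.
DEFAULTS = {
    "F0_CHANGE": 0,
    "F0_METHOD": "crepe",
    "MIN_PITCH": 50,
    "MAX_PITCH": 800,
    "CREPE_HOP_LENGTH": 128,
    "INDEX_RATE": 1.0,
    "FILTER_RADIUS": 3,
    "RMS_MIX_RATE": 0.75,
    "PROTECT": 0.33,
    "SPLIT_INFER": True,
    "MIN_SILENCE": 0.5,
    "SILENCE_THRESHOLD": -40,
    "SEEK_STEP": 10,
    "KEEP_SILENCE": 0.1,
    "FORMANT_SHIFT": 0.0,
    "QUEFRENCY": 0.0,
    "TIMBRE": 1.0,
    "F0_AUTOTUNE": False,
    "OUTPUT_FORMAT": "wav",
}


def build_infer_kwargs(model_name, sound_path, **overrides):
    """Merge overrides into the example defaults and validate them."""
    unknown = set(overrides) - set(DEFAULTS)
    if unknown:
        # Catch typos such as F0_METOD before they reach the library.
        raise ValueError(f"unknown parameters: {sorted(unknown)}")
    kwargs = {
        "MODEL_NAME": model_name,
        "SOUND_PATH": sound_path,
        **DEFAULTS,
        **overrides,
    }
    # Assumed constraint: the pitch range must be non-empty.
    if kwargs["MIN_PITCH"] >= kwargs["MAX_PITCH"]:
        raise ValueError("MIN_PITCH must be below MAX_PITCH")
    return kwargs


kwargs = build_infer_kwargs("model_name_here", "path_to_audio.wav", F0_CHANGE=2)
# With the package installed, the call would then be:
#     from rvc_inferpy import infer_audio
#     inferred_audio = infer_audio(**kwargs)
```

Rejecting unknown keys up front is a design choice for catching misspelled parameter names early; drop that check if you want to pass through options not listed here.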
Usage with the CLI
You can also run inference from the command line. To list the available options:
rvc-infer -h
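If you drive the tool from a Python script, you may want to confirm that the `rvc-infer` executable is on `PATH` before relying on it. The following is a minimal sketch using only the standard library. The function name `cli_help` is hypothetical, and the only flag used is `-h`, as shown above; no other command-line options are assumed.

```python
import shutil
import subprocess


def cli_help(executable="rvc-infer"):
    """Return the CLI help text, or None if the executable is not on PATH."""
    path = shutil.which(executable)
    if path is None:
        return None
    # check=False: the caller decides how to treat a non-zero exit status.
    result = subprocess.run(
        [path, "-h"], capture_output=True, text=True, check=False
    )
    return result.stdout or result.stderr


help_text = cli_help()
if help_text is None:
    print("rvc-infer not found; try: pip install rvc-inferpy")
else:
    print(help_text)
```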
Parameters
- MODEL_NAME: Name or path of the RVC model to use.
- SOUND_PATH: Path to the input audio file to be processed.
- F0_CHANGE: Adjusts the fundamental frequency (F0) of the audio.
- F0_METHOD: Method for extracting F0 (e.g., "crepe", "dio").
- MIN_PITCH / MAX_PITCH: Minimum and maximum pitch values for processing.
- CREPE_HOP_LENGTH: Hop length parameter for the Crepe method.
- INDEX_RATE: Determines the index rate for the inference model.
- FILTER_RADIUS: Radius used for smoothing filters.
- RMS_MIX_RATE: Mix rate for RMS adjustments.
- PROTECT: Protects voiceless consonants and breath sounds from conversion artifacts.
- SPLIT_INFER: Splits the audio for inference if set to True.
- MIN_SILENCE: Minimum silence duration for splitting audio (in seconds).
- SILENCE_THRESHOLD: Threshold to detect silence (in decibels).
- SEEK_STEP: Seek step in milliseconds during splitting.
- KEEP_SILENCE: Duration of silence to retain after processing (in seconds).
- FORMANT_SHIFT: Amount of formant shifting applied.
- QUEFRENCY: Adjusts the quefrency in the cepstrum domain.
- TIMBRE: Controls timbre preservation during processing.
- F0_AUTOTUNE: Enables or disables F0 autotuning.
- OUTPUT_FORMAT: Specifies the output file format (e.g., "wav", "mp3").
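To make the splitting parameters more concrete, here is a small standard-library sketch of threshold-based silence detection. It is not the code rvc_inferpy uses. The function `find_silences` is hypothetical, and it simply assumes that MIN_SILENCE, SILENCE_THRESHOLD, and SEEK_STEP behave the way their descriptions suggest: the level is measured every SEEK_STEP milliseconds, and a stretch counts as silence when it stays below SILENCE_THRESHOLD dB for at least MIN_SILENCE seconds.

```python
import math


def find_silences(samples, sample_rate, min_silence=0.5,
                  silence_threshold=-40.0, seek_step_ms=10):
    """Return (start_s, end_s) spans whose level stays below the threshold.

    `samples` are floats in [-1.0, 1.0]; levels are RMS values in dB
    relative to full scale, measured over windows of `seek_step_ms`.
    """
    step = max(1, int(sample_rate * seek_step_ms / 1000))
    spans, run_start = [], None
    for start in range(0, len(samples), step):
        window = samples[start:start + step]
        rms = math.sqrt(sum(x * x for x in window) / len(window))
        level_db = 20 * math.log10(rms) if rms > 0 else float("-inf")
        if level_db < silence_threshold:
            if run_start is None:
                run_start = start          # a quiet run begins here
        elif run_start is not None:
            spans.append((run_start, start))  # the quiet run just ended
            run_start = None
    if run_start is not None:
        spans.append((run_start, len(samples)))
    # Keep only runs that last at least `min_silence` seconds.
    return [(a / sample_rate, b / sample_rate)
            for a, b in spans if (b - a) / sample_rate >= min_silence]


# One second of signal, one second of digital silence, one second of signal.
tone = [0.5, -0.5] * 500
signal = tone + [0.0] * 1000 + tone
print(find_silences(signal, sample_rate=1000))  # [(1.0, 2.0)]
```

In this sketch, raising `min_silence` or lowering `silence_threshold` produces fewer split points, and a larger `seek_step_ms` trades boundary precision for speed.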
Output
The function returns an audio object with the processed audio based on the provided parameters.
License
This project is licensed under the MIT License.
For more details, visit the repository.
Download files
File details
Details for the file rvc_inferpy-0.4.1.tar.gz.
File metadata
- Download URL: rvc_inferpy-0.4.1.tar.gz
- Upload date:
- Size: 48.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.10.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 77d706644253a767b5d1e7be072f2b7c9f459e4368635595b023115f725ed5df |
| MD5 | 5e15241584cddaaf0e1c2d492ac740d0 |
| BLAKE2b-256 | 9abc54b9fcf5dd7ab07cc69e74e2adb2e0acd8da5fe67da5cae4334e6dce639f |
File details
Details for the file rvc_inferpy-0.4.1-py3-none-any.whl.
File metadata
- Download URL: rvc_inferpy-0.4.1-py3-none-any.whl
- Upload date:
- Size: 52.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.0.1 CPython/3.10.12
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | dce8cbd46da890dbe25757dfb01c305ab27c6f1d7ae0633063c83b9210721c1d |
| MD5 | e997374f87b5eca697bf4375b66e17e8 |
| BLAKE2b-256 | 0a591fbda4ca83835e438dfcca61d9a897fbf238f0b75526a387f95b845f9681 |
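If you download one of these files by hand, you can compare its SHA256 digest with the value listed above. The sketch below uses Python's standard `hashlib` module. The helper names `sha256_of` and `matches_published_hash` are illustrative, and the file name in the usage comment assumes the archive sits in the current directory.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        # Read in chunks so large files do not have to fit in memory.
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a local file's SHA256 digest with a published one."""
    return sha256_of(path) == expected_hex.strip().lower()


# Usage, assuming the source distribution was downloaded locally:
#     matches_published_hash(
#         "rvc_inferpy-0.4.1.tar.gz",
#         "77d706644253a767b5d1e7be072f2b7c9f459e4368635595b023115f725ed5df",
#     )
```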