A fork of so-vits-svc.
Project description
SoftVC VITS Singing Voice Conversion Fork
A fork of so-vits-svc
with a greatly improved interface. Based on branch 4.0
(v1). No differences in functionality and the models are compatible.
Installation
Install this via pip (or your favourite package manager):
pip install -U torch torchaudio --index-url https://download.pytorch.org/whl/cu117
pip install so-vits-svc-fork
Features not available in the original repo
- Realtime voice conversion
- GUI available
- Unified command-line interface (no need to run Python scripts)
- Ready to use just by installing with
pip
. - Automatically download pretrained base model and HuBERT model
- Code completely formatted with black, isort, autoflake etc.
Usage
Inference
GUI
svcg
CLI
- Realtime (from microphone)
svc vc --model-path <model-path>
- File
svc --model-path <model-path> source.wav
Training
Use of Google Colab is recommended. (To train locally, you need at least 12GB of VRAM.)
Google Colab
Local
Place your dataset like dataset_raw/{speaker_id}/{wav_file}.wav
and run:
svc pre-resample
svc pre-config
svc pre-hubert
svc train
Further help
For more details, run svc -h
or svc <subcommand> -h
.
> svc -h
Usage: svc [OPTIONS] COMMAND [ARGS]...
so-vits-svc allows any folder structure for training data.
However, the following folder structure is recommended.
When training: dataset_raw/{speaker_name}/{wav_name}.wav
When inference: configs/44k/config.json, logs/44k/G_XXXX.pth
If the folder structure is followed, you DO NOT NEED TO SPECIFY model path, config path, etc.
(The latest model will be automatically loaded.)
To train a model, run pre-resample, pre-config, pre-hubert, train.
To infer a model, run infer.
Options:
-h, --help Show this message and exit.
Commands:
clean Clean up files, only useful if you are using the default file structure
infer Inference
onnx Export model to onnx
pre-config Preprocessing part 2: config
pre-hubert Preprocessing part 3: hubert If the HuBERT model is not found, it will be...
pre-resample Preprocessing part 1: resample
train Train model If D_0.pth or G_0.pth not found, automatically download from hub.
train-cluster Train k-means clustering
vc Realtime inference from microphone
Contributors ✨
Thanks goes to these wonderful people (emoji key):
34j 💻 🤔 📖 |
This project follows the all-contributors specification. Contributions of any kind welcome!
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
so_vits_svc_fork-0.4.1.tar.gz
(54.0 kB
view hashes)
Built Distribution
Close
Hashes for so_vits_svc_fork-0.4.1-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 448b4435c172341b41178588e41177100280aa3fd646023d885688900c017878 |
|
MD5 | 8e03ea17dc58ff2aa27a21532bff893c |
|
BLAKE2b-256 | 16264b39f20dd066a7886fcfa77022d984deb200737e2d19da6918fa5226e95b |