A fork of so-vits-svc.

SoftVC VITS Singing Voice Conversion Fork

A fork of so-vits-svc with realtime support and a greatly improved interface. It is based on branch 4.0 (v1) of the original, and the models are compatible.

Installation

Install this via pip (or your favourite package manager):

pip install -U torch torchaudio --index-url https://download.pytorch.org/whl/cu117
pip install so-vits-svc-fork
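
After installation, it can help to confirm that the installed PyTorch can actually see a CUDA device. The following is a minimal sketch and not part of this package: `torch_cuda_status` is a hypothetical helper name, and the only PyTorch calls it relies on are `torch.cuda.is_available()` and `torch.__version__`.

```python
import importlib.util


def torch_cuda_status() -> str:
    """Report whether PyTorch is installed and whether it can see a CUDA device."""
    # find_spec avoids an ImportError when torch is missing.
    if importlib.util.find_spec("torch") is None:
        return "torch is not installed"
    import torch

    if torch.cuda.is_available():
        return "torch {}: CUDA available".format(torch.__version__)
    return "torch {}: CUDA not available (CPU only)".format(torch.__version__)


if __name__ == "__main__":
    print(torch_cuda_status())
```

If the result reports that CUDA is not available, the CPU-only wheel may have been installed, or no compatible GPU driver was found.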

Features not available in the original repo

  • Realtime voice conversion (enhanced in v1.1.0)
  • More accurate pitch estimation using CREPE
  • GUI available
  • Unified command-line interface (no need to run Python scripts)
  • Ready to use after a single pip install
  • Automatic download of the pretrained base model and HuBERT model
  • Code completely formatted with black, isort, autoflake etc.
  • Volume normalization in preprocessing
  • Other minor differences

Usage

Inference

GUI

Launch the GUI with the following command:

svcg

CLI

  • Realtime (from microphone)
svc vc --model-path <model-path>
  • File
svc infer --model-path <model-path> source.wav
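
To convert several files in one go, the File command can be wrapped in a small script. This is a sketch under stated assumptions, not a feature of the package: `build_infer_command` and `convert_folder` are hypothetical helper names, the `infer` subcommand is the one listed by `svc -h`, and it is assumed to accept `--model-path` followed by a source file as in the File command above.

```python
import subprocess
from pathlib import Path
from typing import List


def build_infer_command(model_path: str, source: str) -> List[str]:
    """Build the argument list for converting one file with `svc infer`."""
    # Assumption: `infer` takes --model-path and a positional source file.
    return ["svc", "infer", "--model-path", model_path, source]


def convert_folder(model_path: str, folder: str, dry_run: bool = True) -> List[List[str]]:
    """Build (and optionally run) one `svc infer` command per .wav file in a folder."""
    commands = [
        build_infer_command(model_path, str(wav))
        for wav in sorted(Path(folder).glob("*.wav"))
    ]
    if not dry_run:
        for command in commands:
            # check=True raises CalledProcessError if svc exits with an error.
            subprocess.run(command, check=True)
    return commands
```

With `dry_run=True` (the default) nothing is executed, so the returned commands can be inspected first. Run `svc infer -h` to confirm the options before switching to `dry_run=False`.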

Notes

  • In realtime inference, the HuBERT model also reacts to any noise in the input. If this is a problem, consider using a realtime noise reduction application such as RTX Voice.

Training

Google Colab

Open In Colab

Local

Place your dataset as dataset_raw/{speaker_id}/**/{wav_file}.{any_format} (subfolders are allowed), then run:

svc pre-resample
svc pre-config
svc pre-hubert
svc train
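
Before running the preprocessing commands, it can be worth checking that the dataset folder has the expected layout. The following is a minimal sketch using only the standard library; `count_files_per_speaker` is a hypothetical helper name and not part of this package. It counts every file under each `dataset_raw/{speaker_id}/` folder, including subfolders.

```python
from pathlib import Path
from typing import Dict


def count_files_per_speaker(dataset_root: str = "dataset_raw") -> Dict[str, int]:
    """Count files under each {dataset_root}/{speaker_id}/ folder, including subfolders."""
    counts: Dict[str, int] = {}
    for speaker_dir in sorted(Path(dataset_root).iterdir()):
        if not speaker_dir.is_dir():
            continue  # files directly under the root belong to no speaker
        # rglob("*") walks subfolders, matching the ** in the layout above.
        counts[speaker_dir.name] = sum(1 for p in speaker_dir.rglob("*") if p.is_file())
    return counts


if __name__ == "__main__":
    for speaker, n_files in count_files_per_speaker().items():
        print("{}: {} files".format(speaker, n_files))
```

A speaker with zero files, or files sitting directly in `dataset_raw/`, usually points to a misplaced folder.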

Notes

  • Each audio file in the dataset should be roughly 10 seconds or shorter; longer files may exhaust VRAM.
  • It is recommended to adjust batch_size in config.json before running the train command so that it matches your VRAM capacity. As tested, the default requires about 14 GB.
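
Both notes can be handled with short standard-library scripts. The following is a sketch under stated assumptions, not part of this package: `split_wav` and `set_batch_size` are hypothetical helper names, `split_wav` only handles uncompressed PCM .wav files, and `set_batch_size` simply updates every `batch_size` key it finds rather than assuming where the key sits inside config.json.

```python
import json
import wave
from pathlib import Path
from typing import List


def split_wav(path: str, out_dir: str, max_seconds: float = 10.0) -> List[Path]:
    """Split a PCM .wav file into consecutive chunks of at most max_seconds each."""
    out = Path(out_dir)
    out.mkdir(parents=True, exist_ok=True)
    written: List[Path] = []
    with wave.open(path, "rb") as src:
        params = src.getparams()
        frames_per_chunk = int(src.getframerate() * max_seconds)
        index = 0
        while True:
            frames = src.readframes(frames_per_chunk)
            if not frames:
                break
            chunk_path = out / "{}_{:04d}.wav".format(Path(path).stem, index)
            with wave.open(str(chunk_path), "wb") as dst:
                dst.setparams(params)  # frame count in the header is fixed on close
                dst.writeframes(frames)
            written.append(chunk_path)
            index += 1
    return written


def set_batch_size(config_path: str, batch_size: int) -> int:
    """Set every "batch_size" key in a JSON config; return how many were changed."""
    config_file = Path(config_path)
    config = json.loads(config_file.read_text(encoding="utf-8"))

    def update(node) -> int:
        changed = 0
        if isinstance(node, dict):
            for key, value in node.items():
                if key == "batch_size":
                    node[key] = batch_size
                    changed += 1
                else:
                    changed += update(value)
        elif isinstance(node, list):
            for item in node:
                changed += update(item)
        return changed

    changed = update(config)
    config_file.write_text(json.dumps(config, indent=2), encoding="utf-8")
    return changed
```

Note that `split_wav` cuts at fixed positions, so a chunk may start or end in the middle of a phrase; splitting on silence would likely give a cleaner dataset. `set_batch_size` rewrites the file in place, so keep a copy of the original config if you want to revert.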

Further help

For more details, run svc -h or svc <subcommand> -h.

> svc -h
Usage: svc [OPTIONS] COMMAND [ARGS]...

  so-vits-svc allows any folder structure for training data.
  However, the following folder structure is recommended.
      When training: dataset_raw/{speaker_name}/{wav_name}.wav
      When inference: configs/44k/config.json, logs/44k/G_XXXX.pth
  If the folder structure is followed, you DO NOT NEED TO SPECIFY model path, config path, etc.
  (The latest model will be automatically loaded.)
  To train a model, run pre-resample, pre-config, pre-hubert, train.
  To infer a model, run infer.

Options:
  -h, --help  Show this message and exit.

Commands:
  clean          Clean up files, only useful if you are using the default file structure
  infer          Inference
  onnx           Export model to onnx
  pre-config     Preprocessing part 2: config
  pre-hubert     Preprocessing part 3: hubert If the HuBERT model is not found, it will be...
  pre-resample   Preprocessing part 1: resample
  train          Train model If D_0.pth or G_0.pth not found, automatically download from hub.
  train-cluster  Train k-means clustering
  vc             Realtime inference from microphone

Contributors ✨

Thanks goes to these wonderful people (emoji key):

  • 34j: 💻 🤔 📖 💡 🚇 🚧 👀 ⚠️ 📣 🐛
  • GarrettConway: 💻 🐛
  • BlueAmulet: 🤔
  • ThrowawayAccount01: 🐛

This project follows the all-contributors specification. Contributions of any kind welcome!
