A fork of so-vits-svc.

These details have been verified by PyPI

Project links

GitHub Statistics

Maintainers

34j

These details have not been verified by PyPI

Project links

documentation

Development Status
- 2 - Pre-Alpha
Intended Audience
- Developers
Natural Language
- English
Operating System
- OS Independent
Programming Language
Topic
- Software Development :: Libraries

Project description

SoftVC VITS Singing Voice Conversion Fork

A fork of so-vits-svc with realtime support and a greatly improved interface. It is based on upstream branch 4.0 (v1) (or 4.1), and 4.0 (v1) models are compatible. 4.1 models and other models are not supported.

No Longer Maintained

Reasons

Within a year, the technology has evolved enormously and there are many better alternatives
Was hoping to create a more modular, easy-to-install repository, but didn't have the skills, time, or money to do so
PySimpleGUI is no longer LGPL
Using Typer is getting more popular than directly using Click

Alternatives

Beware of the few influencers who are overly excited about every new project or technology. Take every social media post with a degree of skepticism.

The voice changer boom that occurred in 2023 has come to an end, and many developers, not just those in this repository, have not been very active for a while.

There are too many alternatives to list them all here, but some of them are:

RVC family: IAHispano/Applio (MIT) (actively maintained), fumiama's RVC (AGPL) and original RVC (MIT) (no longer maintained)
VCClient (MIT etc.) offers a web-based GUI for real-time conversion but is not very actively maintained.
fish-diffusion tried to be quite modular but is not actively maintained.
yxlllc/DDSP-SVC - new releases are issued occasionally. yxlllc/ReFlow-VAE-SVC
coqui-ai/TTS was for TTS but was partially modular. However, it is not maintained anymore, unfortunately.

Elsewhere, several start-ups have improved and marketed voice changers (probably for profit).

Updates to this repository have been limited to maintenance since Spring 2023. It is difficult to narrow the list of alternatives here, but please consider trying other projects if you are looking for a voice changer with even better performance (especially in terms of latency, apart from quality).

> ~~However, this project may be ideal for those who want to try out voice conversion for the moment (because it is easy to install).~~

Features not available in the original repo

Realtime voice conversion (enhanced in v1.1.0)
Partially integrates QuickVC
Fixed misuse of ContentVec in the original repository.
More accurate pitch estimation using CREPE.
GUI and unified CLI available
~2x faster training
Ready to use just by installing with pip.
Automatically downloads pretrained models. No need to install fairseq.
Code completely formatted with black, isort, autoflake etc.

Installation

Option 1. One click easy installation

This BAT file will automatically perform the steps described below.

Option 2. Manual installation (using pipx, experimental)

1. Installing pipx

Windows (development version required due to pypa/pipx#940):

py -3 -m pip install --user git+https://github.com/pypa/pipx.git
py -3 -m pipx ensurepath

Linux/MacOS:

python -m pip install --user pipx
python -m pipx ensurepath

2. Installing so-vits-svc-fork

pipx install so-vits-svc-fork --python=3.11
pipx inject so-vits-svc-fork torch torchaudio --pip-args="--upgrade" --index-url=https://download.pytorch.org/whl/cu121 # https://download.pytorch.org/whl/nightly/cu121

Option 3. Manual installation

Creating a virtual environment

Windows:

py -3.11 -m venv venv
venv\Scripts\activate

Linux/MacOS:

python3.11 -m venv venv
source venv/bin/activate

Anaconda:

conda create -n so-vits-svc-fork python=3.11 pip
conda activate so-vits-svc-fork

Installing without creating a virtual environment may cause a PermissionError if Python is installed in Program Files, etc.

Install this via pip (or your favourite package manager that uses pip):

python -m pip install -U pip setuptools wheel
pip install -U torch torchaudio --index-url https://download.pytorch.org/whl/cu121 # https://download.pytorch.org/whl/nightly/cu121
pip install -U so-vits-svc-fork

Notes

If no GPU is available or you are using macOS, simply skip the pip install -U torch torchaudio --index-url https://download.pytorch.org/whl/cu121 step. MPS is probably supported.
If you are using an AMD GPU on Linux, replace --index-url https://download.pytorch.org/whl/cu121 with --index-url https://download.pytorch.org/whl/nightly/rocm5.7. AMD GPUs are not supported on Windows (#120).
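
The two notes above amount to a small decision table. The following is a minimal sketch of that table in Python, not part of the package. The function name and the `accelerator` and `system` parameters are hypothetical; only the two index URLs and the pip arguments come from the instructions above.

```python
from typing import Optional

# Index URLs quoted from the installation notes.
CUDA_INDEX_URL = "https://download.pytorch.org/whl/cu121"
ROCM_INDEX_URL = "https://download.pytorch.org/whl/nightly/rocm5.7"


def torch_install_command(accelerator: str, system: str) -> Optional[list[str]]:
    """Return the pip command for the torch step, or None to skip it.

    `accelerator` is "nvidia", "amd", or "none" (hypothetical labels).
    `system` is a value in the style of platform.system():
    "Windows", "Linux", or "Darwin" (macOS).
    """
    base = ["pip", "install", "-U", "torch", "torchaudio", "--index-url"]
    if accelerator == "none" or system == "Darwin":
        # No GPU, or macOS: the notes say to drop the explicit torch step.
        return None
    if accelerator == "amd":
        if system == "Windows":
            raise ValueError("AMD GPUs are not supported on Windows")
        return base + [ROCM_INDEX_URL]
    if accelerator == "nvidia":
        return base + [CUDA_INDEX_URL]
    raise ValueError(f"unknown accelerator: {accelerator!r}")


if __name__ == "__main__":
    command = torch_install_command("amd", "Linux")
    print(" ".join(command))
```

In every case, pip install -U so-vits-svc-fork still has to be run afterwards; the sketch only covers the torch step.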

Update

Please update this package regularly to get the latest features and bug fixes.

pip install -U so-vits-svc-fork
# pipx upgrade so-vits-svc-fork

Usage

Inference

GUI

The GUI launches with the following command:

svcg

CLI

Realtime (from microphone)

svc vc

File

svc infer source.wav

Pretrained models are available on Hugging Face or CIVITAI.

Notes

If using WSL, please note that WSL requires additional setup to handle audio and the GUI will not work without finding an audio device.
In real-time inference, if there is noise on the inputs, the HuBERT model will react to those as well. Consider using realtime noise reduction applications such as RTX Voice in this case.
Models other than for 4.0v1 or this repository are not supported.
GPU inference requires at least 4 GB of VRAM. If it does not work, try CPU inference as it is fast enough.

Training

Before training

If your dataset has BGM, please remove the BGM using software such as Ultimate Vocal Remover. 3_HP-Vocal-UVR.pth or UVR-MDX-NET Main is recommended.
If your dataset is a long audio file with a single speaker, use svc pre-split to split the dataset into multiple files (using librosa).
If your dataset is a long audio file with multiple speakers, use svc pre-sd to split the dataset into multiple files (using pyannote.audio). Further manual classification may be necessary due to accuracy issues. If speakers speak with a variety of speech styles, set --min-speakers larger than the actual number of speakers. Due to unresolved dependencies, please install pyannote.audio manually: pip install pyannote-audio.
To manually classify audio files, svc pre-classify is available. Up and down arrow keys can be used to change the playback speed.

Cloud

[^p]

If you do not have access to a GPU with more than 10 GB of VRAM, the free plan of Google Colab is recommended for light users and the Pro/Growth plan of Paperspace is recommended for heavy users. Conversely, if you have access to a high-end GPU, the use of cloud services is not recommended.

[^p]: If you register a referral code and then add a payment method, you may save about $5 on your first month's billing. Note that both referral rewards are Paperspace credits and not cash. Including the referral code was a tough decision, but it was added because debugging and training the initial model require a large amount of computing power and the developer is a student.

Local

Place your dataset like dataset_raw/{speaker_id}/**/{wav_file}.{any_format} (subfolders and non-ASCII filenames are acceptable) and run:

svc pre-resample
svc pre-config
svc pre-hubert
svc train -t
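
Before running the commands above, it can help to check that the dataset folder matches the expected layout. The following is a minimal sketch, not part of the package: it walks dataset_raw/{speaker_id}/** and groups files by speaker. The function name `files_per_speaker` is hypothetical, and the sketch does not check whether a file is actually audio.

```python
from collections import defaultdict
from pathlib import Path


def files_per_speaker(dataset_root: Path) -> dict[str, list[Path]]:
    """Group every file under dataset_root/{speaker_id}/** by speaker_id.

    Files placed directly in dataset_root (outside a speaker folder) are
    ignored, and speaker folders without any file do not appear in the result.
    """
    speakers: dict[str, list[Path]] = defaultdict(list)
    for speaker_dir in sorted(p for p in dataset_root.iterdir() if p.is_dir()):
        # rglob("*") descends into subfolders, which the layout allows.
        for path in sorted(speaker_dir.rglob("*")):
            if path.is_file():
                speakers[speaker_dir.name].append(path)
    return dict(speakers)


if __name__ == "__main__":
    root = Path("dataset_raw")
    if root.is_dir():
        for speaker, files in files_per_speaker(root).items():
            print(f"{speaker}: {len(files)} file(s)")
```

A speaker with very few files, or files sitting directly in dataset_raw, is usually a sign that the folder structure needs fixing before preprocessing.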

Notes

Each audio file in the dataset should be no longer than about 10 seconds.
Training needs at least 4 GB of VRAM.
It is recommended to increase the batch_size as much as possible in config.json before the train command to match the VRAM capacity. Setting batch_size to auto-{init_batch_size}-{max_n_trials} (or simply auto) will automatically increase batch_size until OOM error occurs, but may not be useful in some cases.
To use CREPE, replace svc pre-hubert with svc pre-hubert -fm crepe.
To use ContentVec correctly, replace svc pre-config with svc pre-config -t so-vits-svc-4.0v1. Training may take slightly longer because some weights are reset due to reusing legacy initial generator weights.
To use MS-iSTFT Decoder, replace svc pre-config with svc pre-config -t quickvc.
Silence removal and volume normalization are automatically performed (as in the upstream repo) and are not required.
If you have trained on a large, copyright-free dataset, consider releasing it as an initial model.
For further details (e.g. parameters, etc.), you can see the Wiki or Discussions.
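
The command variations in the notes above can be summarised in a short sketch. It only builds the command lines and does not run them. The function name and its two parameters are hypothetical; the subcommands and the -fm and -t flags are the ones quoted in the notes, and appending -t so-vits-svc-4.0v1 to svc pre-config is this sketch's reading of the ContentVec note.

```python
import shlex
from typing import Optional


def training_commands(
    f0_method: Optional[str] = None,
    config_type: Optional[str] = None,
) -> list[list[str]]:
    """Return the four training steps as argument lists, in run order.

    f0_method:   e.g. "crepe" (adds "-fm crepe" to svc pre-hubert).
    config_type: e.g. "quickvc" or "so-vits-svc-4.0v1"
                 (adds "-t <type>" to svc pre-config).
    """
    pre_config = ["svc", "pre-config"]
    if config_type is not None:
        pre_config += ["-t", config_type]

    pre_hubert = ["svc", "pre-hubert"]
    if f0_method is not None:
        pre_hubert += ["-fm", f0_method]

    return [
        ["svc", "pre-resample"],
        pre_config,
        pre_hubert,
        ["svc", "train", "-t"],
    ]


if __name__ == "__main__":
    for command in training_commands(f0_method="crepe", config_type="quickvc"):
        print(shlex.join(command))
```

Each argument list can be passed to subprocess.run(command, check=True) to execute the steps one after another, stopping at the first failure.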

Further help

For more details, run svc -h or svc <subcommand> -h.

> svc -h
Usage: svc [OPTIONS] COMMAND [ARGS]...

  so-vits-svc allows any folder structure for training data.
  However, the following folder structure is recommended.
      When training: dataset_raw/{speaker_name}/**/{wav_name}.{any_format}
      When inference: configs/44k/config.json, logs/44k/G_XXXX.pth
  If the folder structure is followed, you DO NOT NEED TO SPECIFY model path, config path, etc.
  (The latest model will be automatically loaded.)
  To train a model, run pre-resample, pre-config, pre-hubert, train.
  To infer a model, run infer.

Options:
  -h, --help  Show this message and exit.

Commands:
  clean          Clean up files, only useful if you are using the default file structure
  infer          Inference
  onnx           Export model to onnx (currently not working)
  pre-classify   Classify multiple audio files into multiple files
  pre-config     Preprocessing part 2: config
  pre-hubert     Preprocessing part 3: hubert If the HuBERT model is not found, it will be...
  pre-resample   Preprocessing part 1: resample
  pre-sd         Speech diarization using pyannote.audio
  pre-split      Split audio files into multiple files
  train          Train model If D_0.pth or G_0.pth not found, automatically download from hub.
  train-cluster  Train k-means clustering
  vc             Realtime inference from microphone
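
The help text says that the latest model is loaded automatically when the recommended layout (logs/44k/G_XXXX.pth) is followed. The following is a minimal sketch of how such a lookup could work, not the package's actual code. It assumes that "latest" means the highest number in the file name; the real implementation may use a different rule, and the function name is hypothetical.

```python
import re
from pathlib import Path
from typing import Optional

# Matches generator checkpoints such as G_0.pth or G_1600.pth.
_CHECKPOINT_RE = re.compile(r"^G_(\d+)\.pth$")


def latest_generator_checkpoint(model_dir: Path) -> Optional[Path]:
    """Return the G_<number>.pth file with the largest number, or None."""
    best_step = -1
    best_path: Optional[Path] = None
    for path in model_dir.glob("G_*.pth"):
        match = _CHECKPOINT_RE.match(path.name)
        if match is None:
            continue  # skip names such as G_final.pth
        step = int(match.group(1))
        if step > best_step:
            best_step = step
            best_path = path
    return best_path


if __name__ == "__main__":
    print(latest_generator_checkpoint(Path("logs/44k")))
```

Comparing the numbers as integers matters here: sorting the names as plain strings would place G_800.pth after G_1600.pth.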

External Links

Video Tutorial

Contributors ✨

Thanks go to these wonderful people (emoji key):

34j 💻 🤔 📖 💡 🚇 🚧 👀 ⚠️ ✅ 📣 🐛	GarrettConway 💻 🐛 📖 👀	BlueAmulet 🤔 💬 💻 🚧	ThrowawayAccount01 🐛	緋 📖 🐛	Lordmau5 🐛 💻 🤔 🚧 💬 📓	DL909 🐛
Satisfy256 🐛	Pierluigi Zagaria 📓	ruckusmattster 🐛	Desuka-art 🐛	heyfixit 📖	Nerdy Rodent 📹	谢宇 📖
ColdCawfee 🐛	sbersier 🤔 📓 🐛	Meldoner 🐛 🤔 💻	mmodeusher 🐛	AlonDan 🐛	Likkkez 🐛	Duct Tape Games 🐛
Xianglong He 🐛	75aosu 🐛	tonyco82 🐛	yxlllc 🤔 💻	outhipped 🐛	escoolioinglesias 🐛 📓 📹	Blacksingh 🐛
Mgs. M. Thoyib Antarnusa 🐛	Exosfeer 🐛 💻	guranon 🐛 🤔 💻	Alexander Koumis 💻	acekagami 🌍	Highupech 🐛	Scorpi 💻
Maximxls 💻	Star3Lord 🐛 💻	Forkoz 🐛 💻	Zerui Chen 💻 🤔	Roee Shenberg 📓 🤔 💻	Justas 🐛 💻	Onako2 📖
4ll0w3v1l 💻	j5y0V6b 🛡️	marcellocirelli 🐛	Priyanshu Patel 💻	Anna Gorshunova 🐛 💻

This project follows the all-contributors specification. Contributions of any kind welcome!

Credits

This package was created with Copier and the browniebroke/pypackage-template project template.

Release history

4.2.30

Feb 2, 2026

4.2.29

Oct 27, 2025

4.2.28

Oct 26, 2025

4.2.27

Sep 10, 2025

4.2.26

Jul 29, 2024

4.2.25

Jul 29, 2024

4.2.24

Jul 18, 2024

4.2.23

Jul 18, 2024

4.2.22

Jul 18, 2024

4.2.21

Jul 4, 2024

4.2.20

Jul 4, 2024

4.2.19

Jul 4, 2024

4.2.18

Jul 4, 2024

4.2.17

Jul 4, 2024

4.2.16

Jul 4, 2024

4.2.15

Jul 3, 2024

4.2.14

Jul 3, 2024

4.2.13

Jul 3, 2024

4.2.12

Jul 3, 2024

4.2.11

Jul 2, 2024

4.2.10

Jul 2, 2024

4.2.9

May 23, 2024

4.2.8

May 22, 2024

4.2.7

May 22, 2024

4.2.6

May 18, 2024

4.2.5

May 16, 2024

4.2.4

May 16, 2024

4.2.3

May 10, 2024

4.2.2

May 10, 2024

4.2.1

May 10, 2024

4.2.0

Apr 11, 2024

4.1.61

Apr 6, 2024

4.1.60

Apr 6, 2024

4.1.59

Apr 6, 2024

4.1.58

Mar 25, 2024

4.1.57

Mar 25, 2024

4.1.56

Mar 5, 2024

4.1.55

Mar 4, 2024

4.1.54

Mar 3, 2024

4.1.53

Feb 28, 2024

4.1.52

Feb 25, 2024

4.1.51

Feb 23, 2024

4.1.50

Feb 22, 2024

4.1.49

Feb 21, 2024

4.1.48

Feb 16, 2024

4.1.47

Feb 10, 2024

4.1.46

Feb 8, 2024

4.1.45

Feb 5, 2024

4.1.44

Feb 3, 2024

4.1.43

Feb 2, 2024

4.1.42

Jan 30, 2024

4.1.41

Jan 29, 2024

4.1.40

Jan 24, 2024

4.1.39

Jan 22, 2024

4.1.38

Jan 11, 2024

4.1.37

Jan 3, 2024

4.1.36

Jan 3, 2024

4.1.35

Jan 3, 2024

4.1.34

Jan 3, 2024

4.1.33

Jan 2, 2024

4.1.32

Nov 21, 2023

4.1.31

Nov 18, 2023

4.1.30

Nov 16, 2023

4.1.29

Nov 16, 2023

4.1.28

Nov 16, 2023

4.1.27

Nov 15, 2023

4.1.26

Nov 14, 2023

4.1.25

Nov 9, 2023

4.1.24

Nov 8, 2023

4.1.23

Nov 2, 2023

4.1.22

Oct 30, 2023

4.1.21

Oct 26, 2023

4.1.20

Oct 26, 2023

4.1.19

Oct 21, 2023

4.1.18

Oct 21, 2023

4.1.17

Oct 19, 2023

4.1.16

Oct 18, 2023

4.1.15

Oct 13, 2023

4.1.14

Oct 13, 2023

4.1.13

Oct 13, 2023

4.1.12

Oct 13, 2023

4.1.11

Sep 23, 2023

4.1.10

Sep 17, 2023

4.1.9

Sep 16, 2023

4.1.8

Sep 15, 2023

4.1.7

Sep 12, 2023

4.1.6

Sep 6, 2023

4.1.5

Sep 5, 2023

4.1.4

Sep 2, 2023

4.1.3

Aug 30, 2023

4.1.2

Aug 28, 2023

4.1.1

Jul 2, 2023

4.1.0

Jun 25, 2023

4.0.3

Jun 25, 2023

4.0.2

Jun 14, 2023

4.0.1

May 29, 2023

4.0.0

May 29, 2023

3.15.0

May 22, 2023

3.14.1

May 7, 2023

3.14.0

May 6, 2023

3.13.3

May 6, 2023

3.13.2

May 6, 2023

3.13.1

May 4, 2023

3.13.0

May 4, 2023

3.12.1

Apr 30, 2023

3.12.0

Apr 30, 2023

3.11.2

Apr 30, 2023

3.11.1

Apr 30, 2023

3.11.0

Apr 23, 2023

3.10.5

Apr 22, 2023

3.10.4

Apr 21, 2023

3.10.3

Apr 19, 2023

3.10.2

Apr 19, 2023

3.10.1

Apr 19, 2023

3.10.0

Apr 18, 2023

3.9.5

Apr 18, 2023

3.9.4

Apr 18, 2023

3.9.3

Apr 16, 2023

3.9.2

Apr 16, 2023

3.9.1

Apr 16, 2023

3.9.0

Apr 16, 2023

3.8.1

Apr 16, 2023

3.8.0

Apr 15, 2023

3.7.3

Apr 15, 2023

3.7.2

Apr 15, 2023

3.7.1

Apr 15, 2023

3.7.0

Apr 14, 2023

3.6.2

Apr 14, 2023

3.6.1

Apr 14, 2023

3.6.0

Apr 13, 2023

3.5.1

Apr 13, 2023

3.5.0

Apr 13, 2023

3.4.0

Apr 13, 2023

3.3.1

Apr 13, 2023

3.3.0

Apr 13, 2023

3.2.0

Apr 13, 2023

3.1.13

Apr 12, 2023

3.1.12

Apr 12, 2023

3.1.11

Apr 12, 2023

3.1.10

Apr 11, 2023

3.1.9

Apr 10, 2023

3.1.8

Apr 10, 2023

3.1.7

Apr 9, 2023

3.1.6

Apr 9, 2023

3.1.5

Apr 9, 2023

3.1.4

Apr 9, 2023

3.1.3

Apr 9, 2023

3.1.2

Apr 9, 2023

3.1.1

Apr 8, 2023

3.1.0

Apr 8, 2023

3.0.5

Apr 8, 2023

3.0.4

Apr 6, 2023

3.0.3

Apr 5, 2023

3.0.2

Apr 4, 2023

3.0.1

Apr 3, 2023

3.0.0

Apr 3, 2023

2.1.5

Apr 1, 2023

2.1.4

Mar 31, 2023

2.1.3

Mar 31, 2023

2.1.2

Mar 28, 2023

2.1.1

Mar 27, 2023

2.1.0

Mar 27, 2023

2.0.0

Mar 27, 2023

1.4.3

Mar 26, 2023

1.4.2

Mar 26, 2023

1.4.1

Mar 26, 2023

1.4.0

Mar 26, 2023

1.3.6

Mar 26, 2023

1.3.5

Mar 26, 2023

1.3.4

Mar 25, 2023

1.3.3

Mar 25, 2023

1.3.2

Mar 24, 2023

1.3.1

Mar 24, 2023

1.3.0

Mar 23, 2023

1.2.11

Mar 23, 2023

1.2.10

Mar 23, 2023

1.2.9

Mar 23, 2023

1.2.8

Mar 22, 2023

1.2.7

Mar 22, 2023

1.2.6

Mar 22, 2023

1.2.5

Mar 22, 2023

1.2.4

Mar 22, 2023

1.2.3

Mar 21, 2023

1.2.2

Mar 21, 2023

1.2.1

Mar 21, 2023

1.2.0

Mar 21, 2023

1.1.1

Mar 21, 2023

1.1.0

Mar 21, 2023

1.0.2

Mar 21, 2023

1.0.1

Mar 20, 2023

1.0.0

Mar 20, 2023

0.8.2

Mar 20, 2023

0.8.1

Mar 20, 2023

0.8.0

Mar 20, 2023

0.7.1

Mar 20, 2023

0.6.3

Mar 20, 2023

0.6.2

Mar 19, 2023

0.6.1

Mar 19, 2023

0.6.0

Mar 18, 2023

0.5.0

Mar 18, 2023

0.4.1

Mar 18, 2023

0.4.0

Mar 18, 2023

0.3.0

Mar 17, 2023

0.2.1

Mar 17, 2023

0.2.0

Mar 17, 2023

0.1.0

Mar 17, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

so_vits_svc_fork-4.2.30.tar.gz (93.7 kB)

Uploaded Feb 2, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

so_vits_svc_fork-4.2.30-py3-none-any.whl (93.7 kB)

Uploaded Feb 2, 2026 Python 3

File details

Details for the file so_vits_svc_fork-4.2.30.tar.gz.

File metadata

Download URL: so_vits_svc_fork-4.2.30.tar.gz
Upload date: Feb 2, 2026
Size: 93.7 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for so_vits_svc_fork-4.2.30.tar.gz
Algorithm	Hash digest
SHA256	`d3cca39fe0f6f1f881f3ac94c1c6fd7b83c2961fe1e2913f939d944d83d06fb7`
MD5	`11cb9ff8471a2780a85281d37a00d659`
BLAKE2b-256	`e72d997011c280d04549ad4d1dd515e1012bceab1c3006b6471b3d34f8e84cb2`

See more details on using hashes here.
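
A downloaded file can be checked against the SHA256 digest listed above using only the Python standard library. The following is a minimal sketch; the function names are hypothetical, and the file is read in chunks so that large downloads do not have to fit in memory.

```python
import hashlib
from pathlib import Path


def sha256_of_file(path: Path, chunk_size: int = 1 << 20) -> str:
    """Return the hex SHA256 digest of a file, reading it in 1 MiB chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_sha256(path: Path, expected_hex: str) -> bool:
    """Compare a file's digest with a published one, ignoring case and spaces."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Digest copied from the table above for the source distribution.
    expected = "d3cca39fe0f6f1f881f3ac94c1c6fd7b83c2961fe1e2913f939d944d83d06fb7"
    archive = Path("so_vits_svc_fork-4.2.30.tar.gz")
    if archive.is_file():
        print(matches_published_sha256(archive, expected))
```

If the function returns False, the file is corrupted or is not the file that was published, and it should not be installed.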

Provenance

The following attestation bundles were made for so_vits_svc_fork-4.2.30.tar.gz:

Publisher: ci.yml on voicepaw/so-vits-svc-fork

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: so_vits_svc_fork-4.2.30.tar.gz
- Subject digest: d3cca39fe0f6f1f881f3ac94c1c6fd7b83c2961fe1e2913f939d944d83d06fb7
- Sigstore transparency entry: 902744739
- Sigstore integration time: Feb 2, 2026
Source repository:
- Permalink: voicepaw/so-vits-svc-fork@922beedff7d1efd7d54c75d92f2e090e18c58369
- Branch / Tag: refs/heads/main
- Owner: https://github.com/voicepaw
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci.yml@922beedff7d1efd7d54c75d92f2e090e18c58369
- Trigger Event: push

File details

Details for the file so_vits_svc_fork-4.2.30-py3-none-any.whl.

File metadata

Download URL: so_vits_svc_fork-4.2.30-py3-none-any.whl
Upload date: Feb 2, 2026
Size: 93.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for so_vits_svc_fork-4.2.30-py3-none-any.whl
Algorithm	Hash digest
SHA256	`46149cf19d4011d89b7bae3d175f2f3f690e9275b1d76e906f4c8cbe432a161a`
MD5	`60b50814cd696e8eb98f31644306a34b`
BLAKE2b-256	`d47c02f3bab1218ce5ed4cb40de23fc743c7b47161392758b3a8e6d2d85902e8`

See more details on using hashes here.

Provenance

The following attestation bundles were made for so_vits_svc_fork-4.2.30-py3-none-any.whl:

Publisher: ci.yml on voicepaw/so-vits-svc-fork

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: so_vits_svc_fork-4.2.30-py3-none-any.whl
- Subject digest: 46149cf19d4011d89b7bae3d175f2f3f690e9275b1d76e906f4c8cbe432a161a
- Sigstore transparency entry: 902744815
- Sigstore integration time: Feb 2, 2026
Source repository:
- Permalink: voicepaw/so-vits-svc-fork@922beedff7d1efd7d54c75d92f2e090e18c58369
- Branch / Tag: refs/heads/main
- Owner: https://github.com/voicepaw
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: ci.yml@922beedff7d1efd7d54c75d92f2e090e18c58369
- Trigger Event: push
