Skip to main content

Inference code for GPT-SoVITS

Project description

GPT-SoVITS-Infer

This is the inference code of GPT-SoVITS that can be developer-friendly.

Usage Example

Check out the example notebook for a quick start. Or open it in Colab

Prepare the environment

As we all know, the dependencies of an AI project are always a mess. Here is how I prepare the environment for this project:

Conda (Linux)
conda install python=3.10
conda install pytorch=2.1 torchvision torchaudio pytorch-lightning pytorch-cuda=12.1 -c pytorch -c nvidia 
conda install ffmpeg=6.1.1 -c conda-forge
MacOS
brew install ffmpeg
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu
pip3 install pytorch-lightning
pip3 install GPT-SoVITS-Infer

You can also try to prepare the environment with cpu only options, which should work, but I have not tested it yet.

After the environment is ready, you can install the package by pip:

pip install GPT-SoVITS

I do not add the packages related to torch to the dependencies of GPT-SoVITS-Infer. Check if the environment is ready if things go wrong.

Advanced Usage

  • GPTSoVITSInference.load_sovits and GPTSoVITSInference.load_gpt: You can load your own fine-tuned model by the methods.
  • GPTSoVITSInference.set_prompt_audio: Set the prompt audio for the inference.
  • GPTSoVITSInference.get_tts_wav_stream: Return a generator that yields the audio pieces of the generated audio. It will create a background thread to generate the audio, so you can get the audio pieces while the audio is still being generated.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpt_sovits_infer-0.2.5.tar.gz (3.3 MB view details)

Uploaded Source

Built Distribution

gpt_sovits_infer-0.2.5-py3-none-any.whl (3.4 MB view details)

Uploaded Python 3

File details

Details for the file gpt_sovits_infer-0.2.5.tar.gz.

File metadata

  • Download URL: gpt_sovits_infer-0.2.5.tar.gz
  • Upload date:
  • Size: 3.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: pdm/2.12.4 CPython/3.11.8

File hashes

Hashes for gpt_sovits_infer-0.2.5.tar.gz
Algorithm Hash digest
SHA256 81c5103b67c1b1c44b6637d1d5367b42db66f642c8b2af64554c0e4e15206f30
MD5 5bad5d2f3c5374887f6786f9184d7007
BLAKE2b-256 59a49c5b8f716273d6e9380aed9aec9e9d333e50f9018517f31fca8097c9e6ac

See more details on using hashes here.

File details

Details for the file gpt_sovits_infer-0.2.5-py3-none-any.whl.

File metadata

File hashes

Hashes for gpt_sovits_infer-0.2.5-py3-none-any.whl
Algorithm Hash digest
SHA256 e0c7a2a5c2882286ef09e109cfcea7f4216414843cd3148dc892880a5bfe7591
MD5 85778d022bba70393cf7f9374a254a8b
BLAKE2b-256 27180fc17c56ae8934c9539e21b7edee238b083ada0f5c4ab9abc44eea954d08

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page