Inference code for GPT-SoVITS

GPT-SoVITS-Infer

This package provides developer-friendly inference code for GPT-SoVITS.

Usage Example

Check out the example notebook for a quick start, or open it in Colab.

Prepare the environment

As we all know, the dependencies of an AI project are always a mess. Here is how I prepare the environment for this project:

Conda (Linux):

conda install python=3.10
conda install pytorch=2.1 torchvision torchaudio pytorch-lightning pytorch-cuda=12.1 -c pytorch -c nvidia
conda install ffmpeg=6.1.1 -c conda-forge

macOS:

brew install ffmpeg
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu
pip3 install pytorch-lightning
pip3 install GPT-SoVITS-Infer

You can also try preparing the environment with the CPU-only options. That should work, but I have not tested it yet.

After the environment is ready, you can install the package by pip:

pip install GPT-SoVITS-Infer

The torch-related packages are not declared as dependencies of GPT-SoVITS-Infer, so pip will not install them for you. If things go wrong, first check that the environment is set up as described above.
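One way to check the environment is to ask Python whether the torch-related modules can be found at all. The snippet below is a minimal sketch, not part of GPT-SoVITS-Infer. The list of import names is an assumption based on the install commands above (for example, pytorch-lightning is imported as pytorch_lightning), and the helper name missing_modules is made up for illustration.

```python
import importlib.util

# Import names assumed from the install commands above; adjust as needed.
REQUIRED_MODULES = ["torch", "torchvision", "torchaudio", "pytorch_lightning"]


def missing_modules(names):
    """Return the names in `names` that cannot be found in this environment."""
    # find_spec returns None for a top-level module that is not installed.
    return [name for name in names if importlib.util.find_spec(name) is None]


if __name__ == "__main__":
    missing = missing_modules(REQUIRED_MODULES)
    if missing:
        print("Missing modules:", ", ".join(missing))
    else:
        print("All torch-related modules were found.")
```

This only confirms that the modules are installed. It does not verify versions or that CUDA is usable.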

Advanced Usage

  • GPTSoVITSInference.load_sovits and GPTSoVITSInference.load_gpt: Load your own fine-tuned SoVITS and GPT models.
  • GPTSoVITSInference.set_prompt_audio: Set the prompt audio used for inference.
  • GPTSoVITSInference.get_tts_wav_stream: Return a generator that yields pieces of the generated audio. Generation runs in a background thread, so you can consume the pieces while the rest of the audio is still being generated.
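The streaming behaviour described for get_tts_wav_stream follows a common pattern: a background thread produces pieces and a generator hands them to the caller through a queue. The sketch below illustrates that general pattern only. It is not the library's implementation, and stream_chunks and fake_synthesizer are hypothetical names, with fake_synthesizer standing in for a real model.

```python
import queue
import threading


def stream_chunks(produce_chunks, max_buffered=8):
    """Yield chunks from `produce_chunks()` while it runs in a background thread."""
    buffer = queue.Queue(maxsize=max_buffered)
    done = object()  # sentinel marking the end of the stream

    def worker():
        try:
            for chunk in produce_chunks():
                buffer.put(chunk)  # blocks when the buffer is full
        except Exception as exc:
            buffer.put(exc)  # forward errors to the consumer
        finally:
            buffer.put(done)

    threading.Thread(target=worker, daemon=True).start()

    while True:
        item = buffer.get()
        if item is done:
            return
        if isinstance(item, Exception):
            raise item
        yield item


def fake_synthesizer():
    """Hypothetical stand-in for a TTS model: emits three short byte chunks."""
    for i in range(3):
        yield bytes([i]) * 4


if __name__ == "__main__":
    for chunk in stream_chunks(fake_synthesizer):
        print(len(chunk), "bytes received")
```

The bounded queue keeps the producer from running far ahead of a slow consumer. If the consumer stops iterating early, the worker may stay blocked on a full queue, which is why the thread is marked as a daemon in this sketch.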
