Skip to main content

Inference code for GPT-SoVITS

Project description

GPT-SoVITS-Infer

This is the inference code of GPT-SoVITS that can be developer-friendly.

Usage Example

Check out the example notebook for a quick start. Or open it in Colab

Prepare the environment

As we all know, the dependencies of an AI project are always a mess. Here is how I prepare the environment for this project:

Conda (Linux)
conda install python=3.10
conda install pytorch=2.1 torchvision torchaudio pytorch-lightning pytorch-cuda=12.1 -c pytorch -c nvidia 
conda install ffmpeg=6.1.1 -c conda-forge
MacOS
brew install ffmpeg
pip3 install torch torchvision torchaudio --index-url https://download.pytorch.org/whl/cpu
pip3 install pytorch-lightning
pip3 install GPT-SoVITS-Infer

You can also try to prepare the environment with cpu only options, which should work, but I have not tested it yet.

After the environment is ready, you can install the package by pip:

pip install GPT-SoVITS

I do not add the packages related to torch to the dependencies of GPT-SoVITS-Infer. Check if the environment is ready if things go wrong.

Advanced Usage

  • GPTSoVITSInference.load_sovits and GPTSoVITSInference.load_gpt: You can load your own fine-tuned model by the methods.
  • GPTSoVITSInference.set_prompt_audio: Set the prompt audio for the inference.
  • GPTSoVITSInference.get_tts_wav_stream: Return a generator that yields the audio pieces of the generated audio. It will create a background thread to generate the audio, so you can get the audio pieces while the audio is still being generated.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gpt_sovits_infer-0.2.0.tar.gz (3.3 MB view details)

Uploaded Source

Built Distribution

gpt_sovits_infer-0.2.0-py3-none-any.whl (3.4 MB view details)

Uploaded Python 3

File details

Details for the file gpt_sovits_infer-0.2.0.tar.gz.

File metadata

  • Download URL: gpt_sovits_infer-0.2.0.tar.gz
  • Upload date:
  • Size: 3.3 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: pdm/2.12.4 CPython/3.11.8

File hashes

Hashes for gpt_sovits_infer-0.2.0.tar.gz
Algorithm Hash digest
SHA256 193ef96f8bff81202d0a7e91a43875c0bc6a7466fc99159e3d0d1cdeaefe1503
MD5 211d6d72a15ec68c290ca41cbdcdfb00
BLAKE2b-256 7c85177b4a448bd1e22e1dca4f101d8a502e369ddf4e905f443d9ee5a7f69b5d

See more details on using hashes here.

File details

Details for the file gpt_sovits_infer-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for gpt_sovits_infer-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 bbbbd4d4eb864c624b5192f3c32188051a7b094499aff4322e1c28be15b3b7d4
MD5 13771dc1dcc5a8caf52c14b203583bc2
BLAKE2b-256 b6cbfce6a7a9fc1f5e9bc7a28583259b157257282bead280047f0708ebd4517c

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page