Skip to main content

Fish Speech 1.5 TTS plugin for VOCO audio inference runtime

Project description

voco-fishspeech

Fish Speech 1.5 TTS plugin for VOCO audio inference runtime.

Installation

cd plugins/voco-fishspeech
pip install -e .

Requirements

  • Fish Speech 1.5 checkpoints should be available at the configured checkpoint path
  • All Fish Speech code is bundled inside the plugin (no external dependencies needed)

Usage

from voco import Voco

# Initialize with Fish Speech
voco = Voco(model="fishspeech", device="cuda", checkpoint_path="checkpoints/fish-speech-1.5")

# Generate audio
for audio_chunk in voco.generate(
    text="Hello, this is a test of Fish Speech.",
    reference_audio="path/to/reference.wav",
    reference_text="reference audio",
    temperature=0.7,
    top_p=0.7,
    repetition_penalty=1.1
):
    # Process audio chunk
    pass

Configuration

  • checkpoint_path: Path to Fish Speech 1.5 checkpoints (default: "checkpoints/fish-speech-1.5")
  • device: Device to run inference on (default: "cuda")
  • dtype: Data type for inference (default: "bfloat16")
  • compile: Whether to compile the model with torch.compile (default: True)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

voco_fishspeech-0.0.3.tar.gz (45.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

voco_fishspeech-0.0.3-py3-none-any.whl (55.7 kB view details)

Uploaded Python 3

File details

Details for the file voco_fishspeech-0.0.3.tar.gz.

File metadata

  • Download URL: voco_fishspeech-0.0.3.tar.gz
  • Upload date:
  • Size: 45.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for voco_fishspeech-0.0.3.tar.gz
Algorithm Hash digest
SHA256 eeeaab4b006927ef4947b68bee440e640a509bdaa125e4d19ce01d31de75024f
MD5 222020e7249a6f9b80a528c5c76be1b3
BLAKE2b-256 4340b34a77d20d9433faf4cd79f5e2ba7f9a973987952968c4c882934bff5dde

See more details on using hashes here.

File details

Details for the file voco_fishspeech-0.0.3-py3-none-any.whl.

File metadata

File hashes

Hashes for voco_fishspeech-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 a8d8b96c0aac3f5dc8dcf09664e10b50dc483e2a3916294abec1cbae4cdcdd59
MD5 20f4bfb1ecc87d85ee0ed7fe5963e6ec
BLAKE2b-256 12d20efef0458a83f6d137764f4ebcdfd1da1e987f92375bb441c2d314f5d447

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page