voco-fishspeech

Fish Speech 1.5 TTS plugin for VOCO audio inference runtime.

Installation

cd plugins/voco-fishspeech
pip install -e .

Requirements

  • Fish Speech 1.5 checkpoints must be present at the configured checkpoint path (see checkpoint_path under Configuration)
  • All Fish Speech code is bundled inside the plugin, so Fish Speech itself does not need to be installed separately
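
Because a missing checkpoint directory is the most likely setup problem, it can help to check for it before constructing the model. The following is a minimal sketch, not part of the plugin: check_checkpoint_dir is a hypothetical helper, and it only verifies that the directory exists and is non-empty, since the exact checkpoint file names are not listed here.

```python
from pathlib import Path


def check_checkpoint_dir(checkpoint_path: str = "checkpoints/fish-speech-1.5") -> Path:
    """Hypothetical helper: return the checkpoint directory or raise if unusable.

    Only checks that the directory exists and contains at least one entry;
    it does not validate the checkpoint files themselves.
    """
    path = Path(checkpoint_path)
    if not path.is_dir():
        raise FileNotFoundError(f"Checkpoint directory not found: {path}")
    if not any(path.iterdir()):
        raise FileNotFoundError(f"Checkpoint directory is empty: {path}")
    return path
```

Calling check_checkpoint_dir() before creating the Voco instance turns a late model-loading failure into an early, readable error.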

Usage

from voco import Voco

# Initialize VOCO with the Fish Speech plugin
voco = Voco(
    model="fishspeech",
    device="cuda",
    checkpoint_path="checkpoints/fish-speech-1.5",
)

# Generate audio; generate() yields audio chunks one at a time
for audio_chunk in voco.generate(
    text="Hello, this is a test of Fish Speech.",
    reference_audio="path/to/reference.wav",  # reference voice sample
    reference_text="reference audio",  # placeholder text for the reference sample
    temperature=0.7,
    top_p=0.7,
    repetition_penalty=1.1,
):
    # Process audio chunk (for example, play it or append it to a buffer)
    pass
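
The loop above leaves chunk handling open. One common choice is to write the chunks to a WAV file as they arrive. The sketch below uses only the standard library and rests on assumptions that are not stated in this README: each chunk is a 1-D sequence of float samples in [-1.0, 1.0], the audio is mono, and the sample rate is 44100 Hz. write_chunks_to_wav is a hypothetical helper; adjust it to whatever chunk type and sample rate the plugin actually produces.

```python
import struct
import wave
from typing import Iterable, Sequence

SAMPLE_RATE = 44100  # assumption; replace with the rate the plugin actually uses


def write_chunks_to_wav(
    chunks: Iterable[Sequence[float]],
    path: str,
    sample_rate: int = SAMPLE_RATE,
) -> int:
    """Write float chunks to a mono 16-bit PCM WAV file.

    Returns the total number of samples written. Samples outside
    [-1.0, 1.0] are clipped before conversion.
    """
    total = 0
    with wave.open(path, "wb") as wav:
        wav.setnchannels(1)   # mono (assumption)
        wav.setsampwidth(2)   # 16-bit PCM
        wav.setframerate(sample_rate)
        for chunk in chunks:
            clipped = [max(-1.0, min(1.0, float(s))) for s in chunk]
            ints = [int(s * 32767) for s in clipped]
            wav.writeframes(struct.pack(f"<{len(ints)}h", *ints))
            total += len(ints)
    return total


# Usage with the generator from the example above (assumes float chunks):
# write_chunks_to_wav(voco.generate(text="...", reference_audio="..."), "out.wav")
```

Writing each chunk as it arrives keeps memory use flat for long texts, at the cost of not being able to post-process the full waveform before saving.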

Configuration

  • checkpoint_path: Path to Fish Speech 1.5 checkpoints (default: "checkpoints/fish-speech-1.5")
  • device: Device to run inference on (default: "cuda")
  • dtype: Data type for inference (default: "bfloat16")
  • compile: Whether to compile the model with torch.compile (default: True)
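
To keep the four options together in one place, the documented defaults can be mirrored in a small dataclass. This is a sketch only: FishSpeechConfig is a hypothetical helper, not a class shipped with the plugin, and passing dtype and compile as keyword arguments to Voco is an assumption (the Usage example only shows device and checkpoint_path being passed that way).

```python
from dataclasses import asdict, dataclass, replace


@dataclass(frozen=True)
class FishSpeechConfig:
    """Hypothetical container mirroring the defaults listed above."""

    checkpoint_path: str = "checkpoints/fish-speech-1.5"
    device: str = "cuda"
    dtype: str = "bfloat16"
    compile: bool = True


defaults = FishSpeechConfig()

# Override a single option without touching the others, for example
# skipping torch.compile while iterating on a script.
no_compile = replace(defaults, compile=False)

# Assumption: the constructor accepts these names as keyword arguments.
# voco = Voco(model="fishspeech", **asdict(no_compile))
```

Disabling compile generally trades away steady-state speed for a shorter start-up, which can be convenient for quick experiments.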
