Skip to main content

Kokoro TTS integration for Vision Agents

Project description

GetStream Kokoro Plugin

This package integrates the open-weight Kokoro-82M TTS model with the GetStream audio/video SDK.

It provides a drop-in KokoroTTS class that implements the common getstream_common.tts.TTS interface, allowing you to stream PCM audio generated by Kokoro directly into a WebRTC AudioStreamTrack.

from getstream.plugins.kokoro import KokoroTTS
from getstream.video.rtc.audio_track import AudioStreamTrack

track = AudioStreamTrack(framerate=24_000)

tts = KokoroTTS(lang_code="a", voice="af_heart")
tts.set_output_track(track)

async for chunk in tts.send_iter("Hello from Kokoro!"):
    pass

Installation

uv add "vision-agents[kokoro]"
# or directly
uv add vision-agents-plugins-kokoro

This will pull in the required kokoro, numpy and getstream[webrtc] dependencies. You also need espeak-ng **at runtime ** for pronunciation fallback. On macOS you can install it with Homebrew:

brew install espeak-ng

Configuration options

Parameter Default Description
lang_code "a" Language group passed to KPipeline ("a" = American English, etc.)
voice "af_heart" Kokoro voice preset. See the model card for available options.
speed 1.0 Playback speed multiplier.
sample_rate 24000 Output sample-rate (fixed by Kokoro). The attached AudioStreamTrack must use the same value.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_kokoro-0.6.4.tar.gz (3.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vision_agents_plugins_kokoro-0.6.4-py3-none-any.whl (3.5 kB view details)

Uploaded Python 3

File details

Details for the file vision_agents_plugins_kokoro-0.6.4.tar.gz.

File metadata

  • Download URL: vision_agents_plugins_kokoro-0.6.4.tar.gz
  • Upload date:
  • Size: 3.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_kokoro-0.6.4.tar.gz
Algorithm Hash digest
SHA256 da9eebf54b48e47cc98b7eeb7eb04fd720bc7fe2f7e46ef5ca40ce2c4f2ead6f
MD5 4da399bcb4a78fef320d91d63d9dd890
BLAKE2b-256 dd1bd4a1b6c43ec234a7b3541eab234017be9c7b5e61ad3e91cec3493ce9fabd

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_kokoro-0.6.4-py3-none-any.whl.

File metadata

  • Download URL: vision_agents_plugins_kokoro-0.6.4-py3-none-any.whl
  • Upload date:
  • Size: 3.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: uv/0.10.10 {"installer":{"name":"uv","version":"0.10.10","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"macOS","version":null,"id":null,"libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":null}

File hashes

Hashes for vision_agents_plugins_kokoro-0.6.4-py3-none-any.whl
Algorithm Hash digest
SHA256 9c519ceff8a075725da7ed587eab09e5712babd7b643badb7252cb8b12bd1005
MD5 379f0bcacd2498790f43d5810d45d8ac
BLAKE2b-256 3d22e85c69fae48b9ee721d2f7690c8af7ac6421bf742ebe3727fe481445e00d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page