Kokoro TTS integration for Vision Agents

GetStream Kokoro Plugin

This package integrates the open-weight Kokoro-82M TTS model with the GetStream audio/video SDK.

It provides a drop-in KokoroTTS class that implements the common getstream_common.tts.TTS interface, allowing you to stream PCM audio generated by Kokoro directly into a WebRTC AudioStreamTrack.

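import asyncio

from getstream.plugins.kokoro import KokoroTTS
from getstream.video.rtc.audio_track import AudioStreamTrack


async def main():
    # Kokoro outputs 24 kHz audio, so the track must use the same rate.
    track = AudioStreamTrack(framerate=24_000)

    tts = KokoroTTS(lang_code="a", voice="af_heart")
    tts.set_output_track(track)

    # send() is awaited, so it has to run inside an event loop.
    await tts.send("Hello from Kokoro!")


asyncio.run(main())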

Installation

pip install getstream-plugins-kokoro

This pulls in the required kokoro, numpy, and getstream[webrtc] dependencies. You also need espeak-ng at runtime for pronunciation fallback. On macOS you can install it with Homebrew:

brew install espeak-ng
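
Because espeak-ng is a system binary rather than a Python dependency, pip will not report it as missing. The following is a minimal sketch of a startup check using only the standard library. The helper name `find_espeak` is illustrative and not part of the plugin, and looking on PATH is only a heuristic, since a library may locate espeak-ng by other means.

```python
import shutil


def find_espeak(candidates=("espeak-ng",)):
    """Return the path of the first candidate found on PATH, or None."""
    for name in candidates:
        path = shutil.which(name)
        if path is not None:
            return path
    return None


if __name__ == "__main__":
    location = find_espeak()
    if location is None:
        print("espeak-ng not found on PATH; install it before using KokoroTTS")
    else:
        print(f"espeak-ng found at {location}")
```

Running a check like this at application startup surfaces a missing install as a clear message rather than as a failure partway through synthesis.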

Configuration options

| Parameter | Default | Description |
|---|---|---|
| lang_code | "a" | Language group passed to KPipeline ("a" = American English, etc.) |
| voice | "af_heart" | Kokoro voice preset. See the model card for available options. |
| speed | 1.0 | Playback speed multiplier. |
| sample_rate | 24000 | Output sample-rate (fixed by Kokoro). The attached AudioStreamTrack must use the same value. |
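
The sample-rate constraint matters because a track that interprets samples at a different rate plays the audio at the wrong speed and pitch. The arithmetic below is a sketch assuming 16-bit mono PCM and 20 ms frames. These are common choices for WebRTC audio but are not stated by this package, and the function names are illustrative.

```python
SAMPLE_RATE = 24_000  # Hz, the fixed Kokoro output rate listed in the options


def samples_per_frame(sample_rate, frame_ms=20):
    """Number of samples in one frame of the given duration."""
    return sample_rate * frame_ms // 1000


def bytes_per_frame(sample_rate, frame_ms=20, bytes_per_sample=2, channels=1):
    """Frame size in bytes (assumes 16-bit mono PCM by default)."""
    return samples_per_frame(sample_rate, frame_ms) * bytes_per_sample * channels


def check_rates_match(tts_rate, track_rate):
    """Raise early instead of streaming audio at the wrong speed."""
    if tts_rate != track_rate:
        raise ValueError(
            f"TTS outputs {tts_rate} Hz but the track expects {track_rate} Hz"
        )


if __name__ == "__main__":
    print(samples_per_frame(SAMPLE_RATE))   # 480
    print(bytes_per_frame(SAMPLE_RATE))     # 960
    check_rates_match(SAMPLE_RATE, 24_000)  # passes silently
```

A guard like `check_rates_match` is cheap to run once when the track is attached, before any audio is sent.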

Download files

Source Distribution

vision_agents_plugins_kokoro-0.2.1.tar.gz (3.3 kB)

Uploaded Source

Built Distribution

vision_agents_plugins_kokoro-0.2.1-py3-none-any.whl (7.0 kB)

Uploaded Python 3

File details

Details for the file vision_agents_plugins_kokoro-0.2.1.tar.gz.

File hashes

Hashes for vision_agents_plugins_kokoro-0.2.1.tar.gz
Algorithm Hash digest
SHA256 7d6725858984a21a96cbaff5d49b428b812703f25d3dc14162e58111171b41bf
MD5 f31c2f19ed470c86344c857d1be9a070
BLAKE2b-256 530bf3094a33efb51c11ddc3a6df01a0cfef5a709fe4d71dde1207bf796c253d

File details

Details for the file vision_agents_plugins_kokoro-0.2.1-py3-none-any.whl.

File hashes

Hashes for vision_agents_plugins_kokoro-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 3b916789bf252df49721c517f111d55d37f1fe27b4c8a2704acca2af6ac9eb62
MD5 571645ef251383b2025df7e3d3340d7a
BLAKE2b-256 36d7ecdf15122d839d8101b7d2f9d4fb9cda471c9bcc730af51197ebb4c6b753
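
A downloaded file can be checked against the SHA256 digests listed above before it is installed. This is a minimal sketch using the standard library `hashlib` module; the helper names `sha256_of` and `matches` are illustrative.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path, expected_hex):
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of(path) == expected_hex.strip().lower()


# Example call, assuming the sdist has been downloaded to the working directory:
# matches(
#     "vision_agents_plugins_kokoro-0.2.1.tar.gz",
#     "7d6725858984a21a96cbaff5d49b428b812703f25d3dc14162e58111171b41bf",
# )
```

Reading in chunks keeps memory use constant, which matters little for files this small but makes the helper safe to reuse on larger artifacts.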
