Silero plugin for GetStream

Silero Voice Activity Detection Plugin

A fast and accurate Voice Activity Detection (VAD) plugin for GetStream that uses the Silero VAD model.

Installation

pip install getstream-plugins-silero

Usage

from getstream.plugins.silero import SileroVAD
from getstream.video.rtc.track_util import PcmData

# Initialize with default settings
vad = SileroVAD()

# Or customize parameters
vad = SileroVAD(
    sample_rate=16000,
    frame_size=512,
    silence_threshold=0.3,
    speech_pad_ms=300,
    min_speech_ms=250,
    max_speech_ms=60000,
)

# Register event handlers
@vad.on("audio")
async def on_audio(pcm_data, user):
    print(f"Detected speech: {pcm_data.duration:.2f} seconds")
    # Process the detected speech with an STT engine
    # await stt.process_audio(pcm_data)

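# Process incoming audio. The awaits below must run inside a coroutine;
# audio_bytes stands in for your raw 16 kHz, 16-bit PCM audio.
async def run_vad(audio_bytes):
    incoming_audio = PcmData(samples=audio_bytes, sample_rate=16000, format="s16")
    await vad.process_audio(incoming_audio)

    # Reset state if needed
    await vad.reset()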
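
The usage snippet above passes `audio_bytes` without showing where it comes from. As a minimal sketch, assuming the "s16" format means signed 16-bit mono PCM in native (typically little-endian) byte order, the hypothetical helper below builds such a buffer from a synthetic tone using only the standard library. The name `sine_pcm_s16` is not part of the plugin, and a pure tone is useful for exercising the pipeline but is not expected to be classified as speech.

```python
import array
import math


def sine_pcm_s16(duration_s=1.0, sample_rate=16000, freq_hz=440.0, amplitude=0.5):
    """Return a sine tone as raw signed 16-bit mono PCM bytes (hypothetical helper)."""
    n_samples = int(duration_s * sample_rate)
    # "h" is a signed 16-bit integer on common platforms; bytes use native byte order.
    samples = array.array(
        "h",
        (
            int(amplitude * 32767 * math.sin(2 * math.pi * freq_hz * i / sample_rate))
            for i in range(n_samples)
        ),
    )
    return samples.tobytes()


# One second at 16 kHz, two bytes per sample.
audio_bytes = sine_pcm_s16()
```

In a real application the bytes would come from a microphone, a file, or an incoming call track rather than a generated tone.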

Configuration Options

  • sample_rate: Audio sample rate in Hz (default: 16000)
  • frame_size: Size of each audio frame to process, in samples (default: 512)
  • silence_threshold: Threshold for detecting silence (0.0 to 1.0) (default: 0.5)
  • speech_pad_ms: Padding, in milliseconds, added before and after detected speech (default: 300)
  • min_speech_ms: Minimum duration of speech, in milliseconds, required to emit a segment (default: 250)
  • max_speech_ms: Maximum duration of speech, in milliseconds, before a forced flush (default: 30000)
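
To see how the millisecond settings relate to the frame size, the sketch below converts each one into a frame count. It assumes `frame_size` is measured in samples, which the list above does not state. The helper names are hypothetical and not part of the plugin, and the plugin's own rounding may differ.

```python
import math


def frame_duration_ms(sample_rate=16000, frame_size=512):
    """Duration of one frame in milliseconds, assuming frame_size is in samples."""
    return frame_size * 1000 / sample_rate


def frames_for(duration_ms, sample_rate=16000, frame_size=512):
    """Smallest number of whole frames covering duration_ms (rounding up is an assumption)."""
    return math.ceil(duration_ms / frame_duration_ms(sample_rate, frame_size))


if __name__ == "__main__":
    print(f"frame duration: {frame_duration_ms()} ms")
    for name, ms in [("speech_pad_ms", 300), ("min_speech_ms", 250), ("max_speech_ms", 30000)]:
        print(f"{name}={ms} covers {frames_for(ms)} frames")
```

With the defaults, one frame lasts 32 ms, so the 250 ms minimum corresponds to roughly 8 frames and the 30000 ms maximum to roughly 938.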
