Skip to main content

Silero plugin for GetStream

Project description

Silero Voice Activity Detection Plugin

A fast and accurate Voice Activity Detection (VAD) plugin for GetStream that uses the Silero VAD model.

Installation

pip install getstream-plugins-silero

Usage

from getstream.plugins.silero import SileroVAD
from getstream.video.rtc.track_util import PcmData

# Initialize with default settings
vad = SileroVAD()

# Or customize parameters
vad = SileroVAD(
    sample_rate=16000,
    frame_size=512,
    silence_threshold=0.3,
    speech_pad_ms=300,
    min_speech_ms=250,
    max_speech_ms=60000,
)

# Register event handlers
@vad.on("audio")
async def on_audio(pcm_data, user):
    print(f"Detected speech: {pcm_data.duration:.2f} seconds")
    # Process the detected speech with an STT engine
    # await stt.process_audio(pcm_data)

# Process incoming audio
incoming_audio = PcmData(samples=audio_bytes, sample_rate=16000, format="s16")
await vad.process_audio(incoming_audio)

# Reset state if needed
await vad.reset()

Configuration Options

  • sample_rate: Audio sample rate in Hz (default: 16000)
  • frame_size: Size of audio frames to process (default: 512)
  • silence_threshold: Threshold for detecting silence (0.0 to 1.0) (default: 0.5)
  • speech_pad_ms: Number of milliseconds to pad before/after speech (default: 300)
  • min_speech_ms: Minimum milliseconds of speech to emit (default: 250)
  • max_speech_ms: Maximum milliseconds of speech before forced flush (default: 30000)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vision_agents_plugins_silero-0.1.8.tar.gz (16.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

vision_agents_plugins_silero-0.1.8-py3-none-any.whl (22.1 kB view details)

Uploaded Python 3

File details

Details for the file vision_agents_plugins_silero-0.1.8.tar.gz.

File metadata

File hashes

Hashes for vision_agents_plugins_silero-0.1.8.tar.gz
Algorithm Hash digest
SHA256 a5966e4fb255042fcd076ad79f9316aa70be033d5b197fb898a4b1ee2c717e3b
MD5 5bc4973ef0669cd439cd9d922eccc2fc
BLAKE2b-256 38c014b097024d90b6c8c34dbad44f7fe5cb9901f4f122c092ac6ecd967f785a

See more details on using hashes here.

File details

Details for the file vision_agents_plugins_silero-0.1.8-py3-none-any.whl.

File metadata

File hashes

Hashes for vision_agents_plugins_silero-0.1.8-py3-none-any.whl
Algorithm Hash digest
SHA256 7f822936f808f9c131638330096298465548a980bc2a842ece150e7ac0c3c7ca
MD5 3fbed60c992003f9c644a7b575ef8e66
BLAKE2b-256 96c59aa5b34b9e11031a82d32de68f9f47844758c2105fd73c4964d0aa38770b

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page