Silero plugin for GetStream
Silero Voice Activity Detection Plugin
A fast and accurate Voice Activity Detection (VAD) plugin for GetStream that uses the Silero VAD model.
Installation
pip install getstream-plugins-silero
Usage
from getstream.plugins.silero import SileroVAD
from getstream.video.rtc.track_util import PcmData

# Initialize with default settings
vad = SileroVAD()

# Or customize parameters
vad = SileroVAD(
    sample_rate=16000,
    frame_size=512,
    silence_threshold=0.3,
    speech_pad_ms=300,
    min_speech_ms=250,
    max_speech_ms=60000,
)

# Register event handlers
@vad.on("audio")
async def on_audio(pcm_data, user):
    print(f"Detected speech: {pcm_data.duration:.2f} seconds")
    # Process the detected speech with an STT engine
    # await stt.process_audio(pcm_data)

# Process incoming audio (the awaits below must run inside an async function;
# audio_bytes is the raw 16-bit PCM you received)
incoming_audio = PcmData(samples=audio_bytes, sample_rate=16000, format="s16")
await vad.process_audio(incoming_audio)

# Reset state if needed
await vad.reset()
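The usage snippet above leaves audio_bytes undefined. For local experiments, a buffer in the same shape (mono, signed 16-bit, 16000 Hz) can be produced with the standard library alone. The sketch below is only an illustration: make_s16_tone is a hypothetical helper, not part of the plugin, and it assumes PcmData accepts raw little-endian s16 bytes, as the snippet suggests. A pure tone is not speech, so the VAD will most likely not emit an "audio" event for it; the buffer is useful for checking that the pipeline runs, not for testing detection.

```python
import math
import struct


def make_s16_tone(duration_s: float, sample_rate: int = 16000, freq_hz: float = 440.0) -> bytes:
    """Return mono signed 16-bit little-endian PCM for a sine tone.

    Hypothetical helper for experiments; not part of getstream-plugins-silero.
    """
    n_samples = int(duration_s * sample_rate)
    amplitude = 0.3 * 32767  # stay well below full scale to avoid clipping
    samples = [
        int(amplitude * math.sin(2 * math.pi * freq_hz * i / sample_rate))
        for i in range(n_samples)
    ]
    # "<" = little-endian, "h" = signed 16-bit integer
    return struct.pack(f"<{n_samples}h", *samples)


audio_bytes = make_s16_tone(0.5)
print(len(audio_bytes))  # 16000 (8000 samples x 2 bytes each)
```

The resulting audio_bytes can be passed as the samples argument in the snippet above.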
Configuration Options
- sample_rate: Audio sample rate in Hz (default: 16000)
- frame_size: Size of audio frames to process (default: 512)
- silence_threshold: Threshold for detecting silence (0.0 to 1.0) (default: 0.5)
- speech_pad_ms: Number of milliseconds to pad before/after speech (default: 300)
- min_speech_ms: Minimum milliseconds of speech to emit (default: 250)
- max_speech_ms: Maximum milliseconds of speech before forced flush (default: 30000)
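The millisecond options are easier to reason about once they are expressed in frames. The sketch below does that arithmetic for the default values. It assumes frame_size is measured in samples and rounds partial frames up; both are assumptions made for illustration, and the plugin may round differently internally. The helper names are hypothetical.

```python
import math


def frame_duration_ms(sample_rate: int = 16000, frame_size: int = 512) -> float:
    """Duration of one frame in milliseconds (assumes frame_size is in samples)."""
    return 1000.0 * frame_size / sample_rate


def ms_to_frames(ms: int, sample_rate: int = 16000, frame_size: int = 512) -> int:
    """Number of whole frames needed to cover ms milliseconds (rounded up)."""
    return math.ceil(ms / frame_duration_ms(sample_rate, frame_size))


print(frame_duration_ms())   # 32.0  ms per frame at the defaults
print(ms_to_frames(300))     # 10    frames for speech_pad_ms
print(ms_to_frames(250))     # 8     frames for min_speech_ms
print(ms_to_frames(30000))   # 938   frames for max_speech_ms
```

At the defaults, then, a segment shorter than roughly 8 frames would fall below min_speech_ms, and the forced flush at max_speech_ms corresponds to about 30 seconds of continuous speech.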
Download files
Source Distributions
No source distribution files available for this release.
Built Distribution
File details
Details for the file getstream_plugins_silero-0.1.0-py3-none-any.whl.
File metadata
- Download URL: getstream_plugins_silero-0.1.0-py3-none-any.whl
- Upload date:
- Size: 17.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 1918485fa91d4e86f4fac24732926124b61791d2ddf69bc0a661a2be50e97538 |
| MD5 | bfb9e005f2d144a4cd5f5ec82e542f5f |
| BLAKE2b-256 | 3d1736bd2716c1a5329f3d13fd1c29c77a5b108fe42eb19ede5673afcc9e3444 |
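If the wheel is downloaded manually, its SHA256 digest can be compared against the value listed above before installing. The following is a minimal sketch using only the standard library; sha256_of_file and wheel_matches are hypothetical helper names, and the wheel is assumed to sit in the current directory under its published file name.

```python
import hashlib
from pathlib import Path

# SHA256 digest published for getstream_plugins_silero-0.1.0-py3-none-any.whl
EXPECTED_SHA256 = "1918485fa91d4e86f4fac24732926124b61791d2ddf69bc0a661a2be50e97538"


def sha256_of_file(path, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def wheel_matches(path, expected: str = EXPECTED_SHA256) -> bool:
    """True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected.lower()


# Assumed location: the wheel was downloaded into the current directory.
wheel = Path("getstream_plugins_silero-0.1.0-py3-none-any.whl")
if wheel.exists():
    print("hash OK" if wheel_matches(wheel) else "hash MISMATCH")
```

Reading in chunks keeps memory use constant regardless of file size, which matters little for a 17.6 kB wheel but makes the helper reusable for larger artifacts.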