Skip to main content

Agent Framework Plugin for TEN VAD

Project description

LiveKit Plugins – TEN VAD

livekit-plugins-tenvad provides seamless integration of the TEN-framework/ten-vad voice activity detection (VAD) plugin into the LiveKit ecosystem.

This plugin enables real-time speech activity detection with low-latency inference, optimized for streaming, conversational AI, and livekit-agents integration.

✨ Features

  • 🔌 LiveKit plugin integration — plug-and-play support for LiveKit workflows
  • 🤖 Compatible with livekit-agents — extend agents with real-time VAD capabilities
  • 🎤 Accurate voice activity detection powered by TEN VAD
  • Low-latency inference (~0.17ms avg per frame) suitable for real-time use
  • 📊 Benchmark validated against Silero VAD (faster and more continuous speech detection)
  • 🛠️ Configurable & extensible within the LiveKit plugin system

🔧 Installation

# from PyPI
uv pip install livekit-plugins-tenvad

# from source
uv pip install git+https://github.com/dangvansam/livekit-plugins-tenvad.git

🔌 Usage

from livekit.plugins import tenvad

vad = tenvad.VAD.load(
    activation_threshold=0.5,
    min_silence_duration=0.3,
    min_speech_duration=0.15,
    max_buffered_speech=30,
    prefix_padding_duration=0.1,
    padding_duration=0.1
)

📊 Run Benchmark

git clone https://github.com/dangvansam/livekit-plugins-tenvad.git

cd livekit-plugins-tenvad

# install dependencies for testing
uv pip install .[test]

python test/benchmark.py test/sample.wav outputs silero,ten

Benchmark Results

Metric Silero VAD TEN VAD
Speech segments 95 41
Total speech 19.01s (13.0%) 114.98s (78.8%)
Avg inference time 0.22ms 0.17ms
Min inference time 0.18ms 0.14ms
Max inference time 9.76ms 0.78ms

Highlights:

  • TEN VAD is ~1.27× faster per frame
  • Detects longer continuous speech compared to Silero
  • Provides lower latency with fewer false segment splits

Visualizations

Long audio

TEN VAD Benchmark

Short audio

TEN VAD Benchmark

Citations

@misc{TEN VAD,
  author       = {TEN Team},
  title        = {TEN VAD: A Low-Latency, Lightweight and High-Performance Streaming Voice Activity Detector (VAD)},
  year         = {2025},
  publisher    = {GitHub},
  journal      = {GitHub repository},
  howpublished = {\url{https://github.com/TEN-framework/ten-vad.git}},
  email        = {developer@ten.ai}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

livekit_plugins_tenvad-1.0.7.tar.gz (1.3 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

livekit_plugins_tenvad-1.0.7-py3-none-any.whl (1.7 MB view details)

Uploaded Python 3

File details

Details for the file livekit_plugins_tenvad-1.0.7.tar.gz.

File metadata

File hashes

Hashes for livekit_plugins_tenvad-1.0.7.tar.gz
Algorithm Hash digest
SHA256 1ca1677480344cf8e5e2d8a15872be8cae8d16a45d491582a855bf421e857620
MD5 00b55df788d736fbfb10cb7502ad4bbf
BLAKE2b-256 7837f23293b332ce50b0b78c3a9b686e13371e620348bf8fade087049ec563cd

See more details on using hashes here.

File details

Details for the file livekit_plugins_tenvad-1.0.7-py3-none-any.whl.

File metadata

File hashes

Hashes for livekit_plugins_tenvad-1.0.7-py3-none-any.whl
Algorithm Hash digest
SHA256 38c2333d2b8d1c8263abc24d16bd1fac6e36e2ff6378d9912eef37ddaab11727
MD5 43e5b29eed4325751aeef7f049862699
BLAKE2b-256 bf5f9b569f56d31ed39be6f3c7eff59e6724d4f35cf673f4f46abf4f6fa442ba

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page