Agent Framework Plugin for TEN VAD

LiveKit Plugins – TEN VAD

livekit-plugins-tenvad integrates the TEN-framework/ten-vad voice activity detector (VAD) into the LiveKit ecosystem as a plugin.

This plugin enables real-time speech activity detection with low-latency inference, optimized for streaming, conversational AI, and livekit-agents integration.

✨ Features

  • 🔌 LiveKit plugin integration — plug-and-play support for LiveKit workflows
  • 🤖 Compatible with livekit-agents — extend agents with real-time VAD capabilities
  • 🎤 Accurate voice activity detection powered by TEN VAD
  • Low-latency inference (~0.17 ms average per frame) suitable for real-time use
  • 📊 Benchmarked against Silero VAD (faster per frame, with longer continuous speech segments)
  • 🛠️ Configurable & extensible within the LiveKit plugin system

🔧 Installation

# from PyPI
uv pip install livekit-plugins-tenvad

# from source
uv pip install git+https://github.com/dangvansam/livekit-plugins-tenvad.git

🔌 Usage

from livekit.plugins import tenvad

vad = tenvad.VAD.load(
    activation_threshold=0.5,
    min_speech_duration=0.1,
    min_silence_duration=0.3,
    max_buffered_speech=60,
    prefix_padding_duration=0.3
)
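To make the threshold and duration parameters above more concrete, here is a minimal, self-contained sketch of how a VAD could turn per-frame speech probabilities into segments. It is an illustration only, not the plugin's implementation: `segment_speech` is a hypothetical helper, the 16 ms frame duration is an assumption, and `max_buffered_speech` and `prefix_padding_duration` are not modeled.

```python
import math


def segment_speech(
    probs,
    frame_duration=0.016,       # assumed frame length in seconds
    activation_threshold=0.5,
    min_speech_duration=0.1,
    min_silence_duration=0.3,
):
    """Return (start_frame, end_frame) pairs, end exclusive (hypothetical helper)."""
    # Convert the duration settings into whole numbers of frames.
    min_speech_frames = math.ceil(min_speech_duration / frame_duration)
    min_silence_frames = math.ceil(min_silence_duration / frame_duration)

    segments = []
    speaking = False
    speech_run = 0   # consecutive frames at or above the threshold
    silence_run = 0  # consecutive frames below the threshold
    start = 0

    for i, p in enumerate(probs):
        if p >= activation_threshold:
            speech_run += 1
            silence_run = 0
            # Speech starts only after enough consecutive speech frames.
            if not speaking and speech_run >= min_speech_frames:
                speaking = True
                start = i + 1 - speech_run
        else:
            silence_run += 1
            speech_run = 0
            # Speech ends only after enough consecutive silent frames.
            if speaking and silence_run >= min_silence_frames:
                speaking = False
                segments.append((start, i + 1 - silence_run))

    if speaking:
        # Close a segment that is still open when the input ends.
        segments.append((start, len(probs) - silence_run))
    return segments


# A 10-frame pause is shorter than min_silence_duration, so it does not split the segment.
probs = [0.1] * 5 + [0.9] * 30 + [0.2] * 10 + [0.9] * 30 + [0.1] * 25
print(segment_speech(probs))
```

In this sketch, raising `min_silence_duration` merges speech across longer pauses, while raising `min_speech_duration` discards short bursts.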

📊 Run Benchmark

git clone https://github.com/dangvansam/livekit-plugins-tenvad.git

cd livekit-plugins-tenvad

# install dependencies for testing
# (quoted so that shells such as zsh do not expand the brackets)
uv pip install ".[test]"

python test/benchmark.py -i test/sample.wav
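For readers who want to see how per-frame timing figures of this kind can be collected, the following is a rough sketch using only the standard library. It is not the repository's test/benchmark.py: `time_per_frame` and `dummy_infer` are hypothetical names, and the stand-in function simply averages a frame instead of running a VAD model.

```python
import statistics
import time


def time_per_frame(infer, frames):
    """Call infer(frame) for each frame and return (avg, min, max) latency in ms."""
    timings_ms = []
    for frame in frames:
        t0 = time.perf_counter()
        infer(frame)
        timings_ms.append((time.perf_counter() - t0) * 1000.0)
    return statistics.fmean(timings_ms), min(timings_ms), max(timings_ms)


def dummy_infer(frame):
    # Stand-in for a real VAD call (assumption): just average the samples.
    return sum(frame) / len(frame)


# 1000 silent frames of 256 samples each (frame size is an assumption).
frames = [[0.0] * 256 for _ in range(1000)]
avg_ms, min_ms, max_ms = time_per_frame(dummy_infer, frames)
print(f"avg={avg_ms:.3f} ms  min={min_ms:.3f} ms  max={max_ms:.3f} ms")
```

Reporting the maximum alongside the average is useful for real-time work, since occasional slow frames matter more than the mean.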

Benchmark Results

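| Metric             | Silero VAD      | TEN VAD          |
|--------------------|-----------------|------------------|
| Speech segments    | 95              | 41               |
| Total speech       | 19.01 s (13.0%) | 114.98 s (78.8%) |
| Avg inference time | 0.22 ms         | 0.17 ms          |
| Min inference time | 0.18 ms         | 0.14 ms          |
| Max inference time | 9.76 ms         | 0.78 ms          |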

Highlights:

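  • TEN VAD is roughly 1.3× faster per frame on average (0.17 ms vs. 0.22 ms), with a much lower worst case (0.78 ms vs. 9.76 ms)
  • Detects longer continuous speech than Silero (114.98 s vs. 19.01 s of total speech on this sample)
  • Splits speech into fewer segments (41 vs. 95)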
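The highlights can be checked directly against the table. The short calculation below uses only the rounded figures reported in the benchmark results, so the derived ratios are approximate; the dictionary layout is just for illustration.

```python
# Rounded values copied from the benchmark table.
silero = {"segments": 95, "speech_s": 19.01, "avg_ms": 0.22}
ten = {"segments": 41, "speech_s": 114.98, "avg_ms": 0.17}

# Ratio of average per-frame inference times.
speedup = silero["avg_ms"] / ten["avg_ms"]

# Mean length of a detected speech segment, in seconds.
mean_len_silero = silero["speech_s"] / silero["segments"]
mean_len_ten = ten["speech_s"] / ten["segments"]

print(f"speedup from rounded averages: {speedup:.2f}x")
print(f"mean segment length, Silero VAD: {mean_len_silero:.2f} s")
print(f"mean segment length, TEN VAD:    {mean_len_ten:.2f} s")
```

On these numbers, the average TEN VAD segment is about 2.80 s long versus about 0.20 s for Silero VAD, which is what "longer continuous speech" means in practice for this sample.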

Visualizations

[Figure: TEN VAD benchmark, long audio]

[Figure: TEN VAD benchmark, short audio]

Citations

@misc{TENVAD,
  author       = {TEN Team},
  title        = {TEN VAD: A Low-Latency, Lightweight and High-Performance Streaming Voice Activity Detector (VAD)},
  year         = {2025},
  publisher    = {GitHub},
  journal      = {GitHub repository},
  howpublished = {\url{https://github.com/TEN-framework/ten-vad.git}},
  email        = {developer@ten.ai}
}
