
Agent Framework Plugin for TEN VAD


LiveKit Plugins – TEN VAD

livekit-plugins-tenvad integrates the TEN-framework/ten-vad voice activity detector (VAD) into the LiveKit ecosystem as a plugin.

This plugin enables real-time speech activity detection with low-latency inference, optimized for streaming, conversational AI, and livekit-agents integration.

✨ Features

  • 🔌 LiveKit plugin integration — plug-and-play support for LiveKit workflows
  • 🤖 Compatible with livekit-agents — extend agents with real-time VAD capabilities
  • 🎤 Accurate voice activity detection powered by TEN VAD
  • Low-latency inference (~0.17ms avg per frame) suitable for real-time use
  • 📊 Benchmarked against Silero VAD (faster per frame, with longer continuous speech segments)
  • 🛠️ Configurable & extensible within the LiveKit plugin system

🔧 Installation

# from PyPI
uv pip install livekit-plugins-tenvad

# from source
uv pip install git+https://github.com/dangvansam/livekit-plugins-tenvad.git

🔌 Usage

from livekit.plugins import tenvad

vad = tenvad.VAD.load(
    activation_threshold=0.5,
    min_silence_duration=0.3,
    min_speech_duration=0.15,
    max_buffered_speech=30,
    prefix_padding_duration=0.1,
    padding_duration=0.1
)
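To make the thresholds above more concrete, here is a minimal sketch of how per-frame speech probabilities are commonly turned into speech segments. It assumes that activation_threshold, min_speech_duration, and min_silence_duration mean what their names suggest. The function segment_speech and the frame_duration parameter are hypothetical and are not part of the plugin's API. The plugin's actual logic may differ, and padding and buffering are not modelled here.

```python
import math


def segment_speech(
    probs,
    frame_duration=0.016,        # assumed hop between frames, in seconds
    activation_threshold=0.5,
    min_speech_duration=0.15,
    min_silence_duration=0.3,
):
    """Return speech segments as (start_frame, end_frame) pairs, end exclusive.

    Illustrative only: a common thresholding scheme, not the plugin's code.
    """
    # Convert durations to whole frames (small epsilon guards float rounding).
    min_speech = max(1, math.ceil(min_speech_duration / frame_duration - 1e-9))
    min_silence = max(1, math.ceil(min_silence_duration / frame_duration - 1e-9))

    segments = []
    in_speech = False
    run_start = None      # first frame of the current voiced run while idle
    speech_start = 0
    silence_frames = 0    # consecutive unvoiced frames while in speech

    for i, p in enumerate(probs):
        voiced = p >= activation_threshold
        if not in_speech:
            if voiced:
                if run_start is None:
                    run_start = i
                # Speech starts only after enough consecutive voiced frames.
                if i - run_start + 1 >= min_speech:
                    in_speech = True
                    speech_start = run_start
                    silence_frames = 0
            else:
                run_start = None
        elif voiced:
            silence_frames = 0
        else:
            silence_frames += 1
            # Speech ends only after enough consecutive unvoiced frames.
            if silence_frames >= min_silence:
                segments.append((speech_start, i - silence_frames + 1))
                in_speech = False
                run_start = None

    if in_speech:  # audio ended while still in speech
        segments.append((speech_start, len(probs) - silence_frames))
    return segments


# 100 ms frames: speech needs 2 voiced frames, an end needs 3 unvoiced frames.
probs = [0.1, 0.9, 0.1, 0.8, 0.9, 0.7, 0.2, 0.9, 0.1, 0.1, 0.1, 0.9]
segments = segment_speech(
    probs,
    frame_duration=0.1,
    min_speech_duration=0.2,
    min_silence_duration=0.3,
)
print(segments)  # [(3, 8)]
```

In this example the single voiced frame at index 1 is too short to count as speech, and the one-frame dip at index 6 is too short to end the segment. A larger min_silence_duration therefore yields fewer, longer segments.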

📊 Run Benchmark

git clone https://github.com/dangvansam/livekit-plugins-tenvad.git

cd livekit-plugins-tenvad

# install dependencies for testing
uv pip install ".[test]"

python test/benchmark.py test/sample.wav outputs silero,ten
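The benchmark reports average, minimum, and maximum inference time per frame. How test/benchmark.py measures these is not shown here, so the following is only a sketch of one common approach, using a stand-in detector. The names time_per_frame and dummy_vad are hypothetical.

```python
import statistics
import time


def time_per_frame(process_frame, frames):
    """Return (avg, min, max) wall-clock time per call, in milliseconds.

    `frames` must be non-empty.
    """
    timings_ms = []
    for frame in frames:
        t0 = time.perf_counter()
        process_frame(frame)
        timings_ms.append((time.perf_counter() - t0) * 1000.0)
    return statistics.fmean(timings_ms), min(timings_ms), max(timings_ms)


def dummy_vad(frame):
    """Stand-in for a real detector: mean absolute amplitude scaled to 0..1."""
    return sum(abs(s) for s in frame) / (len(frame) * 32768.0)


# Synthetic 16-bit style frames of 256 samples each (assumed frame size).
frames = [[(i * 37 + j) % 2000 - 1000 for j in range(256)] for i in range(200)]
avg_ms, min_ms, max_ms = time_per_frame(dummy_vad, frames)
print(f"avg={avg_ms:.3f}ms min={min_ms:.3f}ms max={max_ms:.3f}ms")
```

Timing each call separately, rather than timing the whole loop, is what makes the minimum and maximum available. The maximum is often dominated by a warm-up call or a scheduling hiccup.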

Benchmark Results

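| Metric             | Silero VAD      | TEN VAD          |
|--------------------|-----------------|------------------|
| Speech segments    | 95              | 41               |
| Total speech       | 19.01 s (13.0%) | 114.98 s (78.8%) |
| Avg inference time | 0.22 ms         | 0.17 ms          |
| Min inference time | 0.18 ms         | 0.14 ms          |
| Max inference time | 9.76 ms         | 0.78 ms          |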

Highlights:

  • TEN VAD is ~1.27× faster per frame
  • Detects longer continuous speech than Silero VAD (78.8% vs. 13.0% of the audio marked as speech)
  • Lower worst-case latency (max 0.78 ms vs. 9.76 ms) with fewer segment splits (41 vs. 95 segments)
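The highlights can be checked against the table with a little arithmetic. The sketch below uses only the rounded values printed in the table. With those, the average speedup works out to about 1.29×, so the quoted ~1.27× presumably comes from unrounded timings (an assumption, since the raw numbers are not given).

```python
# Values copied from the benchmark table (rounded as printed).
silero = {"segments": 95, "speech_s": 19.01, "speech_pct": 13.0, "avg_ms": 0.22}
ten = {"segments": 41, "speech_s": 114.98, "speech_pct": 78.8, "avg_ms": 0.17}

# Per-frame speedup of TEN VAD over Silero VAD, from average inference time.
speedup = silero["avg_ms"] / ten["avg_ms"]

# Audio length implied by each row: total speech / speech fraction.
audio_s_silero = silero["speech_s"] / (silero["speech_pct"] / 100)
audio_s_ten = ten["speech_s"] / (ten["speech_pct"] / 100)

# Mean length of a detected speech segment.
mean_seg_silero = silero["speech_s"] / silero["segments"]
mean_seg_ten = ten["speech_s"] / ten["segments"]

print(f"speedup: {speedup:.2f}x")
print(f"implied audio length: {audio_s_silero:.1f}s / {audio_s_ten:.1f}s")
print(f"mean segment: {mean_seg_silero:.2f}s vs {mean_seg_ten:.2f}s")
```

Both rows imply the same audio length of roughly 146 s, which confirms that the two detectors ran on the same file. The mean segment length is about 0.2 s for Silero VAD and about 2.8 s for TEN VAD.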

Visualizations

Long audio

[Figure: TEN VAD benchmark, long audio]

Short audio

[Figure: TEN VAD benchmark, short audio]

Citations

@misc{TEN_VAD,
  author       = {TEN Team},
  title        = {TEN VAD: A Low-Latency, Lightweight and High-Performance Streaming Voice Activity Detector (VAD)},
  year         = {2025},
  publisher    = {GitHub},
  journal      = {GitHub repository},
  howpublished = {\url{https://github.com/TEN-framework/ten-vad.git}},
  email        = {developer@ten.ai}
}
