Fully open-source and state-of-the-art Voice Activity Detection (VAD) models for academic research and commercial applications.
Project description
Fully Open-Source Voice Activity Detection (VAD) for Real-Time Speech Applications
Voice Activity Detection (VAD) is a critical first step in any application involving speech recognition. However, while exploring real-time voice chat agents, I found that many state-of-the-art (SoTA) models are not truly open-source—they provide only open weights, limiting transparency and hindering research and development.
This repository aims to change that by providing a fully open and research-friendly implementation of the Silero VAD model. The goal is to advance the state of the art in VAD through open experimentation, training, and integration.
Status
As of May 27, 2025, this repository includes:
✅ A complete implementation of the Silero VAD model for research use
Roadmap
In the near future, I plan to add the following:
🧠 Code to train Silero VAD from scratch on custom datasets
📊 Evaluation scripts for standard VAD benchmarks
🔧 Support for LoRA fine-tuning to extend or adapt Silero VAD
🔌 Example integrations with Python, client-side web applications, and Unity
Instructions
Install the package in editable mode:
pip install --editable .
License
This project is released under the Creative Commons Attribution-ShareAlike 4.0 International License (CC BY-SA 4.0), encouraging both academic research and commercial application.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file open_voice_activity_detection-0.0.1.tar.gz.
File metadata
- Download URL: open_voice_activity_detection-0.0.1.tar.gz
- Upload date:
- Size: 9.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.28.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
bbc1604a463067e76828009bc4bd814549b18d0cf864009c1bed13e0ca98614a
|
|
| MD5 |
e9bdb692f236c6d46a5af3102d5e7b8c
|
|
| BLAKE2b-256 |
a444ac621405764e6a49ab08322e8274cfa27a91514deeef463ece66489a0fd7
|
File details
Details for the file open_voice_activity_detection-0.0.1-py3-none-any.whl.
File metadata
- Download URL: open_voice_activity_detection-0.0.1-py3-none-any.whl
- Upload date:
- Size: 12.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: python-httpx/0.28.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
919ee61f1b1849069cf943bb52f57da6c72743466f87029fd16a129d63ec1c12
|
|
| MD5 |
94e710e9167a38f1f886d35782eaf601
|
|
| BLAKE2b-256 |
d415d508ff4df50e121e4d23db7141361565b6e51d6eb03c77dddd459f24c204
|