Last released Oct 14, 2025
Toolbox for Speech Processing.
Last released Sep 9, 2025
Multi-lingual Automatic Speech Recognition (ASR) based on Whisper models, with accurate word timestamps, access to language detection confidence, several options for Voice Activity Detection (VAD), and more.