Python3 library that adds audio transcription support (.wav, .mp3) to the audio-dataset-converter library.
Project description
The audio-dataset-converter-faster-whisper library is an extension to audio-dataset-converter with plugins for transcribing audio files (.wav, .mp3) using the faster-whisper library (https://github.com/SYSTRAN/faster-whisper).
Examples can be found here:
https://github.com/waikato-llm/audio-dataset-converter-examples
Changelog
0.1.0 (2025-10-31)
switched to kasperl library for base API and generic pipeline plugins
0.0.3 (2025-07-10)
using underscores now instead of dashes in dependencies (setup.py)
0.0.2 (2025-03-14)
switched to underscores in project name
added support for placeholders: adc-srt
0.0.1 (2024-07-05)
initial release
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file audio_dataset_converter_faster_whisper-0.1.0.tar.gz.
File metadata
- Download URL: audio_dataset_converter_faster_whisper-0.1.0.tar.gz
- Upload date:
- Size: 7.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
04da6f6bf26f85086d1b6ed3f48aa9e404b4769a3d5c796343cad43a4971dc83
|
|
| MD5 |
6fdb99bc0b74aef4c86eef7fabdc1e8a
|
|
| BLAKE2b-256 |
862e409e71ccdc4314bfade34d79141130cc9dc02d1a87214ef37513b0bdd3ae
|