Python 3 library that adds audio transcription support (.wav, .mp3) to the audio-dataset-converter library.

Project description

The audio-dataset-converter-faster-whisper library is an extension to audio-dataset-converter with plugins for transcribing audio files (.wav, .mp3) using the faster-whisper library (https://github.com/SYSTRAN/faster-whisper).
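
As a rough illustration of what such a transcription step involves, the sketch below calls faster-whisper directly. It does not show this library's plugin interface, which is not described here; the helper names `transcribe` and `join_segments`, the default model size "base", and the CPU/int8 settings are assumptions chosen for the example.

```python
from typing import Iterable, List, Tuple

# Hypothetical segment representation for this sketch: (start_sec, end_sec, text)
Segment = Tuple[float, float, str]


def join_segments(segments: Iterable[Segment]) -> str:
    """Concatenate the text of all non-empty segments into one transcript."""
    return " ".join(text.strip() for _, _, text in segments if text.strip())


def transcribe(path: str, model_size: str = "base") -> List[Segment]:
    """Transcribe a .wav or .mp3 file with faster-whisper.

    The model size, device and compute type are assumptions for this
    sketch, not defaults taken from the plugin.
    """
    # Imported lazily so join_segments works without faster-whisper installed.
    from faster_whisper import WhisperModel

    model = WhisperModel(model_size, device="cpu", compute_type="int8")
    # transcribe() returns a lazy generator of segments plus language info.
    segments, _info = model.transcribe(path)
    return [(seg.start, seg.end, seg.text) for seg in segments]


# Example use (the file name is a placeholder):
#   print(join_segments(transcribe("recording.wav")))
```

The segments are materialized into a list because faster-whisper performs the actual decoding only when the generator is consumed.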

Examples can be found here:

https://github.com/waikato-llm/audio-dataset-converter-examples

Changelog

0.1.0 (2025-10-31)

  • switched to kasperl library for base API and generic pipeline plugins

0.0.3 (2025-07-10)

  • dependencies in setup.py now use underscores instead of dashes

0.0.2 (2025-03-14)

  • switched to underscores in project name

  • added support for placeholders: adc-srt

0.0.1 (2024-07-05)

  • initial release

Hashes for audio_dataset_converter_faster_whisper-0.1.0.tar.gz:

Algorithm    Hash digest
SHA256       04da6f6bf26f85086d1b6ed3f48aa9e404b4769a3d5c796343cad43a4971dc83
MD5          6fdb99bc0b74aef4c86eef7fabdc1e8a
BLAKE2b-256  862e409e71ccdc4314bfade34d79141130cc9dc02d1a87214ef37513b0bdd3ae
