Skip to main content

Whisper Turbo in MLX

Project description

WTM (Whisper Turbo MLX)

This repository provides a fast implementation of the Whisper model using MLX, designed for efficient audio transcription.

Alt text

Features

  • Fast Audio Transcription: Optimized for quick processing.
  • Simplified Architecture: Focus on ease of use with minimal decoding strategies.
  • Custom Tokenizer: Supports multilingual tokenization.

Installation

# Quick install (note: PyPI version may not always be up to date)
brew install ffmpeg
pip install whisper-turbo-mlx

# For the latest version, you can install directly from the repository:
# git clone https://github.com/JosefAlbers/whisper-turbo-mlx.git
# cd whisper-turbo-mlx
# pip install -e .

Usage

To transcribe an audio file, call the transcribe function:

transcribe 'test.wav'

Contributing

Contributions are welcome! Feel free to submit issues or pull requests.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

whisper_turbo_mlx-0.0.1rc3.tar.gz (377.5 kB view details)

Uploaded Source

Built Distribution

whisper_turbo_mlx-0.0.1rc3-py3-none-any.whl (377.9 kB view details)

Uploaded Python 3

File details

Details for the file whisper_turbo_mlx-0.0.1rc3.tar.gz.

File metadata

  • Download URL: whisper_turbo_mlx-0.0.1rc3.tar.gz
  • Upload date:
  • Size: 377.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.0 CPython/3.12.3

File hashes

Hashes for whisper_turbo_mlx-0.0.1rc3.tar.gz
Algorithm Hash digest
SHA256 c195503c4c68c465707111aa18558b82f38e509b6d69d4138100667b5771e135
MD5 62716e2fbd2348a4c9dfbb9d1c3f8b64
BLAKE2b-256 9f4bdc862655535da6962711e614bd2273565af5e35339039be55d0724bebe72

See more details on using hashes here.

File details

Details for the file whisper_turbo_mlx-0.0.1rc3-py3-none-any.whl.

File metadata

File hashes

Hashes for whisper_turbo_mlx-0.0.1rc3-py3-none-any.whl
Algorithm Hash digest
SHA256 9c23be2424381afa587f8f66445fd67fdbe740a94c438e9e27a9b434073d62d9
MD5 dc02e9f8902009ffb56cca7ba5e3c9c5
BLAKE2b-256 9bc7d76a32e275af7099003122785e6a41488863b69d877bb0e9874962b8daad

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page