with Apple MPS support for OpenAI Whisper
whisper-mps
An opinionated CLI to transcribe audio files (or YouTube videos) with Whisper on-device! Powered by MLX, Whisper, and Apple M-series chips.
TL;DR: In our own testing, Whisper running on MPS achieves speeds comparable to an RTX 4090!
An 80-minute audio file needs only 80 seconds on an Apple M1 Max with 32 GB of memory!
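To put the quoted figures in perspective, the speed-up over real time can be worked out directly. The helper below is a small illustrative sketch; `realtime_factor` is a hypothetical name, not part of whisper-mps.

```python
def realtime_factor(audio_seconds: float, wall_seconds: float) -> float:
    """Return how many seconds of audio are processed per second of wall time."""
    if wall_seconds <= 0:
        raise ValueError("wall_seconds must be positive")
    return audio_seconds / wall_seconds


# The figures quoted above: 80 minutes of audio transcribed in 80 seconds.
factor = realtime_factor(audio_seconds=80 * 60, wall_seconds=80)
print(f"{factor:.0f}x real time")  # 60x real time
```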
🆕 Blazingly fast transcriptions via your terminal! ⚡️
We've added a CLI to enable fast transcriptions. Here's how you can use it:
Install whisper-mps with pip:
# please install ffmpeg first: brew install ffmpeg
pip install whisper-mps
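Before installing, it can help to confirm the two prerequisites mentioned here: an Apple silicon Mac and ffmpeg on the PATH. The following is a minimal sketch using only the Python standard library. `preflight_problems` is a hypothetical helper, and it assumes that `platform.machine()` reports `arm64` on Apple silicon (under Rosetta it may report `x86_64` instead).

```python
import platform
import shutil
from typing import List, Optional


def preflight_problems(system: str, machine: str, ffmpeg_path: Optional[str]) -> List[str]:
    """Return human-readable problems; an empty list means the checks passed."""
    problems = []
    # Assumption: Apple silicon shows up as macOS ("Darwin") on "arm64".
    if system != "Darwin" or machine != "arm64":
        problems.append("whisper-mps targets Apple silicon Macs (macOS on arm64)")
    if ffmpeg_path is None:
        problems.append("ffmpeg not found on PATH; try: brew install ffmpeg")
    return problems


if __name__ == "__main__":
    issues = preflight_problems(
        platform.system(), platform.machine(), shutil.which("ffmpeg")
    )
    for issue in issues:
        print("warning:", issue)
    if not issues:
        print("environment looks ready for: pip install whisper-mps")
```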
Run inference from any path on your computer:
# filetype should be wav/mp3/mp4 etc.
whisper-mps --file-name <filename>
Run inference with a specific model size:
# for example, using the base model size. Other available models: "tiny", "base", "small", "medium", "large".
# Larger models require more loading time.
# filetype should be wav/mp3/mp4 etc.
whisper-mps --file-name <filename> --model-name base
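For more than a handful of files, the same commands can be driven from a script. The sketch below is one possible approach, not part of whisper-mps itself: `build_command` and `transcribe_folder` are hypothetical helpers that use only the flags listed under CLI Options, and the sketch assumes `--output-file-name` accepts a path.

```python
import subprocess
from pathlib import Path
from typing import List, Optional


def build_command(file_name: str, model_name: Optional[str] = None,
                  output_file_name: Optional[str] = None) -> List[str]:
    """Assemble a whisper-mps invocation from the flags documented in this README."""
    cmd = ["whisper-mps", "--file-name", file_name]
    if model_name is not None:
        cmd += ["--model-name", model_name]
    if output_file_name is not None:
        cmd += ["--output-file-name", output_file_name]
    return cmd


def transcribe_folder(folder: str, model_name: str = "base") -> None:
    """Run whisper-mps once per .mp3 file, requesting <stem>.json for each input."""
    for audio in sorted(Path(folder).glob("*.mp3")):
        out = audio.with_suffix(".json")  # assumption: a path is accepted here
        subprocess.run(build_command(str(audio), model_name, str(out)), check=True)
```

Calling `transcribe_folder("recordings")` would process every `.mp3` in that folder in name order and stop at the first failing command, because `check=True` raises on a non-zero exit status.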
Run inference from a YouTube URL on your computer:
[!NOTE] Deprecated: The YouTube URL inference feature is deprecated and may be removed in a future release.
Please use yt-dlp to download YouTube videos instead, then transcribe the downloaded file with --file-name.
# whisper-mps --youtube-url https://www.youtube.com/watch\?v\=jaM02mb6JFM
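The recommended replacement, downloading with yt-dlp and then transcribing the local file, could look roughly like the sketch below. The helper names and the `audio` file stem are hypothetical. The yt-dlp options used (`-x`, `--audio-format`, `-o`) are its standard audio-extraction flags, and the sketch assumes both yt-dlp and ffmpeg are installed.

```python
import subprocess
from typing import List


def build_download_command(url: str, stem: str = "audio") -> List[str]:
    """yt-dlp invocation that extracts the audio track as <stem>.mp3."""
    return [
        "yt-dlp",
        "-x",                     # extract audio only
        "--audio-format", "mp3",  # convert the audio to mp3 (needs ffmpeg)
        "-o", f"{stem}.%(ext)s",  # output template; yt-dlp fills in the extension
        url,
    ]


def download_and_transcribe(url: str, stem: str = "audio") -> None:
    """Download the audio, then pass the resulting mp3 to whisper-mps."""
    subprocess.run(build_download_command(url, stem), check=True)
    subprocess.run(["whisper-mps", "--file-name", f"{stem}.mp3"], check=True)


if __name__ == "__main__":
    download_and_transcribe("https://www.youtube.com/watch?v=jaM02mb6JFM")
```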
[!NOTE] The CLI is highly opinionated and only works on Apple MPS.
CLI Options
The whisper-mps repo aims to provide all-round support for running Whisper in various settings. More command-line options will be added later.
--file-name FILE_NAME
Path or URL to the audio file to be transcribed.
--model-name MODEL_NAME
Name (size) of the OpenAI Whisper model to use, such as tiny (default), base, small, etc.
--youtube-url URL_ADDRESS
The YouTube video URL. (Deprecated)
--output-file-name OUTPUT_FILE_NAME
The name of the JSON output file for the transcribed text.
--log-level LOG_LEVEL
Optional. One of "DEBUG", "INFO", "WARNING", "ERROR", "CRITICAL".
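The layout of the JSON written via --output-file-name is not described here, so a cautious first step is to inspect a result without assuming any schema. The sketch below does that; `describe` and `describe_file` are hypothetical helpers, and `output.json` is only a placeholder for whatever name was passed to --output-file-name.

```python
import json
from typing import Any


def describe(value: Any) -> str:
    """Summarize a parsed JSON value without assuming any particular schema."""
    if isinstance(value, dict):
        return "object with keys: " + ", ".join(sorted(value))
    if isinstance(value, list):
        return f"array of {len(value)} items"
    return f"{type(value).__name__}: {value!r}"


def describe_file(path: str) -> str:
    """Load a JSON file and return a one-line summary of its top level."""
    with open(path, encoding="utf-8") as handle:
        return describe(json.load(handle))


# Example (placeholder file name):
# print(describe_file("output.json"))
```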
Download files
File details
Details for the file whisper_mps-0.0.10.tar.gz.
File metadata
- Download URL: whisper_mps-0.0.10.tar.gz
- Upload date:
- Size: 920.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 7bb703cdb226224e43e920595dd3cc05662c921cf5f0827eb31ee2182ccab024 |
| MD5 | 9c8d4fabf4f019c5cdabb2381f76d64b |
| BLAKE2b-256 | 5216b009317ca1f544e2a6ce2a4e2a5011c9e85cd5cd465c45fa06ba87f55758 |
File details
Details for the file whisper_mps-0.0.10-py3-none-any.whl.
File metadata
- Download URL: whisper_mps-0.0.10-py3-none-any.whl
- Upload date:
- Size: 925.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.7
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 4bdcf0fd7a1456aece18938e0bb704fd258df2a38f6a8450d3282de841be45c7 |
| MD5 | 2aacc4368d5d485fa0e598c2daa5b29f |
| BLAKE2b-256 | a4ef840fa53b45b31fb84583cfd5e56c6c4af5c09c407283d619f8c1adba9054 |