Skip to main content

with Apple MPS support for OpenAI Whisper

Project description

whisper-mps

image

An opinionated CLI to transcribe Audio files (or YouTube videos) w/ Whisper on-device! Powered by MLX, Whisper & Apple M series

TL;DR - After our actual testing, the Whisper supported by MPS achieves speeds comparable to a 4090!

80 mins audio file only need 80s on APPLE M1 MAX 32G! ONLY 80 SECONDS

🆕 Blazingly fast transcriptions via your terminal! ⚡️

We've added a CLI to enable fast transcriptions. Here's how you can use it:

Install whisper-mps with pip:

# please install ffmpeg first: brew install ffmpeg
pip install whisper-mps

Run inference from any path on your computer:

# filetype should be wav/mp3/mp4 etc.
whisper-mps --file-name <filename>

Run inference with a specific model size:

# for example, using the base model size. Other available models: "tiny", "base", "small", "medium", "large".
# Larger models require more loading time.
# filetype should be wav/mp3/mp4 etc.
whisper-mps --file-name <filename> --model-name base

Run inference from a YouTube URL on your computer:

[!NOTE] Deprecated: The YouTube URL inference feature is deprecated and may be removed in future releases.
Please uses yt-dlp to download YouTube videos.

# whisper-mps --youtube-url https://www.youtube.com/watch\?v\=jaM02mb6JFM

[!NOTE] The CLI is highly opinionated and only works on Apple MPS.

CLI Options

The whisper-mps repo provides all-round support for running Whisper in various settings. More command-line support will be provided later.

  --file-name FILE_NAME
                  Path or URL to the audio file to be transcribed.
  --model-name MODEL_NAME
                  Size of the OPENAI Whisper model name, like tiny (default), base, small, etc.
  --youtube-url URL_ADDRESS
                  The YouTube video URL. (Deprecated)
  --output-file-name OUTPUT_FILE_NAME
                  The output file name for the transcribed text JSON.
  --log-level LOG_LEVEL
                  Optional. ["DEBUG", "INFO", "WARNING", "ERROR", "CRITICAL"]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

whisper_mps-0.0.10.tar.gz (920.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

whisper_mps-0.0.10-py3-none-any.whl (925.1 kB view details)

Uploaded Python 3

File details

Details for the file whisper_mps-0.0.10.tar.gz.

File metadata

  • Download URL: whisper_mps-0.0.10.tar.gz
  • Upload date:
  • Size: 920.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for whisper_mps-0.0.10.tar.gz
Algorithm Hash digest
SHA256 7bb703cdb226224e43e920595dd3cc05662c921cf5f0827eb31ee2182ccab024
MD5 9c8d4fabf4f019c5cdabb2381f76d64b
BLAKE2b-256 5216b009317ca1f544e2a6ce2a4e2a5011c9e85cd5cd465c45fa06ba87f55758

See more details on using hashes here.

File details

Details for the file whisper_mps-0.0.10-py3-none-any.whl.

File metadata

  • Download URL: whisper_mps-0.0.10-py3-none-any.whl
  • Upload date:
  • Size: 925.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for whisper_mps-0.0.10-py3-none-any.whl
Algorithm Hash digest
SHA256 4bdcf0fd7a1456aece18938e0bb704fd258df2a38f6a8450d3282de841be45c7
MD5 2aacc4368d5d485fa0e598c2daa5b29f
BLAKE2b-256 a4ef840fa53b45b31fb84583cfd5e56c6c4af5c09c407283d619f8c1adba9054

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page