Audio transcription CLI using MLX Whisper
Project description
transcribe-mlx
Local audio transcription to JSON using MLX Whisper. Runs on Apple Silicon.
Install
pip install transcribe-mlx
Usage
transcribe-mlx "audio.mp3" "output.json"
transcribe-mlx "audio.mp3" "output.json" --model mlx-community/whisper-large-v3-mlx
Output is saved as a JSON file with segments, words, and timestamps.
First run: the model (~1–3 GB) is downloaded automatically from HuggingFace and cached locally. Subsequent runs use the cache.
Arguments
| Argument | Description |
|---|---|
audio |
Path to the input audio file |
output |
Path where the output JSON will be saved |
Options
| Flag | Default | Description |
|---|---|---|
--model / -m |
mlx-community/whisper-large-v3-turbo |
HuggingFace repo or local path to the Whisper MLX model |
Requirements
- Python 3.11+
- Apple Silicon (MLX requires Metal)
- FFmpeg (
brew install ffmpeg)
Development
git clone https://github.com/felipeinfantino/transcribe-mlx
cd transcribe-mlx
poetry install
poetry run transcribe-mlx "inputs/audio.mp3" "outputs/result.json"
Extending with a new package
poetry add [packagename]
# Check the CLI is still working
poetry run transcribe-mlx "inputs/audio.mp3" "outputs/result.json"
# Verify lockfile is clean
poetry lock
# Bump version
poetry version patch # or minor / major
Git build versioning
Let x.y.z be the new bumped version:
git tag v[x.y.z]
git push origin main --tags
License
MIT
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file transcribe_mlx-0.1.1.tar.gz.
File metadata
- Download URL: transcribe_mlx-0.1.1.tar.gz
- Upload date:
- Size: 2.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.3.2 CPython/3.11.14 Darwin/24.3.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5c075a74015f27fdc9b3980bc8c62e25f56350c4d13202893d007c6fc680360b
|
|
| MD5 |
cac68b49fbd9fae94dde812891ab15b6
|
|
| BLAKE2b-256 |
8567cbacc1f5f303e34baf4826288a532df84db0251f4f05e818343298c00bd0
|
File details
Details for the file transcribe_mlx-0.1.1-py3-none-any.whl.
File metadata
- Download URL: transcribe_mlx-0.1.1-py3-none-any.whl
- Upload date:
- Size: 3.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/2.3.2 CPython/3.11.14 Darwin/24.3.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c7f6795ddaadfc868a57bd6b9fa417d41a5d55bc329411640ad4d21607a9f5e0
|
|
| MD5 |
0a52cbbcce4ac7ae9d6b29f422f7eefe
|
|
| BLAKE2b-256 |
959c2ec61784f25ceab36eff4afe4365622dcda8303c4fb054a0473123f11479
|