MCP server for audio transcription via faster-whisper (local) or OpenAI Whisper API
Project description
whisper-transcribe-mcp
MCP server for audio transcription using faster-whisper (local, free, offline) or OpenAI Whisper API (cloud, requires API key). Works with Claude Desktop and Claude Code on macOS, Windows, and Linux.
Prerequisites
macOS
Option A — uv (recommended):
brew install uv
# or
curl -LsSf https://astral.sh/uv/install.sh | sh
Option B — Python:
Python 3.10+ is included in macOS 12.3+. You can also install it with brew install python.
Windows
Option A — uv (recommended):
winget install astral-sh.uv
Or download the installer from astral.sh/uv.
Option B — Python: Download Python 3.10+ from python.org. During installation, check "Add Python to PATH".
No need to install ffmpeg or any compiler — everything is bundled in the package.
Linux
Option A — uv (recommended):
curl -LsSf https://astral.sh/uv/install.sh | sh
Option B — Python:
# Debian/Ubuntu
sudo apt install python3.12 python3.12-venv
# Fedora
sudo dnf install python3.12
# Arch
sudo pacman -S python
No additional system dependencies required.
Installation
Option A — uvx (recommended, no permanent install)
uvx automatically downloads and installs the package in an isolated environment. Only requires uv to be installed.
# Local backend:
uvx "whisper-transcribe-mcp[local]"
# OpenAI backend:
uvx "whisper-transcribe-mcp[openai]"
# Both backends:
uvx "whisper-transcribe-mcp[all]"
Option B — pip
# Local backend:
pip install "whisper-transcribe-mcp[local]"
# OpenAI backend:
pip install "whisper-transcribe-mcp[openai]"
# Both backends:
pip install "whisper-transcribe-mcp[all]"
Use Cases
Case 1 — Local backend only (free, works offline)
Uses faster-whisper to transcribe locally. The model is downloaded from HuggingFace on first use (~74MB for base) and cached.
Install:
pip install "whisper-transcribe-mcp[local]"
Environment variables:
WHISPER_MODEL=base # or tiny, small, medium, large-v3
Case 2 — OpenAI backend only (best accuracy, requires API key)
Uses OpenAI's whisper-1 model. Requires an API key and internet connection. No local model downloads.
Install:
pip install "whisper-transcribe-mcp[openai]"
Environment variables:
OPENAI_API_KEY=sk-...
Case 3 — Both backends (OpenAI if key present, local as fallback)
If OPENAI_API_KEY is set, OpenAI is used automatically. Otherwise falls back to local faster-whisper.
Install:
pip install "whisper-transcribe-mcp[all]"
Configuration
Claude Desktop
Config file location by operating system:
| OS | Path |
|---|---|
| macOS | ~/Library/Application Support/Claude/claude_desktop_config.json |
| Windows | %APPDATA%\Claude\claude_desktop_config.json |
| Linux | ~/.config/Claude/claude_desktop_config.json |
Add the entry inside "mcpServers":
Case 1 — Local:
{
"mcpServers": {
"whisper-transcribe": {
"command": "uvx",
"args": ["whisper-transcribe-mcp[local]"],
"env": {
"WHISPER_MODEL": "base"
}
}
}
}
Case 2 — OpenAI:
{
"mcpServers": {
"whisper-transcribe": {
"command": "uvx",
"args": ["whisper-transcribe-mcp[openai]"],
"env": {
"OPENAI_API_KEY": "sk-..."
}
}
}
}
Case 3 — Both (OpenAI takes priority if key is set):
{
"mcpServers": {
"whisper-transcribe": {
"command": "uvx",
"args": ["whisper-transcribe-mcp[all]"],
"env": {
"OPENAI_API_KEY": "sk-...",
"WHISPER_MODEL": "base"
}
}
}
}
Restart Claude Desktop after editing the file.
Claude Code
Works the same on macOS, Windows, and Linux. Requires uv installed.
Claude Code config file location:
| OS | Global | Per project |
|---|---|---|
| macOS / Linux | ~/.claude.json |
.claude/settings.json (project root) |
| Windows | C:\Users\<user>\.claude.json |
.claude\settings.json (project root) |
The easiest way to add the server is via the Claude Code CLI, which updates the config file automatically:
# Case 1 — Local:
claude mcp add whisper-transcribe uvx -- "whisper-transcribe-mcp[local]"
# Case 2 — OpenAI:
claude mcp add whisper-transcribe uvx --env OPENAI_API_KEY=sk-... -- "whisper-transcribe-mcp[openai]"
# Case 3 — Both:
claude mcp add whisper-transcribe uvx --env OPENAI_API_KEY=sk-... --env WHISPER_MODEL=base -- "whisper-transcribe-mcp[all]"
To add it globally (available in all projects), use --scope user:
claude mcp add --scope user whisper-transcribe uvx -- "whisper-transcribe-mcp[local]"
Or edit ~/.claude.json directly and add inside "mcpServers":
{
"mcpServers": {
"whisper-transcribe": {
"command": "uvx",
"args": ["whisper-transcribe-mcp[local]"],
"env": {
"WHISPER_MODEL": "base"
}
}
}
}
Environment Variables
| Variable | Default | Description |
|---|---|---|
WHISPER_MODEL |
base |
Local model size: tiny, base, small, medium, large-v3 |
OPENAI_API_KEY |
— | If set, activates the OpenAI backend instead of local |
Available Tools
transcribe_file
Transcribes an audio file by path (mp3, wav, m4a, ogg, flac, webm, etc.).
Parameters:
file_path(required): Absolute path to the audio filelanguage(optional): Language code (es,en,fr, etc.). Auto-detected if not provided.model_size(optional): Local model size. Ignored with the OpenAI backend.
Response:
{
"text": "Full transcription...",
"language": "en",
"language_probability": 0.99,
"segments": [
{ "start": 0.0, "end": 4.2, "text": "First segment..." }
],
"backend": "local",
"model": "base"
}
transcribe_base64
Transcribes audio provided as a base64-encoded string. Useful for programmatic integrations.
Parameters:
audio_base64(required): Base64-encoded audio dataextension(optional, defaultmp3): File extension (mp3,wav,ogg, etc.)language(optional): Language codemodel_size(optional): Local model size
list_models
Shows the active backend configuration and available local models.
Local Model Sizes
| Model | Size | Relative Speed | Notes |
|---|---|---|---|
tiny |
39 MB | ~32x | Fastest, least accurate |
base |
74 MB | ~16x | Good balance (default) |
small |
244 MB | ~6x | Better accuracy |
medium |
769 MB | ~2x | High accuracy |
large-v3 |
1.5 GB | ~1x | Best accuracy, slowest |
Models are downloaded automatically from HuggingFace on first use and cached locally.
License
MIT — see LICENSE
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file whisper_transcribe_mcp-1.0.1.tar.gz.
File metadata
- Download URL: whisper_transcribe_mcp-1.0.1.tar.gz
- Upload date:
- Size: 6.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8731347112a3d5f59f520364e62403d5b7979a0d62529f1cc8db2b6686f4fe7c
|
|
| MD5 |
83f8ea293acb90ddf44b00db5f79d466
|
|
| BLAKE2b-256 |
6ac9815ae47f8505d5b74aceccda9f05d2f51be946f38ce6e7ff3d597b67c869
|
Provenance
The following attestation bundles were made for whisper_transcribe_mcp-1.0.1.tar.gz:
Publisher:
publish.yml on ZahiriNatZuke/whisper-transcribe-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
whisper_transcribe_mcp-1.0.1.tar.gz -
Subject digest:
8731347112a3d5f59f520364e62403d5b7979a0d62529f1cc8db2b6686f4fe7c - Sigstore transparency entry: 1550748511
- Sigstore integration time:
-
Permalink:
ZahiriNatZuke/whisper-transcribe-mcp@6d1bf5ea8d88177ccb2805c3820bce9481f1f38d -
Branch / Tag:
refs/tags/v1.0.1 - Owner: https://github.com/ZahiriNatZuke
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@6d1bf5ea8d88177ccb2805c3820bce9481f1f38d -
Trigger Event:
release
-
Statement type:
File details
Details for the file whisper_transcribe_mcp-1.0.1-py3-none-any.whl.
File metadata
- Download URL: whisper_transcribe_mcp-1.0.1-py3-none-any.whl
- Upload date:
- Size: 7.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? Yes
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7c3705c16ef62e8fab6ecdff96c3b36ed286f9f3f56c3c22526f5576b348c433
|
|
| MD5 |
efca12421d6342b92e016e1de10d826d
|
|
| BLAKE2b-256 |
a14c38558534c824e126762ff39e48981fed9e8b3fe53de22b7c07c39357d5ad
|
Provenance
The following attestation bundles were made for whisper_transcribe_mcp-1.0.1-py3-none-any.whl:
Publisher:
publish.yml on ZahiriNatZuke/whisper-transcribe-mcp
-
Statement:
-
Statement type:
https://in-toto.io/Statement/v1 -
Predicate type:
https://docs.pypi.org/attestations/publish/v1 -
Subject name:
whisper_transcribe_mcp-1.0.1-py3-none-any.whl -
Subject digest:
7c3705c16ef62e8fab6ecdff96c3b36ed286f9f3f56c3c22526f5576b348c433 - Sigstore transparency entry: 1550748662
- Sigstore integration time:
-
Permalink:
ZahiriNatZuke/whisper-transcribe-mcp@6d1bf5ea8d88177ccb2805c3820bce9481f1f38d -
Branch / Tag:
refs/tags/v1.0.1 - Owner: https://github.com/ZahiriNatZuke
-
Access:
public
-
Token Issuer:
https://token.actions.githubusercontent.com -
Runner Environment:
github-hosted -
Publication workflow:
publish.yml@6d1bf5ea8d88177ccb2805c3820bce9481f1f38d -
Trigger Event:
release
-
Statement type: