Skip to main content

MCP server for audio transcription via faster-whisper (local) or OpenAI Whisper API

Project description

whisper-transcribe-mcp

PyPI version License: MIT Python 3.10+

MCP server for audio transcription using faster-whisper (local, free, offline) or OpenAI Whisper API (cloud, requires API key). Works with Claude Desktop and Claude Code on macOS, Windows, and Linux.


Prerequisites

macOS

Option A — uv (recommended):

brew install uv
# or
curl -LsSf https://astral.sh/uv/install.sh | sh

Option B — Python: Python 3.10+ is included in macOS 12.3+. You can also install it with brew install python.


Windows

Option A — uv (recommended):

winget install astral-sh.uv

Or download the installer from astral.sh/uv.

Option B — Python: Download Python 3.10+ from python.org. During installation, check "Add Python to PATH".

No need to install ffmpeg or any compiler — everything is bundled in the package.


Linux

Option A — uv (recommended):

curl -LsSf https://astral.sh/uv/install.sh | sh

Option B — Python:

# Debian/Ubuntu
sudo apt install python3.12 python3.12-venv

# Fedora
sudo dnf install python3.12

# Arch
sudo pacman -S python

No additional system dependencies required.


Installation

Option A — uvx (recommended, no permanent install)

uvx automatically downloads and installs the package in an isolated environment. Only requires uv to be installed.

# Local backend:
uvx "whisper-transcribe-mcp[local]"

# OpenAI backend:
uvx "whisper-transcribe-mcp[openai]"

# Both backends:
uvx "whisper-transcribe-mcp[all]"

Option B — pip

# Local backend:
pip install "whisper-transcribe-mcp[local]"

# OpenAI backend:
pip install "whisper-transcribe-mcp[openai]"

# Both backends:
pip install "whisper-transcribe-mcp[all]"

Use Cases

Case 1 — Local backend only (free, works offline)

Uses faster-whisper to transcribe locally. The model is downloaded from HuggingFace on first use (~74MB for base) and cached.

Install:

pip install "whisper-transcribe-mcp[local]"

Environment variables:

WHISPER_MODEL=base   # or tiny, small, medium, large-v3

Case 2 — OpenAI backend only (best accuracy, requires API key)

Uses OpenAI's whisper-1 model. Requires an API key and internet connection. No local model downloads.

Install:

pip install "whisper-transcribe-mcp[openai]"

Environment variables:

OPENAI_API_KEY=sk-...

Case 3 — Both backends (OpenAI if key present, local as fallback)

If OPENAI_API_KEY is set, OpenAI is used automatically. Otherwise falls back to local faster-whisper.

Install:

pip install "whisper-transcribe-mcp[all]"

Configuration

Claude Desktop

Config file location by operating system:

OS Path
macOS ~/Library/Application Support/Claude/claude_desktop_config.json
Windows %APPDATA%\Claude\claude_desktop_config.json
Linux ~/.config/Claude/claude_desktop_config.json

Add the entry inside "mcpServers":

Case 1 — Local:

{
  "mcpServers": {
    "whisper-transcribe": {
      "command": "uvx",
      "args": ["whisper-transcribe-mcp[local]"],
      "env": {
        "WHISPER_MODEL": "base"
      }
    }
  }
}

Case 2 — OpenAI:

{
  "mcpServers": {
    "whisper-transcribe": {
      "command": "uvx",
      "args": ["whisper-transcribe-mcp[openai]"],
      "env": {
        "OPENAI_API_KEY": "sk-..."
      }
    }
  }
}

Case 3 — Both (OpenAI takes priority if key is set):

{
  "mcpServers": {
    "whisper-transcribe": {
      "command": "uvx",
      "args": ["whisper-transcribe-mcp[all]"],
      "env": {
        "OPENAI_API_KEY": "sk-...",
        "WHISPER_MODEL": "base"
      }
    }
  }
}

Restart Claude Desktop after editing the file.


Claude Code

Works the same on macOS, Windows, and Linux. Requires uv installed.

Claude Code config file location:

OS Global Per project
macOS / Linux ~/.claude.json .claude/settings.json (project root)
Windows C:\Users\<user>\.claude.json .claude\settings.json (project root)

The easiest way to add the server is via the Claude Code CLI, which updates the config file automatically:

# Case 1 — Local:
claude mcp add whisper-transcribe uvx -- "whisper-transcribe-mcp[local]"

# Case 2 — OpenAI:
claude mcp add whisper-transcribe uvx --env OPENAI_API_KEY=sk-... -- "whisper-transcribe-mcp[openai]"

# Case 3 — Both:
claude mcp add whisper-transcribe uvx --env OPENAI_API_KEY=sk-... --env WHISPER_MODEL=base -- "whisper-transcribe-mcp[all]"

To add it globally (available in all projects), use --scope user:

claude mcp add --scope user whisper-transcribe uvx -- "whisper-transcribe-mcp[local]"

Or edit ~/.claude.json directly and add inside "mcpServers":

{
  "mcpServers": {
    "whisper-transcribe": {
      "command": "uvx",
      "args": ["whisper-transcribe-mcp[local]"],
      "env": {
        "WHISPER_MODEL": "base"
      }
    }
  }
}

Environment Variables

Variable Default Description
WHISPER_MODEL base Local model size: tiny, base, small, medium, large-v3
OPENAI_API_KEY If set, activates the OpenAI backend instead of local

Available Tools

transcribe_file

Transcribes an audio file by path (mp3, wav, m4a, ogg, flac, webm, etc.).

Parameters:

  • file_path (required): Absolute path to the audio file
  • language (optional): Language code (es, en, fr, etc.). Auto-detected if not provided.
  • model_size (optional): Local model size. Ignored with the OpenAI backend.

Response:

{
  "text": "Full transcription...",
  "language": "en",
  "language_probability": 0.99,
  "segments": [
    { "start": 0.0, "end": 4.2, "text": "First segment..." }
  ],
  "backend": "local",
  "model": "base"
}

transcribe_base64

Transcribes audio provided as a base64-encoded string. Useful for programmatic integrations.

Parameters:

  • audio_base64 (required): Base64-encoded audio data
  • extension (optional, default mp3): File extension (mp3, wav, ogg, etc.)
  • language (optional): Language code
  • model_size (optional): Local model size

list_models

Shows the active backend configuration and available local models.


Local Model Sizes

Model Size Relative Speed Notes
tiny 39 MB ~32x Fastest, least accurate
base 74 MB ~16x Good balance (default)
small 244 MB ~6x Better accuracy
medium 769 MB ~2x High accuracy
large-v3 1.5 GB ~1x Best accuracy, slowest

Models are downloaded automatically from HuggingFace on first use and cached locally.


License

MIT — see LICENSE

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

whisper_transcribe_mcp-1.0.1.tar.gz (6.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

whisper_transcribe_mcp-1.0.1-py3-none-any.whl (7.0 kB view details)

Uploaded Python 3

File details

Details for the file whisper_transcribe_mcp-1.0.1.tar.gz.

File metadata

  • Download URL: whisper_transcribe_mcp-1.0.1.tar.gz
  • Upload date:
  • Size: 6.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for whisper_transcribe_mcp-1.0.1.tar.gz
Algorithm Hash digest
SHA256 8731347112a3d5f59f520364e62403d5b7979a0d62529f1cc8db2b6686f4fe7c
MD5 83f8ea293acb90ddf44b00db5f79d466
BLAKE2b-256 6ac9815ae47f8505d5b74aceccda9f05d2f51be946f38ce6e7ff3d597b67c869

See more details on using hashes here.

Provenance

The following attestation bundles were made for whisper_transcribe_mcp-1.0.1.tar.gz:

Publisher: publish.yml on ZahiriNatZuke/whisper-transcribe-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file whisper_transcribe_mcp-1.0.1-py3-none-any.whl.

File metadata

File hashes

Hashes for whisper_transcribe_mcp-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7c3705c16ef62e8fab6ecdff96c3b36ed286f9f3f56c3c22526f5576b348c433
MD5 efca12421d6342b92e016e1de10d826d
BLAKE2b-256 a14c38558534c824e126762ff39e48981fed9e8b3fe53de22b7c07c39357d5ad

See more details on using hashes here.

Provenance

The following attestation bundles were made for whisper_transcribe_mcp-1.0.1-py3-none-any.whl:

Publisher: publish.yml on ZahiriNatZuke/whisper-transcribe-mcp

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page