Global hotkeys to record speech and transcribe directly to your cursor

Project description

Whisper Key - Local Speech-to-Text

Global hotkeys to record speech and transcribe directly to your cursor.

Questions or ideas? Discord

✨ Features

Global Hotkey: Start recording speech from any app
Auto-Paste: Transcribe directly to cursor
Auto-Send: Optionally auto-send with ENTER keypress
Local/Offline: Voice data never leaves your computer
CPU Ready: Small, efficient models available
GPU Ready: Support for both NVIDIA & AMD cards
Cross-platform: Works on Windows and macOS
Voice Commands: Trigger shortcuts, text snippets, and shell commands by voice — docs
Configurable: Customize hotkeys, models, and much more

🚀 Quick Start

From PyPI (Recommended)

Requires Python 3.11-3.13

# With pipx (isolated environment)
pipx install whisper-key-local

# Or with pip
pip install whisper-key-local

Then run: whisper-key (or wk for short)

Windows App

Download whisper-key.exe from the latest release
Run whisper-key.exe

From Source

git clone https://github.com/PinW/whisper-key-local.git
cd whisper-key-local
pip install -e .
python whisper-key.py

🎤 Basic Usage

Hotkey	Windows	macOS
Start recording	`Ctrl+Win`	`Fn+Ctrl`
Stop & transcribe	`Ctrl`	`Fn`
Stop & auto-send	`Alt`	`Option`
Cancel recording	`Esc`	`Shift`
Voice command mode	`Alt+Win`	`Fn+Command`

Open the system tray / menu bar icon to:

Toggle auto-paste vs clipboard-only
Change transcription model
Select audio device

🗣️ Voice Commands

Speak trigger phrases to run shell commands and more. Define in:

Windows: %APPDATA%\whisperkey\commands.yaml
macOS: ~/.whisperkey/commands.yaml

commands:
  # Send a keyboard shortcut
  - trigger: "undo"
    hotkey: "ctrl+z"
  # Deliver pre-written text
  - trigger: "my email"
    type: "user@example.com"
  # Run a shell command
  - trigger: "open notepad"
    run: 'notepad.exe'

See the Voice Commands Guide for full details.

⚡ GPU Acceleration

Whisper Key detects your GPU on first launch and offers one-press install of the required runtime libraries. Supports NVIDIA (CUDA) and AMD (ROCm).

For manual setup or troubleshooting, see the GPU Setup Guide.

⚙️ Configuration

Local settings at:

Windows: %APPDATA%\whisperkey\user_settings.yaml
macOS: ~/.whisperkey/user_settings.yaml

Delete this file and restart app to reset to defaults.

Option	Default	Notes
Whisper
`whisper.model`	`tiny`	Any model defined in `whisper.models`
`whisper.device`	`cpu`	cpu or cuda (NVIDIA/AMD GPU) — setup guide
`whisper.compute_type`	`int8`	int8/float16/float32
`whisper.language`	`auto`	auto or language code (en, es, fr, etc.)
`whisper.beam_size`	`5`	Higher = more accurate but slower (1-10)
`whisper.initial_prompt`	`""`	Guide transcription style, language variant, or script
`whisper.hotwords`	`[]`	Words the model should favor (names, technical terms)
`whisper.models`	(see config)	Add custom HuggingFace or local models
Post-Processing
`post_processing.strip_trailing_period`	`false`	Strip trailing period from output
`post_processing.corrections`	`{}`	Fix recurring misheard words, e.g. `CAPEX: [cap x]`
Hotkeys
`hotkey.recording_hotkey`	`ctrl+win` / `fn+ctrl`	Windows / macOS
`hotkey.stop_key`	`ctrl` / `fn`	Stop recording
`hotkey.auto_send_key`	`alt` / `option`	Stop + paste + Enter
`hotkey.cancel_combination`	`esc` / `shift`	Cancel recording
`hotkey.recording_mode`	`toggle`	toggle or push_to_talk
`hotkey.command_hotkey`	`alt+win` / `fn+command`	Voice command mode
Voice Activity Detection
`vad.vad_precheck_enabled`	`true`	Prevent hallucinations on silence
`vad.vad_onset_threshold`	`0.7`	Speech detection start (0.0-1.0)
`vad.vad_offset_threshold`	`0.55`	Speech detection end (0.0-1.0)
`vad.vad_min_speech_duration`	`0.1`	Min speech segment (seconds)
`vad.vad_realtime_enabled`	`true`	Auto-stop on silence
`vad.vad_silence_timeout_seconds`	`30.0`	Seconds before auto-stop
Audio
`audio.host`	`null`	Audio API (WASAPI, Core Audio, etc.)
`audio.channels`	`1`	1 = mono, 2 = stereo
`audio.dtype`	`float32`	float32/int16/int24/int32
`audio.max_duration`	`900`	Max recording seconds (0 = unlimited)
`audio.input_device`	`default`	Device ID or "default"
Clipboard
`clipboard.auto_paste`	`true`	false = clipboard only
`clipboard.delivery_method`	`paste`	paste (Ctrl+V) or type (direct injection)
`clipboard.paste_hotkey`	`ctrl+v` / `cmd+v`	Paste key simulation
`clipboard.paste_pre_paste_delay`	`0.05`	Delay after copy, before paste hotkey (seconds)
`clipboard.paste_preserve_clipboard`	`true`	Restore clipboard after paste
`clipboard.paste_clipboard_restore_delay`	`0.5`	Delay before clipboard restore (seconds)
`clipboard.type_also_copy_to_clipboard`	`false`	Also copy to clipboard in type mode
`clipboard.type_auto_enter_delay`	`0.15`	Delay before ENTER after typing (seconds)
`clipboard.type_auto_enter_delay_per_100_chars`	`0.1`	Extra ENTER delay per 100 typed chars (seconds)
Logging
`logging.level`	`INFO`	DEBUG/INFO/WARNING/ERROR/CRITICAL
`logging.file.enabled`	`true`	Write to app.log
`logging.log_transcriptions`	`false`	Include transcribed text in log (privacy)
`logging.console.enabled`	`true`	Print to console
`logging.console.level`	`WARNING`	Console verbosity
Audio Feedback
`audio_feedback.enabled`	`true`	Play sounds on record/stop
`audio_feedback.transcription_complete_enabled`	`false`	Play sound on transcription complete
`audio_feedback.ready_enabled`	`true`	Play sound when app finishes loading
`audio_feedback.start_sound`	`assets/sounds/...`	Custom sound file path
`audio_feedback.stop_sound`	`assets/sounds/...`	Custom sound file path
`audio_feedback.cancel_sound`	`assets/sounds/...`	Custom sound file path
`audio_feedback.transcription_complete_sound`	`assets/sounds/...`	Custom sound file path
`audio_feedback.ready_sound`	`assets/sounds/...`	Custom sound file path
System Tray
`system_tray.enabled`	`true`	Show tray icon
`system_tray.tooltip`	`Whisper Key`	Hover text
Terminal Title
`terminal_title.idle`	`""`	Tab title prefix when idle: static string or `[prefix, seconds]` animation frames
`terminal_title.recording`	🔴 blink	Tab title prefix while recording
`terminal_title.processing`	`""`	Tab title prefix while transcribing
Console
`console.start_hidden`	`false`	Hide console after startup (whisper-key-hideable.exe only)
Update
`update.mode`	`prompt`	prompt or auto
Voice Commands
`voice_commands.enabled`	`true`	Enable voice command mode

📁 Model Cache

Default path for transcription models (via HuggingFace):

Windows: %USERPROFILE%\.cache\huggingface\hub\
macOS: ~/.cache/huggingface/hub/

Contributing

Check the roadmap for planned features and see CONTRIBUTING.md for guidelines. Please open an issue before starting work on new features.

📦 Dependencies

Cross-platform: faster-whisper · numpy · sounddevice · soxr · pyperclip · ruamel.yaml · pystray · Pillow · playsound3 · ten-vad · hf-xet

Windows: global-hotkeys · pywin32

macOS: pyobjc-framework-Quartz · pyobjc-framework-ApplicationServices

Project details

Release history Release notifications | RSS feed

This version

0.8.2

Jun 12, 2026

0.8.1

Apr 17, 2026

0.8.0

Mar 17, 2026

0.7.1

Mar 7, 2026

0.7.0

Mar 7, 2026

0.6.3

Feb 17, 2026

0.6.2

Feb 9, 2026

0.6.1

Feb 4, 2026

0.6.0

Feb 4, 2026

0.5.3

Jan 19, 2026

0.5.2

Jan 16, 2026

0.5.1

Jan 16, 2026

0.5.0

Jan 15, 2026

0.4.0

Jan 14, 2026

0.3.0

Aug 19, 2025

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

whisper_key_local-0.8.2.tar.gz (921.8 kB view details)

Uploaded Jun 12, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

whisper_key_local-0.8.2-py3-none-any.whl (929.6 kB view details)

Uploaded Jun 12, 2026 Python 3

File details

Details for the file whisper_key_local-0.8.2.tar.gz.

File metadata

Download URL: whisper_key_local-0.8.2.tar.gz
Upload date: Jun 12, 2026
Size: 921.8 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for whisper_key_local-0.8.2.tar.gz
Algorithm	Hash digest
SHA256	`50b44c3b969b1527dfb9ce1c889728206064fc9b5671a86a45e7e93e49df5fd7`
MD5	`0ad75c44c38fb6790932b68df5680236`
BLAKE2b-256	`ce393178fdb1628beed66adbcdc8d3a25f56c0a2d224dec35b78c62664113aae`

See more details on using hashes here.

File details

Details for the file whisper_key_local-0.8.2-py3-none-any.whl.

File metadata

Download URL: whisper_key_local-0.8.2-py3-none-any.whl
Upload date: Jun 12, 2026
Size: 929.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for whisper_key_local-0.8.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`92364a132f499602c3cc05546e1cd73b9b20147c2bdbdf219134bf16efe20d16`
MD5	`0ed94ad3bef7f3cad1d4b2f7f323a065`
BLAKE2b-256	`8d3ebfbaf4f1991bf1b7f357214b878fad5474f3df1522537974fd2800b0f181`

See more details on using hashes here.

whisper-key-local 0.8.2

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Whisper Key - Local Speech-to-Text

✨ Features

🚀 Quick Start

From PyPI (Recommended)

Windows App

From Source

🎤 Basic Usage

🗣️ Voice Commands

⚡ GPU Acceleration

⚙️ Configuration

📁 Model Cache

Contributing

📦 Dependencies

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes