Skip to main content

Voice dictation daemon using NVIDIA Parakeet on Apple Silicon

Project description

🐦 Birdword

Contextual voice dictation for macOS. Powered by NVIDIA Parakeet running locally on Apple Silicon via MLX.

Press a hotkey, speak, and your words are transcribed and pasted into whatever app is focused. A small LLM post-processes the transcription to fix errors, using project-specific context from a BIRDWORD.md file.

Getting started

Requires macOS on Apple Silicon (M1+) and Python 3.10+.

# Run with uvx (no install needed)
uvx birdword

# Or run in the background
uvx birdword start
uvx birdword stop
uvx birdword status

Context-aware correction

The key idea behind birdword is contextual transcription correction. When dictating into Terminal.app, birdword detects the focused tab's working directory and looks for a BIRDWORD.md file up the directory tree. This lets you teach birdword your project's domain:

Context detection works with:

  • Terminal.app — detects the focused tab's shell working directory
  • VS Code / VS Code Insiders — via the Birdword extension, which works with local and remote (SSH) workspaces

Transcription and pasting work in any app.

uvx birdword init

This creates a BIRDWORD.md with the default prompt template. Edit it to add your project's terms, names, and jargon:

---
transcription_model: mlx-community/parakeet-tdt-0.6b-v2
fix_model: mlx-community/Qwen2.5-1.5B-Instruct-4bit
---

Fix transcription errors. Output only the corrected text.

Example 1:
Input: "the java script function isnt working"
Output: "The JavaScript function isn't working."

Example 2:
Input: "check the get ignore file for the repo"
Output: "Check the .gitignore file for the repo."

Example 3:
Input: "we need to refactor the a p i endpoint"
Output: "We need to refactor the API endpoint."

Key terms: MyClass, some_function, PostgreSQL
Names: Alice, Bob

Input: "{{ transcript }}"
Output:

The file is a Jinja template. {{ transcript }} is replaced with the raw transcription. If omitted, the transcript is appended automatically.

The YAML front matter lets you override models per-project. When you dictate into a Terminal tab whose shell is in that directory (or a child), birdword picks up the nearest BIRDWORD.md and uses it.

Hotkeys

Action Default
Toggle recording Right ⌘ + Space
Hold to record Hold Right ⌘ for >1s, release to transcribe

Hotkeys are configurable:

--hold-key KEY       Hold key (default: rcmd). Options: rcmd, lcmd, ralt, lalt, rshift, lshift, rctrl, lctrl
--toggle-key KEY     Toggle key (default: space). Options: space, return, tab, escape

Options

--model MODEL        Transcription model (default: mlx-community/parakeet-tdt-0.6b-v2)
--fix-model MODEL    Post-processor model (default: mlx-community/Qwen2.5-1.5B-Instruct-4bit)
--no-fix             Disable LLM post-processing

Menu bar

Birdword shows a bird icon in the menu bar:

  • White — idle
  • 🟡 Yellow — connecting mic
  • 🔴 Red — listening
  • Sparkles — transcribing

Permissions

Birdword needs three macOS permissions, granted to your terminal app:

  • 🎤 Microphone — to record your voice
  • 🔐 Accessibility — to paste text and intercept the hotkey
  • ⌨️ Input Monitoring — to detect the global hotkey

Birdword checks these on startup and tells you what's missing.

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

birdword-0.2.1.tar.gz (406.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

birdword-0.2.1-py3-none-any.whl (27.4 kB view details)

Uploaded Python 3

File details

Details for the file birdword-0.2.1.tar.gz.

File metadata

  • Download URL: birdword-0.2.1.tar.gz
  • Upload date:
  • Size: 406.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for birdword-0.2.1.tar.gz
Algorithm Hash digest
SHA256 3141962429b6ff7b5b9e441176113395cbdb46a00eb4191065ee1fdcc877fa77
MD5 cd6af0026dd3e7b177250d26ffdd8c13
BLAKE2b-256 59bedfe4e4ea5d8ae5df1746cdde6d2364e218cdac51850f23e0b8dfd8e8cfde

See more details on using hashes here.

Provenance

The following attestation bundles were made for birdword-0.2.1.tar.gz:

Publisher: main.yaml on tillahoffmann/birdword

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file birdword-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: birdword-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 27.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for birdword-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 0d15c64313e79a8bc2e2c9a54f05a259e1e2653a7abae25dee2f9c073d83c0c4
MD5 baa2afbd4fe3991c7cb355aaaf619b2c
BLAKE2b-256 8084d5466e43e31e6d6226eb6da4997c19c7909e5aff84350f2e6a25de2bb605

See more details on using hashes here.

Provenance

The following attestation bundles were made for birdword-0.2.1-py3-none-any.whl:

Publisher: main.yaml on tillahoffmann/birdword

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page