A background daemon for Wayland that transcribes audio and types results.
Project description
Harp 🎵
Harp is a powerful background daemon for Linux (specifically Wayland) that turns your voice into text, typed directly into any active window. Harp is now local-first, using faster-whisper for high-performance, private, and near-instant transcription. It can also be integrated with any OpenAI-compatible LLM provider for intelligent command processing and post-processing.
📚 Documentation
For in-depth information, please refer to our documentation:
- CLI Reference: Full guide to the
harpcommand and its options. - Deployment & Setup: Detailed guide on permissions, dependencies, and configuration.
- Internal Architecture: How Harp works under the hood.
- Development Guide: How to contribute and run tests.
✨ Features
...
- Local-First Transcription: Powered by
faster-whisper, ensuring your voice data stays on your machine and transcription is lightning-fast. - Continuous Processing (Opt-in): With
--continuous, Harp transcribes long recordings incrementally in the background, showing you live feedback in the terminal. - Global Hotkey (
Ctrl + Space): Instantly start and stop voice capture from anywhere. - Direct Keyboard Emulation: Transcribed text is typed automatically into your active application.
- Model Management CLI: Easily download, list, and manage Whisper models (tiny, base, small, medium, large-v3) directly from the
harpcommand. - Multimodal Modes:
- Transcription Mode: Standard "Voice-to-Text" for typing emails, notes, or code.
- Command Mode (
Ctrl + Shift + Space): Send locally transcribed text to an LLM (e.g., "Summarize the previous paragraph") and get the response typed back.
- Flexible Operation:
- Hold Mode: Record while you hold the hotkey.
- Toggle Mode (
--toggle): Click once to start, click again to stop.
- Clipboard Integration:
- Context Awareness: In Command Mode, Harp can read your clipboard and send a configurable amount of text to the LLM to provide context.
- Auto-Copy: Automatically copy the final result directly to your clipboard.
- Modern CLI: Beautiful terminal interface with colors, spinners, and panels powered by
Rich.
🚀 Installation
Using uv (Recommended)
The fastest way to run Harp without manual environment setup:
uvx harpio
Using pipx
For a persistent global installation:
pipx install harpio
From Source
git clone https://github.com/apiad/harp.git
cd harp
uv sync
uv run harp start
🛠 Setup & Requirements
- Permissions: Harp requires access to
/dev/inputand/dev/uinput.
sudo usermod -aG input $USER
# You may also need to set udev rules for uinput or run:
sudo chmod 666 /dev/uinput
- Whisper Model: Before starting for the first time, download a Whisper model.
harp models download base
- API Configuration (Optional): If you want to use Command Mode, Harp requires an OpenAI-compatible API key. Create a
.envfile in your home or project directory:
HARP_LLM_API_KEY=your_api_key_here
# Optional: Override the default provider (OpenRouter)
HARP_LLM_BASE_URL=https://openrouter.ai/api/v1
HARP_LLM_MODEL=google/gemini-2.0-flash
- Dependencies: Ensure
libportaudio2andwl-clipboard(for Wayland clipboard support) are installed.
sudo apt install libportaudio2 wl-clipboard
⌨️ Usage
Start the daemon using the CLI. By default, it runs in "Hold Mode" and only types safe characters.
harp start
Configuration & CLI Options
You can customize Harp's behavior using the following flags:
| Option | Short | Description | Use Case |
|---|---|---|---|
--device <path> |
-d |
Target a specific input device (e.g., /dev/input/event0). |
Useful if you have multiple keyboards and only want to trigger Harp from one specific device. |
--toggle |
-t |
Enable toggle mode instead of hold mode. | Press Ctrl+Space once to start recording, then press it again to stop and transcribe. |
--full |
-f |
Disable safe filtering and type all returned characters, including symbols. | Essential when dictating code, complex punctuation, or URLs. |
--copy |
Automatically copy the final transcribed text to your clipboard. | Dictate an idea, let Harp type it out, and also have it ready in your clipboard to paste elsewhere. | |
--send-clipboard |
Command Mode Only: Send a number of tokens from the clipboard as context to LLM. | Copy an email, press Ctrl+Shift+Space, and say "Draft a polite decline to this email." |
|
--type |
Enable typing the result (default: False). | Enable with harp start --type to type directly into active windows. |
Examples
Programmer Mode: Toggle recording, type all code symbols, and copy the result to clipboard.
harp start --toggle --full --copy
Contextual Assistant: Use clipboard context and limit to the last 1000 tokens.
harp start --send-clipboard 1000
🤝 Contributing
Contributions are what make the open-source community such an amazing place to learn, inspire, and create. Any contributions you make are greatly appreciated.
🤖 Gemini CLI
This repository is enhanced with custom Gemini CLI commands to automate common tasks and workflows. If you are using Gemini, you can run:
gemini /onboard
to get started, explore the project architecture, and understand the automated workflows (planning, debugging, releases, etc.).
Standard Process
- Report Bugs: Open an issue if you find something broken.
- Suggest Features: Have an idea for a new mode? Let us know!
- Submit Pull Requests:
- Fork the project.
- Create your Feature Branch (
git checkout -b feature/AmazingFeature). - Commit your changes (
git commit -m 'feat: Add some AmazingFeature'). - Push to the branch (
git push origin feature/AmazingFeature). - Open a Pull Request.
📄 License
Distributed under the MIT License. See LICENSE for more information.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file harpio-0.5.1.tar.gz.
File metadata
- Download URL: harpio-0.5.1.tar.gz
- Upload date:
- Size: 1.9 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fad1067aae7e3cc7d88445d8a665f61aa6477d72d1657fe1030fb9de3b2b4e81
|
|
| MD5 |
1a05d7c72dabc14dec3424917ebf418c
|
|
| BLAKE2b-256 |
ea48417267916a940907a2e080c59b52fea29d82725b24ddd1a95f3e07ef70b1
|
File details
Details for the file harpio-0.5.1-py3-none-any.whl.
File metadata
- Download URL: harpio-0.5.1-py3-none-any.whl
- Upload date:
- Size: 20.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: uv/0.10.9 {"installer":{"name":"uv","version":"0.10.9","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
2aa59f595ed03ffaaf3977f54f4f4d1ceeb798c6b84752b7e84b7d1866b553c9
|
|
| MD5 |
50f45538dd82aeae1c225e080ff77cda
|
|
| BLAKE2b-256 |
63177866b7fb9e59c88e871010c5541b9f45b670ce3f1d6bda42e57977321c39
|