Skip to main content

Voice dictation daemon for Linux with Sarvam AI STT

Project description

voxd

Voice dictation daemon for Linux. Press keybind, talk, press again. Text goes to clipboard.

Install

pip install voxd

Dependencies:

# X11 (dwm, i3, bspwm, etc)
sudo pacman -S ffmpeg xclip

# Wayland (hyprland, sway, etc)
sudo pacman -S ffmpeg wl-clipboard

Setup

Set your Sarvam AI API key:

voxd config set api_key YOUR_KEY

Optional - set language:

voxd config set language hi-IN  # Hindi
voxd config set language en-IN  # English (default)

Start daemon

Add to your startup:

dwm/i3/bspwm - ~/.xinitrc:

voxd-daemon &

hyprland - ~/.config/hypr/hyprland.conf:

exec-once = voxd-daemon

Keybind

dwm - config.h:

{ MODKEY, XK_semicolon, spawn, SHCMD("voxd toggle") },

hyprland - hyprland.conf:

bind = SUPER, semicolon, exec, voxd toggle

Usage

Press keybind → talk → press keybind → paste (Ctrl+V)

Terminal:

voxd toggle    # Start/stop recording
voxd status    # Check if recording
voxd quit      # Kill daemon

Config

voxd config list              # Show all settings
voxd config set key value     # Change setting
voxd config get key           # Get value

Settings:

  • api_key - Sarvam AI key (required)
  • language - Language code (default: en-IN)
  • model - STT model (default: saaras:v3)

Config stored at ~/.config/voxd/config.json

Advanced

WM-specific commands (if auto-detect fails):

voxd-dwm toggle     # Force X11 mode
voxd-hypr toggle    # Force Wayland mode

Troubleshooting

Daemon not running:

ps aux | grep voxd
cat /run/user/$UID/voxd/daemon.log

No clipboard:

# Install clipboard tool
sudo pacman -S xclip          # X11
sudo pacman -S wl-clipboard   # Wayland

Wrong language:

voxd config set language hi-IN
voxd quit && voxd-daemon &

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

voxd-0.1.2.tar.gz (11.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

voxd-0.1.2-py3-none-any.whl (12.5 kB view details)

Uploaded Python 3

File details

Details for the file voxd-0.1.2.tar.gz.

File metadata

  • Download URL: voxd-0.1.2.tar.gz
  • Upload date:
  • Size: 11.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for voxd-0.1.2.tar.gz
Algorithm Hash digest
SHA256 3674f349f579892e601ae6eaf4ce7555c63763116042449928b6cc8c1e79ebff
MD5 091018e4210c9325603095223abbbcc4
BLAKE2b-256 2a55b45c11f60793b4c4926920f850fc28cd9df2e26b443cda798d57c7fb8223

See more details on using hashes here.

File details

Details for the file voxd-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: voxd-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 12.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.14.3

File hashes

Hashes for voxd-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 852049efd018116f32797d476cd2b00dc1d4092a89744dbe0cd1ca5265c38a01
MD5 9cb10942846d4f0201a546ef12dbcc28
BLAKE2b-256 84f7aa3390103ca392acf49b741441793c6912c9e5fd77c24cc798e151c920e8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page