Termivox

Voice Recognition Bridge for Linux — Speak naturally, control your system, type hands-free.

🎯 Overview

Termivox is a Linux-based voice recognition system that transforms your speech into text and system commands. Using offline voice recognition (Vosk), it provides:

Hands-free dictation - Speak and watch your words appear
Voice-controlled system commands - Copy, paste, click, scroll by voice
Multi-language support - English and French recognition
Toggle control - Pause/resume recognition instantly like a guitar pedal
Privacy-first - All processing happens locally, no cloud required

✨ Features

🎤 Voice Recognition

Offline speech-to-text powered by Vosk
Bilingual support: English (en) and French (fr)
Punctuation by voice - Say "comma", "period", "question mark"
Edit commands - "new line", "tab", "new paragraph"
System commands - "copy", "paste", "click", "scroll up/down"
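
As an illustration of the offline speech-to-text step, here is a minimal sketch of a Vosk recognition loop fed by PyAudio. It is not taken from the Termivox source: the function names `extract_text` and `listen`, the 16 kHz sample rate, and the buffer sizes are assumptions chosen for the example.

```python
import json


def extract_text(result_json: str) -> str:
    """Pull the recognized phrase out of a Vosk result string."""
    return json.loads(result_json).get("text", "").strip()


def listen(model_dir: str, on_text, sample_rate: int = 16000) -> None:
    """Feed microphone audio to Vosk and call on_text for each final phrase.

    Hypothetical helper: the name and parameters are not from the project.
    """
    # Imported lazily so extract_text() works without the audio libraries.
    import pyaudio
    from vosk import KaldiRecognizer, Model

    recognizer = KaldiRecognizer(Model(model_dir), sample_rate)
    audio = pyaudio.PyAudio()
    stream = audio.open(format=pyaudio.paInt16, channels=1, rate=sample_rate,
                        input=True, frames_per_buffer=8000)
    try:
        while True:
            chunk = stream.read(4000, exception_on_overflow=False)
            # AcceptWaveform returns True once a full utterance is decoded.
            if recognizer.AcceptWaveform(chunk):
                text = extract_text(recognizer.Result())
                if text:
                    on_text(text)
    finally:
        stream.stop_stream()
        stream.close()
        audio.terminate()
```

Calling `listen("voice_models/vosk-model-small-en-us-0.15", print)` would print each recognized phrase until interrupted, assuming a working microphone and a downloaded model.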

🎛️ Toggle Control (NEW!)

Switch voice recognition on or off through any of these interfaces:

⌨️ Global Hotkey

Press Ctrl+Alt+V from anywhere to toggle
Customizable key combination
Works across all applications

🖱️ Desktop Widget

Minimal floating window (160×70px)
One-click toggle button
Visual status: "LISTENING" (green) / "MUTED" (gray)
Draggable, always-on-top
Never steals cursor focus

🎛️ System Tray Icon

Green/red status indicator
Click to toggle
Right-click menu

🎮 Hardware Support (Coming Soon)

USB foot pedal support
MIDI controller integration
Custom button devices

📦 Installation

Prerequisites

System Requirements:

Linux (tested on Ubuntu 24.04)
Python 3.8+
Microphone input

System Dependencies:

```
sudo apt install python3-pyaudio xdotool sox portaudio19-dev -y
```

Setup

Clone the repository:

```
git clone https://github.com/Gerico1007/termivox.git
cd termivox
```

Create virtual environment:

```
python3 -m venv termivox-env
source termivox-env/bin/activate
```

Install Python dependencies:
```
pip install -r requirements.txt
```
Download voice model (if not already present):
```
python download_model.py
```
Run Termivox:
```
./run.sh
```

🚀 Usage

Quick Start

Launch with toggle control:

```
./run.sh
```

Original mode (no toggle):

```
source termivox-env/bin/activate
python src/main.py --no-toggle
```

Test voice recognition only:

```
source termivox-env/bin/activate
python src/test_voice_script.py --lang en
```

Toggle Control

Once Termivox is running, control it using:

Hotkey:

Press Ctrl+Alt+V → Pauses/resumes voice recognition
Works from any window and keeps the cursor position

Widget:

Click the floating "LISTENING" or "MUTED" button
Drag the title bar to reposition
Right-click to close widget

Indicator:

Green = Voice recognition ACTIVE (listening)
Gray (widget) or Red (tray icon) = Voice recognition MUTED (paused)
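
Because the hotkey, widget, and tray icon all flip the same state, one way to picture the design is a single shared flag that notifies every interface when it changes. The sketch below is an assumption about how such a controller could look, not the contents of toggle_controller.py; the class name `ToggleState` and its methods are hypothetical.

```python
import threading
from typing import Callable, List


class ToggleState:
    """Thread-safe on/off flag that notifies listeners on every change."""

    def __init__(self, active: bool = True) -> None:
        self._active = active
        self._lock = threading.Lock()
        self._listeners: List[Callable[[bool], None]] = []

    @property
    def active(self) -> bool:
        with self._lock:
            return self._active

    def subscribe(self, listener: Callable[[bool], None]) -> None:
        """Register a callback such as a widget or tray refresh."""
        self._listeners.append(listener)

    def toggle(self) -> bool:
        """Flip the flag and return the new state."""
        with self._lock:
            self._active = not self._active
            state = self._active
        # Notify outside the lock so listeners may read .active safely.
        for listener in list(self._listeners):
            listener(state)
        return state
```

Each interface would call `toggle()` on user input and use `subscribe()` to redraw its own indicator, so the green and gray/red states stay in sync.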

Voice Commands

Dictation:

"Hello world" → types: Hello world

Punctuation:

"Hello comma world period" → types: Hello, world.

Available punctuation:

comma, period, question mark, exclamation mark
colon, semicolon, dash, quote, apostrophe
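
The punctuation rule above can be sketched as a small lookup that replaces spoken names with symbols and attaches them to the preceding word. This is an illustrative approximation only: the `PUNCTUATION` table and `apply_punctuation` function are hypothetical, and real handling of quotes and dashes is likely more nuanced.

```python
# Hypothetical mapping; the spoken names come from the list above.
PUNCTUATION = {
    "comma": ",", "period": ".", "question mark": "?",
    "exclamation mark": "!", "colon": ":", "semicolon": ";",
    "dash": "-", "quote": '"', "apostrophe": "'",
}


def apply_punctuation(spoken: str) -> str:
    """Replace spoken punctuation names and glue them to the previous word."""
    words = spoken.split()
    out = []
    i = 0
    while i < len(words):
        single = words[i].lower()
        # Try two-word names ("question mark") before single words.
        pair = f"{single} {words[i + 1].lower()}" if i + 1 < len(words) else None
        if pair in PUNCTUATION:
            symbol, step = PUNCTUATION[pair], 2
        elif single in PUNCTUATION:
            symbol, step = PUNCTUATION[single], 1
        else:
            symbol, step = None, 1
        if symbol is None:
            out.append(words[i])
        elif out:
            out[-1] += symbol  # no space before the symbol
        else:
            out.append(symbol)
        i += step
    return " ".join(out)
```

With this sketch, `apply_punctuation("Hello comma world period")` yields `Hello, world.`, matching the example above.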

Editing:

"new line"       → ↵
"new paragraph"  → ↵↵
"tab"            → ⇥

System Commands:

"copy"           → Ctrl+C
"paste"          → Ctrl+V
"select all"     → Ctrl+A
"click"          → Mouse click
"scroll up"      → Scroll wheel up
"scroll down"    → Scroll wheel down

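The editing and system commands above could be dispatched to xdotool roughly as follows. This is a sketch under assumptions, not the code in xdotool_bridge.py: the `COMMANDS` table and the `build_command` and `run` helpers are hypothetical, and the choice of left click for "click" is a guess. In xdotool, mouse button 1 is the left button and buttons 4 and 5 are wheel up and wheel down.

```python
import subprocess
from typing import List

# Hypothetical table built from the commands listed above.
COMMANDS = {
    "copy": ["xdotool", "key", "ctrl+c"],
    "paste": ["xdotool", "key", "ctrl+v"],
    "select all": ["xdotool", "key", "ctrl+a"],
    "click": ["xdotool", "click", "1"],        # assumed: left button
    "scroll up": ["xdotool", "click", "4"],    # wheel up
    "scroll down": ["xdotool", "click", "5"],  # wheel down
    "new line": ["xdotool", "key", "Return"],
    "new paragraph": ["xdotool", "key", "Return", "Return"],
    "tab": ["xdotool", "key", "Tab"],
}


def build_command(phrase: str) -> List[str]:
    """Return the xdotool argv for a phrase; unknown phrases are typed."""
    known = COMMANDS.get(phrase.strip().lower())
    if known is not None:
        return list(known)
    return ["xdotool", "type", phrase]


def run(phrase: str) -> None:
    """Execute the command; requires xdotool and an X11 session."""
    subprocess.run(build_command(phrase), check=True)
```

Keeping command construction separate from execution makes the mapping easy to check without a running X session.
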
Language Selection

English (default):

```
./run.sh
# or
python src/main.py --lang en
```

French:

```
python src/main.py --lang fr
```
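
For readers curious how the `--lang` and `--no-toggle` flags might be handled, here is a hedged sketch using argparse. It is not the project's main.py: `build_parser`, `model_path`, and `MODEL_DIRS` are hypothetical names, and the French model directory name is an assumption (only the English directory appears in the project tree).

```python
import argparse
from pathlib import Path

MODEL_DIRS = {
    "en": "vosk-model-small-en-us-0.15",  # listed under voice_models/
    "fr": "vosk-model-small-fr-0.22",     # assumed name, not confirmed
}


def build_parser() -> argparse.ArgumentParser:
    """Command-line options mirroring the flags shown in this README."""
    parser = argparse.ArgumentParser(prog="termivox")
    parser.add_argument("--lang", choices=sorted(MODEL_DIRS), default="en",
                        help="recognition language")
    parser.add_argument("--no-toggle", action="store_true",
                        help="run without the toggle interfaces")
    return parser


def model_path(lang: str, root: str = "voice_models") -> Path:
    """Resolve the model directory for a language code."""
    return Path(root) / MODEL_DIRS[lang]
```

Restricting `--lang` with `choices` means an unsupported code fails at startup with a clear message instead of a missing-model error later.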

⚙️ Configuration

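Edit config/settings.json to customize behavior. The // comments below are annotations only; standard JSON does not allow comments, so omit them from the real file unless the loader is known to accept them: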

```
{
  "interfaces": {
    "hotkey": {
      "enabled": true,
      "key": "ctrl+alt+v"        // Change hotkey here
    },
    "tray": {
      "enabled": false            // Enable system tray icon
    },
    "widget": {
      "enabled": true,            // Desktop widget
      "position": {"x": 100, "y": 100},
      "size": {"width": 160, "height": 70},
      "always_on_top": true
    }
  },
  "voice": {
    "language": "en",             // Default language
    "auto_space": true            // Auto-add spaces
  }
}
```
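
Python's `json` module rejects `//` comments, so a file copied verbatim from the annotated example would fail to parse with a plain `json.load`. The sketch below shows one possible way to tolerate such comments; it is an assumption for illustration, not how config_loader.py is known to work, and `strip_line_comments` and `load_settings` are hypothetical names.

```python
import json


def strip_line_comments(text: str) -> str:
    """Remove // comments that sit outside JSON string literals."""
    cleaned = []
    for line in text.splitlines():
        in_string = False
        escaped = False
        cut = len(line)
        for i, ch in enumerate(line):
            if escaped:
                escaped = False          # character after a backslash
            elif ch == "\\" and in_string:
                escaped = True
            elif ch == '"':
                in_string = not in_string
            elif not in_string and line.startswith("//", i):
                cut = i                  # comment starts here
                break
        cleaned.append(line[:cut])
    return "\n".join(cleaned)


def load_settings(text: str) -> dict:
    """Parse settings text that may contain // annotations."""
    return json.loads(strip_line_comments(text))
```

Tracking string state per line keeps values such as URLs intact, since a `//` inside quotes is not treated as a comment.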

Custom Hotkey Examples:

"ctrl+shift+v"
"ctrl+alt+t"
"super+v"

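pynput writes hotkeys with named keys in angle brackets, for example `<ctrl>+<alt>+v`, and refers to the Super key as `cmd`. A config string such as those above would therefore need converting before it reaches pynput. The following is a sketch of that conversion under assumptions: `to_pynput_hotkey`, `start_hotkey_listener`, and the alias table are hypothetical and not taken from hotkey_interface.py.

```python
# Hypothetical aliases: pynput names the Super/Windows key "cmd".
MODIFIER_ALIASES = {"super": "cmd", "win": "cmd", "control": "ctrl"}


def to_pynput_hotkey(combo: str) -> str:
    """Convert 'ctrl+alt+v' style strings to pynput's '<ctrl>+<alt>+v'."""
    parts = []
    for token in combo.lower().split("+"):
        token = token.strip()
        token = MODIFIER_ALIASES.get(token, token)
        # Named keys are wrapped in angle brackets; single characters stay bare.
        parts.append(token if len(token) == 1 else f"<{token}>")
    return "+".join(parts)


def start_hotkey_listener(combo: str, on_toggle):
    """Start a global hotkey listener; needs pynput and a desktop session."""
    from pynput import keyboard  # imported lazily for the pure helper above

    listener = keyboard.GlobalHotKeys({to_pynput_hotkey(combo): on_toggle})
    listener.start()
    return listener
```

For example, `to_pynput_hotkey("super+v")` gives `<cmd>+v`.
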
📁 Project Structure

```
termivox/
├── src/
│   ├── main.py                    # Main entry point with toggle support
│   ├── test_voice_script.py       # Standalone testing utility
│   ├── voice/
│   │   ├── recognizer.py          # Vosk voice recognition engine
│   │   └── __init__.py
│   ├── bridge/
│   │   ├── xdotool_bridge.py      # System command executor
│   │   └── __init__.py
│   ├── ui/                        # Toggle control interfaces
│   │   ├── toggle_controller.py   # Central state management
│   │   ├── hotkey_interface.py    # Global hotkey listener
│   │   ├── tray_interface.py      # System tray icon
│   │   ├── widget_interface.py    # Desktop widget
│   │   ├── hardware_interface.py  # Hardware button stub
│   │   ├── config_loader.py       # Configuration system
│   │   └── __init__.py
│   └── utils/
│       └── __init__.py
├── config/
│   └── settings.json              # User configuration
├── voice_models/                  # Vosk language models
│   └── vosk-model-small-en-us-0.15/
├── requirements.txt               # Python dependencies
├── run.sh                         # Launch script
├── download_model.py              # Model downloader
└── README.md
```

🛠️ Dependencies

Python Packages:

Vosk - Offline speech recognition
pyaudio - Microphone input
numpy - Audio processing
pynput - Global hotkey support
pystray - System tray icon
Pillow - Icon generation
xdotool - System command execution

System Packages:

python3-pyaudio - PyAudio bindings
xdotool - Keyboard/mouse automation
sox - Audio utilities
portaudio19-dev - Audio development headers

🎨 Toggle Widget Design

Minimal Professional Aesthetic:

```
┌─────────────────────┐
│ TERMIVOX         ● │  ← Dark title bar (draggable)
├─────────────────────┤
│                     │
│    LISTENING        │  ← Green button (active state)
│                     │
└─────────────────────┘
```

Features:

Compact: 160×70 pixels
Unfocusable: Never steals cursor
Draggable: Reposition anywhere
Color-coded: Green (ON) / Gray (OFF)
Always-on-top: Stays visible
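
To make the widget settings concrete, the sketch below turns the `widget` block from config/settings.json into a Tk geometry string and builds a bare always-on-top window. It is a rough illustration, not widget_interface.py: `geometry_string` and `build_widget` are hypothetical, and the sketch does not implement dragging, the dark title bar, or the unfocusable behavior described above.

```python
def geometry_string(widget_cfg: dict) -> str:
    """Build a Tk geometry string of the form WIDTHxHEIGHT+X+Y."""
    size, pos = widget_cfg["size"], widget_cfg["position"]
    return f"{size['width']}x{size['height']}+{pos['x']}+{pos['y']}"


def build_widget(widget_cfg: dict, on_click):
    """Create the window and button; requires tkinter and a display."""
    import tkinter as tk  # imported lazily so geometry_string() has no GUI dependency

    root = tk.Tk()
    root.title("TERMIVOX")
    root.geometry(geometry_string(widget_cfg))
    # Keep the window above others when the config asks for it.
    root.attributes("-topmost", bool(widget_cfg.get("always_on_top", True)))
    button = tk.Button(root, text="LISTENING", bg="green", fg="white",
                       command=on_click, takefocus=False)
    button.pack(fill="both", expand=True)
    return root, button
```

With the default configuration, `geometry_string` returns `160x70+100+100`; calling `root.mainloop()` on the returned window would display it.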

🧪 Testing

Test voice recognition without typing:

```
source termivox-env/bin/activate
python src/test_voice_script.py --lang en
```

Test with toggle control:

```
./run.sh
# Then try:
# 1. Speak something
# 2. Press Ctrl+Alt+V
# 3. Speak again (should not type)
# 4. Press Ctrl+Alt+V
# 5. Speak (should type again)
```

Test different languages:

```
python src/test_voice_script.py --lang fr  # French
python src/test_voice_script.py --lang en  # English
```

🐛 Troubleshooting

Hotkey doesn't work:

Check the terminal for errors
Try a different hotkey in config/settings.json
Ensure pynput is installed: pip list | grep pynput

No voice recognition:

Check microphone: arecord -l
Test PyAudio: python -c "import pyaudio; print('OK')"
Verify that the Vosk model has been downloaded into voice_models/

Widget not visible:

Enable in config: "widget": {"enabled": true}
Check that tkinter is available: python -c "import tkinter"

System tray icon missing:

Enable it in config: "tray": {"enabled": true} (it is disabled in the example configuration)
Some desktop environments do not support a system tray
If the icon still does not appear, use the widget or hotkey instead
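
The individual checks above can be combined into one diagnostic pass. The script below is a sketch written for this README, not a tool shipped with Termivox: `check_environment` and the `PYTHON_MODULES` list are hypothetical, with import names inferred from the dependency list (Pillow imports as `PIL`).

```python
import importlib.util
import shutil
from pathlib import Path

# Import names assumed from the Dependencies section.
PYTHON_MODULES = ["vosk", "pyaudio", "numpy", "pynput", "pystray", "PIL", "tkinter"]


def check_environment(model_root: str = "voice_models") -> dict:
    """Report which modules, binaries, and model folders are present."""
    report = {}
    for name in PYTHON_MODULES:
        # find_spec returns None when a top-level module is not installed.
        report[f"module:{name}"] = importlib.util.find_spec(name) is not None
    report["binary:xdotool"] = shutil.which("xdotool") is not None
    root = Path(model_root)
    # At least one sub-directory is taken to mean a model was downloaded.
    report["models"] = root.is_dir() and any(p.is_dir() for p in root.iterdir())
    return report
```

Printing the result, for example with `for key, ok in check_environment().items(): print(key, "OK" if ok else "MISSING")`, gives a quick overview before digging into a specific problem.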

🤝 Contributing

Contributions welcome! Areas for enhancement:

Additional language models
Custom wake word detection
Audio feedback on toggle
Hardware button integration
Voice command macros
GUI configuration tool

To contribute:

Fork the repository
Create a feature branch: git checkout -b feature/amazing-feature
Commit your changes: git commit -m 'Add amazing feature'
Push to the branch: git push origin feature/amazing-feature
Open a Pull Request

📄 License

MIT License - See LICENSE file for details

🙏 Acknowledgments

Vosk - Offline speech recognition engine
pynput - Cross-platform input control
pystray - System tray integration
xdotool - X11 automation

🔮 Roadmap

Voice command macros
Custom wake word support
GUI settings editor
Hardware button integration (foot pedal, MIDI)
Audio feedback options
Additional language models
Plugin system for custom commands
Cloud sync for settings (optional)

♠️ Nyro - Structural foundation, modular architecture
🌿 Aureon - Flow preservation, accessibility focus
🎸 JamAI - Musical encoding, harmonic design

Built with recursive intention. Speak, toggle, flow.

Release history

0.1.3 - Nov 16, 2025
0.1.2 - Nov 9, 2025
0.1.1 - Nov 9, 2025
0.1.0 - Nov 9, 2025

Download files

Source Distribution

termivox-0.1.2.tar.gz (27.8 kB)

Upload date: Nov 9, 2025
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.12

Hashes for termivox-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`5adb1c2e83f8f42459230000f6c8ef8537a2b573c6fd605b5aae71cf7fceacbc`
MD5	`fa799d5997bbc0b737d91fd46e006e60`
BLAKE2b-256	`55df63acfb68b5993562b8880b977b3303559938ae5b11c2060b4777b5e7e6ed`

Built Distribution

termivox-0.1.2-py3-none-any.whl (27.7 kB)

Upload date: Nov 9, 2025
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.12

Hashes for termivox-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`f0acc8c82f32a2f40c8e072b30f625c37db586abd324315d7bef4981a435c68c`
MD5	`ce5cc30d71cb3c11dca626192279f314`
BLAKE2b-256	`9c96e7aaa6c045f68261f8b10e35cc646cd5cca83c6b8ded6203aa7a64894aba`
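
A downloaded file can be checked against the SHA256 digests listed above with Python's standard `hashlib`. The helper names `sha256_of` and `matches` are hypothetical and exist only for this sketch.

```python
import hashlib


def sha256_of(path: str, chunk_size: int = 65536) -> str:
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path: str, expected_hex: str) -> bool:
    """Compare a file's digest with the published value, ignoring case."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For the source distribution, the expected value would be the SHA256 string shown for termivox-0.1.2.tar.gz; a mismatch suggests a corrupted or altered download.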
