A seamless voice dictation system for Linux

These details have not been verified by PyPI

Project links

Project description

Vocalinux

Voice-to-text for Linux, finally done right!

Vocalinux Users

A seamless free open-source private voice dictation system for Linux, comparable to the built-in solutions on macOS and Windows.

🎉 Alpha Release!

We're excited to share Vocalinux with the community. Try it out and let us know what you think!

✨ Features

🎤 Double-tap Ctrl to start/stop voice dictation
⚡ Real-time transcription with minimal latency
🌎 Universal compatibility across all Linux applications
🔒 Offline operation for privacy and reliability (with VOSK)
🤖 Optional Whisper AI support for enhanced accuracy
🎨 System tray integration with visual status indicators
🔊 Audio feedback for recording status
⚙️ Graphical settings dialog for easy configuration

🚀 Quick Install

One-liner Installation (Recommended)

curl -fsSL https://raw.githubusercontent.com/jatinkrmalik/vocalinux/main/install.sh | bash -s -- --tag=v0.3.0-alpha

Note: Installs the latest stable release (v0.3.0-alpha). For the most recent version, check GitHub Releases.

This will:

Clone the repository to ~/.local/share/vocalinux-install
Install all system dependencies
Set up a virtual environment in ~/.local/share/vocalinux/venv
Install both VOSK and Whisper AI speech engines:
- VOSK: installs the vosk Python package from PyPI
- Whisper: installs the openai-whisper package from PyPI, which also pulls in PyTorch (the ML framework Whisper requires)
Create a symlink at ~/.local/bin/vocalinux
Download the default Whisper tiny speech model (~75MB)

⏱️ Note: Installation takes ~5-10 minutes due to Whisper AI dependencies (PyTorch with CUDA support, ~2.3GB).

Whisper with CPU-only PyTorch (no NVIDIA GPU needed):

curl -fsSL https://raw.githubusercontent.com/jatinkrmalik/vocalinux/main/install.sh | bash -s -- --tag=v0.3.0-alpha --whisper-cpu

This installs Whisper with CPU-only PyTorch (~200MB instead of ~2.3GB). Works great for systems without NVIDIA GPU.

For low-RAM systems (8GB or less) - VOSK only:

curl -fsSL https://raw.githubusercontent.com/jatinkrmalik/vocalinux/main/install.sh | bash -s -- --tag=v0.3.0-alpha --no-whisper

This skips Whisper installation entirely and configures VOSK as the default engine.

Alternative: Install from Source

# Clone the repository
git clone https://github.com/jatinkrmalik/vocalinux.git
cd vocalinux

# Run the installer (will prompt for Whisper)
./install.sh

# Or with Whisper support
./install.sh --with-whisper

The installer handles everything: system dependencies, Python environment, speech models, and desktop integration.

After Installation

# If ~/.local/bin is in your PATH (recommended):
vocalinux

# Or activate the virtual environment first:
source ~/.local/bin/activate-vocalinux.sh
vocalinux

# Or run directly:
~/.local/share/vocalinux/venv/bin/vocalinux

Or launch it from your application menu!

Uninstall

# If installed via curl:
curl -fsSL https://raw.githubusercontent.com/jatinkrmalik/vocalinux/main/uninstall.sh | bash

# If installed from source:
./uninstall.sh

📋 Requirements

OS: Ubuntu 22.04+ (other Linux distros may work)
Python: 3.8 or newer
Display: X11 or Wayland
Hardware: Microphone for voice input

🎙️ Usage

Voice Dictation

Double-tap Ctrl to start recording
Speak clearly into your microphone
Double-tap Ctrl again (or pause speaking) to stop

Voice Commands

Command	Action
"new line"	Inserts a line break
"period" / "full stop"	Types a period (.)
"comma"	Types a comma (,)
"question mark"	Types a question mark (?)
"exclamation mark"	Types an exclamation mark (!)
"delete that"	Deletes the last sentence
"capitalize"	Capitalizes the next word

Command Line Options

vocalinux --help              # Show all options
vocalinux --debug             # Enable debug logging
vocalinux --engine whisper    # Use Whisper AI engine
vocalinux --model medium      # Use medium-sized model
vocalinux --wayland           # Force Wayland mode

⚙️ Configuration

Configuration is stored in ~/.config/vocalinux/config.json:

{
  "speech_recognition": {
    "engine": "vosk",
    "model_size": "small",
    "vad_sensitivity": 3,
    "silence_timeout": 2.0
  }
}

You can also configure settings through the graphical Settings dialog (right-click the tray icon).

🔧 Development Setup

# Clone and install in dev mode
git clone https://github.com/jatinkrmalik/vocalinux.git
cd vocalinux
./install.sh --dev

# Activate environment
source venv/bin/activate

# Run tests
pytest

# Run from source with debug
python -m vocalinux.main --debug

📁 Project Structure

vocalinux/
├── src/vocalinux/           # Main application code
│   ├── speech_recognition/  # Speech recognition engines
│   ├── text_injection/      # Text injection (X11/Wayland)
│   ├── ui/                  # GTK UI components
│   └── utils/               # Utility functions
├── tests/                   # Test suite
├── resources/               # Icons and sounds
├── docs/                    # Documentation
└── web/                     # Website source

📖 Documentation

Installation Guide - Detailed installation instructions
User Guide - Complete user documentation
Contributing - Development setup and contribution guidelines

🗺️ Roadmap

~~Custom icon design~~ ✅
~~Graphical settings dialog~~ ✅
~~Whisper AI support~~ ✅
Multi-language support
Application-specific commands
Debian/Ubuntu package (.deb)
Improved Wayland support
Voice command customization

🤝 Contributing

We welcome contributions! Whether it's bug reports, feature requests, or code contributions, please check out our Contributing Guide.

Quick Links

⭐ Support

If you find Vocalinux useful, please consider:

⭐ Starring this repository
🐛 Reporting bugs you encounter
📖 Improving documentation
🔀 Contributing code

📜 License

This project is licensed under the GNU General Public License v3.0 - see the LICENSE file for details.

Made with ❤️ for the Linux community

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.10.1b0 pre-release

Mar 30, 2026

0.10.0b0 pre-release

Mar 26, 2026

0.9.0b0 pre-release

Mar 14, 2026

0.8.0b0 pre-release

Mar 1, 2026

0.7.0b0 pre-release

Feb 23, 2026

0.6.3b0 pre-release

Feb 19, 2026

0.6.2b0 pre-release

Feb 18, 2026

0.6.1b0 pre-release

Feb 12, 2026

0.6.0b0 pre-release

Feb 12, 2026

0.5.0b0 pre-release

Feb 6, 2026

0.4.1a0 pre-release

Jan 29, 2026

This version

0.4.0a0 pre-release

Jan 29, 2026

0.3.0a0 pre-release

Jan 21, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vocalinux-0.4.0a0.tar.gz (773.7 kB view details)

Uploaded Jan 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

vocalinux-0.4.0a0-py3-none-any.whl (869.3 kB view details)

Uploaded Jan 29, 2026 Python 3

File details

Details for the file vocalinux-0.4.0a0.tar.gz.

File metadata

Download URL: vocalinux-0.4.0a0.tar.gz
Upload date: Jan 29, 2026
Size: 773.7 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for vocalinux-0.4.0a0.tar.gz
Algorithm	Hash digest
SHA256	`aec985f69e070df15445c4a8ae8c3bd2081eecb7aa2028d34536f749f334822a`
MD5	`8bcc97562e6bd89bdb34161c8fdc1215`
BLAKE2b-256	`5b135739424f6bdf53e7959b1f53459b219c4a6e0a05b52f078df0bbeb122585`

See more details on using hashes here.

File details

Details for the file vocalinux-0.4.0a0-py3-none-any.whl.

File metadata

Download URL: vocalinux-0.4.0a0-py3-none-any.whl
Upload date: Jan 29, 2026
Size: 869.3 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.10.19

File hashes

Hashes for vocalinux-0.4.0a0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`49bc851c0cada6d96156cb0c48cc4e922dab082a127fac0ec589343d3148ca5e`
MD5	`4a566f629209d3a4985621cae81404f3`
BLAKE2b-256	`31b9e123663132a78e7f1aa1781e286fedd57beec9bcddeb05b40024dd2ff786`

See more details on using hashes here.

vocalinux 0.4.0a0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Vocalinux

Voice-to-text for Linux, finally done right!

✨ Features

🚀 Quick Install

One-liner Installation (Recommended)

Alternative: Install from Source

After Installation

Uninstall

📋 Requirements

🎙️ Usage

Voice Dictation

Voice Commands

Command Line Options

⚙️ Configuration

🔧 Development Setup

📁 Project Structure

📖 Documentation

🗺️ Roadmap

🤝 Contributing

Quick Links

⭐ Support

📜 License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes