D-Bus service providing speech-to-text functionality for GNOME Shell
Project description
GNOME Speech2Text Service
A D-Bus service that provides speech-to-text functionality for the GNOME Shell Speech2Text extension.
Overview
This service handles the actual speech recognition processing using OpenAI's Whisper model locally. It runs as a D-Bus service and communicates with the GNOME Shell extension to provide seamless speech-to-text functionality.
Features
- Real-time speech recognition using OpenAI Whisper
- D-Bus integration for seamless desktop integration
- Audio recording with configurable duration
- Multiple output modes (clipboard, text insertion, preview)
- Error handling and recovery
- Session management for multiple concurrent recordings
Installation
System Dependencies
This service requires several system packages to be installed (e.g. ffmpeg, clipboard tools). See the main README.md for the complete list of system dependencies.
Service Installation
The service is available on PyPI and is typically installed into a per-user virtual environment by the extension’s installer.
pip install gnome-speech2text-service
PyPI Package: gnome-speech2text-service
Or from the source repository:
cd service/
pip install .
D-Bus Registration
After installation, you need to register the D-Bus service and desktop entry. Recommended options:
- Using the repository (local source install)
# From the repo root
./src/install-service.sh --local
- Using the bundled installer (PyPI install)
# From the repo root
./src/install-service.sh --pypi
The installer will:
- Create a per-user virtual environment under
~/.local/share/gnome-speech2text-service/venv - Install the
gnome-speech2text-servicepackage - Register the D-Bus service at
~/.local/share/dbus-1/services/org.gnome.Shell.Extensions.Speech2Text.service - Create a desktop entry at
~/.local/share/applications/gnome-speech2text-service.desktop
Usage
Starting the Service
The service is D-Bus activated and starts automatically when requested by the extension. You can also start it manually:
# If the entry point is on PATH (pip install)
gnome-speech2text-service
# Or via the per-user wrapper created by the installer
~/.local/share/gnome-speech2text-service/gnome-speech2text-service
Configuration
The service uses OpenAI's Whisper model locally for speech recognition. No API key is required. All processing happens on your local machine for complete privacy.
D-Bus Interface
The service provides the following D-Bus interface (stable; used by the GNOME extension):
Methods:
StartRecording(duration, copy_to_clipboard, preview_mode)→recording_idStopRecording(recording_id)→successCancelRecording(recording_id)→successTypeText(text, copy_to_clipboard)→successGetServiceStatus()→statusCheckDependencies()→all_available, missing_dependencies[]
Signals:
RecordingStarted(recording_id)RecordingStopped(recording_id, reason)TranscriptionReady(recording_id, text)RecordingError(recording_id, error_message)TextTyped(text, success)
Requirements
- Python: 3.8–3.13 (Python 3.14+ not supported yet)
- System: Linux with D-Bus support
- Desktop: GNOME Shell (tested on GNOME 46+)
License
This project is licensed under the GPL-3.0-or-later license. See the LICENSE file for details.
Contributing
Contributions are welcome! Please see the main repository for contribution guidelines: https://github.com/kavehtehrani/gnome-speech2text
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file gnome_speech2text_service-1.1.0.tar.gz.
File metadata
- Download URL: gnome_speech2text_service-1.1.0.tar.gz
- Upload date:
- Size: 25.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
7e74112083ddf578518a1f1e381d66030dcc2dc5b1dfb09a24ea9f34b25c19f8
|
|
| MD5 |
a9bbf4682db640ec808f2f2fe0ad46ab
|
|
| BLAKE2b-256 |
639ce2151b50d55da42b5e9a9aae8c811ea3b5e0a86314793b000d1f42e058a2
|
File details
Details for the file gnome_speech2text_service-1.1.0-py3-none-any.whl.
File metadata
- Download URL: gnome_speech2text_service-1.1.0-py3-none-any.whl
- Upload date:
- Size: 24.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.12.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
5da33c284724a07416b3fe979a0409f120c4b4fdb7745cda072ac89b3f726eb7
|
|
| MD5 |
a6a4ebba6d3195f116e03428d1e59fbc
|
|
| BLAKE2b-256 |
df43ed6bdb43aa8a0cfa7035d2c0d6a31ca4218175bb3ad05e8c9b208ff93f4d
|