Hivemind Voice Relay
Project description
HiveMind Voice Relay
Local wakeword detection; STT and TTS handled remotely by hivemind-core running the hivemind-audio-binary-protocol plugin.
Voice Relay runs the microphone, VAD, and wakeword engine on-device — keeping wake-word detection private and low-latency — while forwarding audio to hivemind-core (running the hivemind-audio-binary-protocol plugin) for speech-to-text, and receiving synthesised audio back for playback. No STT or TTS models run on the device.
Full documentation: docs/
Satellite spectrum
| Satellite | Mic | VAD | Wake word | STT | TTS | Connects to |
|---|---|---|---|---|---|---|
| HiveMind-cli | — | — | — | — | — | hivemind-core |
| hivemind-mic-satellite | local | local | server | server | server | core + audio-binary-protocol |
| HiveMind-voice-relay (this repo) | local | local | local | server | server | core + audio-binary-protocol |
| HiveMind-voice-sat | local | local | local | local | local | hivemind-core |
Voice Relay keeps wakeword detection on-device (low latency, no audio leaves until activation) while STT and TTS run on the hive. The point is not mainly resource savings — it is what it means for the hive to own speech services (see below).
Why voice-relay — HiveMind as a service
Voice-relay's real lesson is architectural. STT and TTS run inside the hive (the hivemind-audio-binary-protocol plugin on hivemind-core) and sit behind the same access-key authentication as the rest of the mesh. For a developer, the consequences matter more than the saved CPU:
- The hive owns STT/TTS. A voice-sat can point at any STT/TTS plugin it likes — including a public
ovos-stt-plugin-server/ovos-tts-plugin-server. A relay cannot: it does not choose the engine, model, or voice. The hive operator decides, centrally and uniformly, for every relay that connects. - Speech is authenticated. STT/TTS are not an open endpoint anyone can hit — access is gated by the client's HiveMind credentials, exactly like every other message on the protocol.
- It is the reference for the b64 speech API. The relay sends audio for STT and receives speech for TTS as base64-encoded WAV over the HiveMessage bus (
recognizer_loop:b64_transcribe,speak:b64_audio) — the same work mic-satellite does over the binary protocol. Relay illustrates the b64 path; it could equally use binary.
Choose voice-relay when you want HiveMind to operate STT/TTS as a governed, authenticated service — uniform and centrally controlled — with wakeword kept local for latency and privacy. Lower device resource use is a consequence, not the goal.
Server requirements
⚠️ Your
hivemind-coreserver must have the hivemind-audio-binary-protocol binary plugin installed. Plainhivemind-coredoes not handle STT or TTS — connecting to it will result in silence (no transcription, no spoken response).Alternatively, run
hivemind-coretogether withovos-audioandovos-dinkum-listenerto provide the same capabilities.
Install
pip install HiveMind-voice-relay
60-second quickstart
1. Configure identity (one-time):
hivemind-client set-identity --key YOUR_ACCESS_KEY --password YOUR_PASSWORD --host wss://your-listener-host
2. Run:
hivemind-voice-relay
3. Speak your wake word. The default wake word is hey mycroft (configured in ~/.config/mycroft/mycroft.conf).
CLI flags
Usage: hivemind-voice-relay [OPTIONS]
connect to hivemind-core running the audio binary protocol
Options:
--host TEXT hivemind host (ws:// or wss://)
--key TEXT Access Key
--password TEXT Password for key derivation
--port INTEGER HiveMind port number (default: 5678)
--selfsigned Accept self-signed TLS certificates
--siteid TEXT Location identifier for message context
--help Show this message and exit.
All flags fall back to values stored by hivemind-client set-identity.
Configuration
Voice Relay reads ~/.config/mycroft/mycroft.conf (standard OVOS config).
| Plugin type | Config key | Default | Required |
|---|---|---|---|
| Microphone | microphone.module |
ovos-microphone-plugin-alsa |
Yes |
| VAD | listener.VAD.module |
ovos-vad-plugin-silero |
Yes |
| Wake word | listener.wake_word |
hey_mycroft |
Yes |
| G2P | tts.g2p_module |
— | No |
| Media Playback | Audio.backends |
— | No |
| OCP Plugins | — | — | No |
| Dialog Transformers | — | — | No |
| TTS Transformers | — | — | No |
| PHAL | — | — | No (auto-loaded if installed) |
See docs/configuration.md for full details and plugin swap instructions.
Features and limitations
Built on ovos-simple-listener. Compared to the full voice-satellite:
Present:
- Microphone capture, VAD, wakeword detection — all local
- Audio forwarded to hivemind-core (hivemind-audio-binary-protocol plugin) for STT (base64-encoded WAV over the HiveMessage bus)
- TTS audio synthesised server-side and streamed back for local playback
- PHAL (platform hardware abstraction) auto-loaded if installed
- Standard OVOS plugin system for mic, VAD, and wakeword
Not supported (use HiveMind-voice-sat if you need these):
- Local STT / TTS plugins
- Audio Transformers
- Continuous / Hybrid / Recording / Sleep listening modes
- Multiple wake words
Related
| Project | Role |
|---|---|
| hivemind-audio-binary-protocol | Required hivemind-core plugin — provides server-side STT + TTS |
| hivemind-core | Base mesh node (no STT/TTS) |
| HiveMind-cli | Text-only satellite |
| hivemind-mic-satellite | Thinnest audio satellite (no local wakeword) |
| HiveMind-voice-sat | Full local stack satellite |
| hivemind-bus-client | HiveMind WebSocket client library |
| ovos-simple-listener | Lightweight listener library used internally |
License
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file hivemind_voice_relay-1.1.0a2.tar.gz.
File metadata
- Download URL: hivemind_voice_relay-1.1.0a2.tar.gz
- Upload date:
- Size: 14.6 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
9a285d155659e7e66951aa5d62c9f8d66db3f7ef98d3825f05d0efdb0776e433
|
|
| MD5 |
0cca99a8d5886daa5763eac0da538205
|
|
| BLAKE2b-256 |
e1d79b215012f90cf42c50eade77336cf24d4d04d61b9f066d53542fbd7e04ec
|
File details
Details for the file hivemind_voice_relay-1.1.0a2-py3-none-any.whl.
File metadata
- Download URL: hivemind_voice_relay-1.1.0a2-py3-none-any.whl
- Upload date:
- Size: 13.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.13.12
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
91badc3a097721b6c8a9b82e3c51aac24657cfddcb531c711db68c0d53e884da
|
|
| MD5 |
08ffb56b457d10f4dee1c664449d4f53
|
|
| BLAKE2b-256 |
17a3ea4ab7fb54288143fb6957007d40d6ace0f859c16d715564ad1af32953a3
|