Local macOS menubar voice assistant powered by MLX Whisper, Ollama, and Kokoro TTS
Project description
Ollama Vox
A local macOS menubar voice assistant that records speech, transcribes with MLX Whisper, gets responses from Ollama, and speaks back using Kokoro TTS.
Features
- Menubar app for easy access
- Local speech-to-text with
mlx-whisper - Local text generation with Ollama
- Local text-to-speech with
mlx-audio+ Kokoro voices - Status panel showing model info, rolling latency stats, and response history
Requirements
- macOS (Apple Silicon recommended)
- Python
3.12+ - Ollama installed and running locally
Installation
The easiest way to install is via pip or uv:
pip install ollama-vox
# or if using uv
uv tool install ollama-vox
If installing from source for development:
git clone https://github.com/maadhav-codes/ollama-vox.git
cd ollama-vox
uv sync
First-Time Setup
Download recommended STT + TTS models:
uv run ollama-vox --setup
Run
Start the menubar application:
uv run ollama-vox
Usage
- Click the microphone icon in your macOS menubar.
- Select Start Listening to speak.
- Select Stop Listening when you are done. The app will process your speech and respond with audio.
- Click Show Status to view latency, recent responses, and active models.
Configuration
Settings are managed in config.yaml.
audio: Adjust Voice Activity Detection (VAD) and recording limits.stt.model: Path to the local Whisper model.ollama: Set the endpoint, model name, and temperature.tts: Configure the voice, speaking rate, and Kokoro model path.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file ollama_vox-1.0.1.tar.gz.
File metadata
- Download URL: ollama_vox-1.0.1.tar.gz
- Upload date:
- Size: 20.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
c719b896ff4065f252cb08cfd831fb81a4f75bea5f219899444e2a9ffbaa0f37
|
|
| MD5 |
47982b0118d1dd35b36e36015b1053b2
|
|
| BLAKE2b-256 |
3cf150475998b0a2c9ed726c239e7a134d2668cb48f1b1f7243bcf776aa629ea
|
File details
Details for the file ollama_vox-1.0.1-py3-none-any.whl.
File metadata
- Download URL: ollama_vox-1.0.1-py3-none-any.whl
- Upload date:
- Size: 22.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.12.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
1a2e30312c89d6544cbb7ffcb881a9763d3be2190e59c664ffdd92dab17ca465
|
|
| MD5 |
7156c306fff32eb116e66628fedeea24
|
|
| BLAKE2b-256 |
ebe69cc98753495d656d8858afecf2ecdab67d77a3f19b6644f3f8c991e49a34
|