Local macOS menubar voice assistant powered by MLX Whisper, Ollama, and Kokoro TTS

Ollama Vox

A local macOS menubar voice assistant that records your speech, transcribes it with MLX Whisper, sends the transcript to Ollama for a response, and speaks the reply aloud using Kokoro TTS.

Features

  • Menubar app for easy access
  • Local speech-to-text with mlx-whisper
  • Local text generation with Ollama
  • Local text-to-speech with mlx-audio + Kokoro voices
  • Status panel showing model info, rolling latency stats, and response history
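
As an illustration of what "rolling latency stats" can mean, here is a minimal sketch that keeps only the most recent samples and summarizes them. The class name, the window size, and the summary fields are assumptions made for this example, not the app's actual implementation.

```python
from collections import deque
from statistics import mean


class RollingLatency:
    """Keep the most recent latency samples and summarize them (hypothetical helper)."""

    def __init__(self, window: int = 20) -> None:
        # A deque with maxlen drops the oldest sample once the window is full.
        self._samples = deque(maxlen=window)

    def add(self, seconds: float) -> None:
        self._samples.append(seconds)

    def summary(self) -> dict:
        if not self._samples:
            return {"count": 0, "mean": 0.0, "max": 0.0}
        return {
            "count": len(self._samples),
            "mean": mean(self._samples),
            "max": max(self._samples),
        }
```

A fixed-size window keeps memory constant and lets the numbers reflect recent behavior rather than the whole session.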

Requirements

  • macOS (Apple Silicon recommended)
  • Python 3.12+
  • Ollama installed and running locally
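
To confirm that Ollama is running before you launch the app, you can query its local HTTP API. The sketch below assumes Ollama's default address (http://localhost:11434) and uses its /api/tags endpoint, which lists locally available models. The function name is made up for this example and is not part of ollama-vox.

```python
import json
import urllib.request


def list_ollama_models(base_url: str = "http://localhost:11434", timeout: float = 2.0):
    """Return the names of local Ollama models, or None if the server is unreachable."""
    try:
        with urllib.request.urlopen(f"{base_url}/api/tags", timeout=timeout) as resp:
            payload = json.load(resp)
    except (OSError, ValueError):
        # OSError covers refused connections and timeouts;
        # ValueError covers a malformed URL or a non-JSON response.
        return None
    return [m.get("name", "") for m in payload.get("models", [])]


if __name__ == "__main__":
    models = list_ollama_models()
    if models is None:
        print("Ollama does not appear to be running.")
    else:
        print("Available models:", ", ".join(models) or "(none pulled yet)")
```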

Installation

The easiest way to install is via pip or uv:

pip install ollama-vox
# or if using uv
uv tool install ollama-vox

If installing from source for development:

git clone https://github.com/maadhav-codes/ollama-vox.git
cd ollama-vox
uv sync

First-Time Setup

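Download the recommended speech-to-text (STT) and text-to-speech (TTS) models. The commands below use uv run, which applies to a source checkout; if you installed with pip or uv tool install, the ollama-vox command should be available directly, without the uv run prefix.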

uv run ollama-vox --setup

Run

Start the menubar application:

uv run ollama-vox

Usage

  1. Click the microphone icon in your macOS menubar.
  2. Select Start Listening to speak.
  3. Select Stop Listening when you are done. The app will process your speech and respond with audio.
  4. Click Show Status to view latency, recent responses, and active models.
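
Conceptually, each Start/Stop Listening cycle is one pass through three stages: transcription, text generation, and speech. The sketch below shows that flow with per-stage timing. The function name and the three stage callables are placeholders for this example; in the real app those roles are filled by mlx-whisper, Ollama, and Kokoro, whose integration code is not shown here.

```python
import time
from typing import Callable


def run_turn(audio, transcribe: Callable, generate: Callable, speak: Callable):
    """Run one transcribe -> generate -> speak turn and time each stage (hypothetical)."""
    timings = {}

    start = time.perf_counter()
    text = transcribe(audio)          # speech-to-text stage
    timings["stt"] = time.perf_counter() - start

    start = time.perf_counter()
    reply = generate(text)            # language-model stage
    timings["llm"] = time.perf_counter() - start

    start = time.perf_counter()
    speak(reply)                      # text-to-speech stage
    timings["tts"] = time.perf_counter() - start

    return reply, timings


if __name__ == "__main__":
    # Stub stages stand in for the real models.
    reply, timings = run_turn(b"", lambda a: "hello", lambda t: t.upper(), print)
    print(timings)
```

Passing the stages in as callables keeps the flow testable without loading any models.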

Configuration

Settings are managed in config.yaml.

  • audio: Adjust Voice Activity Detection (VAD) and recording limits.
  • stt.model: Path to the local Whisper model.
  • ollama: Set the endpoint, model name, and temperature.
  • tts: Configure the voice, speaking rate, and Kokoro model path.
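
If you edit config.yaml by hand, a quick sanity check can catch a missing section. The sketch below works on a dictionary that has already been loaded from the file (for example with a YAML parser such as PyYAML's yaml.safe_load, which is an assumption about your environment). It checks only the names listed above: the four top-level sections and stt.model. The function name is hypothetical, and the keys inside audio, ollama, and tts are deliberately not checked because their exact names are not documented here.

```python
REQUIRED_SECTIONS = ("audio", "stt", "ollama", "tts")


def find_config_problems(config: dict) -> list:
    """Return a list of human-readable problems found in a loaded config (hypothetical)."""
    problems = [
        f"missing section: {name}"
        for name in REQUIRED_SECTIONS
        if name not in config
    ]
    # stt.model should point at the local Whisper model.
    stt = config.get("stt")
    if isinstance(stt, dict) and not stt.get("model"):
        problems.append("stt.model is not set")
    return problems
```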
