Apple-Silicon-native TTS and streaming STT for Home Assistant and OpenAI-compatible clients

Project description

wyoming-mlx

Apple-Silicon-native TTS (Kokoro) and streaming STT (Whisper via WhisperLiveKit) for Home Assistant and OpenAI-compatible clients.

Why?

If you have a Mac on your network, it can be your voice server. wyoming-mlx turns it into a fast, fully local speech-to-text and text-to-speech service:

Private by construction. Audio never leaves your network — no cloud speech APIs, no per-request pricing, nothing to subscribe to. Models run entirely on your machine.
Uses hardware you already own. Apple Silicon's GPU and unified memory run Whisper and Kokoro comfortably alongside whatever else the Mac is doing — no dedicated GPU server, no idle power draw of a CUDA box.
One service, two ecosystems. Home Assistant talks to it natively over the Wyoming protocol (drop-in replacement for wyoming-faster-whisper/wyoming-piper satellites), while anything that speaks the OpenAI audio API — scripts, editors, chat UIs — can use the same instance via /v1/audio/transcriptions and /v1/audio/speech.
Set-and-forget. Install with Homebrew, run as a launchd service via brew services, and models are fetched once into the Hugging Face cache.

Speech-to-text streams through WhisperLiveKit's SimulStreaming (AlignAtt) policy on MLX, so partial transcripts arrive while you're still speaking; text-to-speech runs Kokoro on Metal via PyTorch. The real backends therefore require an Apple Silicon Mac. Everything else (config, HTTP API, Wyoming protocol handling, fake backends, tests) is portable, and CI runs on Linux against the fake backends.

Install (Homebrew)

brew tap rnorth/tap
brew install wyoming-mlx

Run it in the foreground with wyoming-mlx, or as a launchd service:

brew services start wyoming-mlx

Logs go to $(brew --prefix)/var/log/wyoming-mlx.log. Apple Silicon only.

Warning for upgraders: the models.whisper config value (TOML [models] whisper = ..., CLI --whisper-model, env WYOMING_MLX_MODELS__WHISPER) changed meaning in this release. It used to be a Hugging Face repo ID (e.g. mlx-community/distil-whisper-large-v3); it is now a WhisperLiveKit model-size name: tiny, base, small, medium, large-v3, or large-v3-turbo (default large-v3-turbo). An old repo-ID value (anything containing /) is rejected at startup with a clear error. Update your config, for example:
[models]
# before:  whisper = "mlx-community/distil-whisper-large-v3"
whisper = "large-v3-turbo"
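
The startup check described above can be sketched roughly as follows. This is only an illustration of the rule (reject anything containing /, accept the listed size names); the function name validate_whisper_model and the error messages are hypothetical and are not taken from the project's code.

```python
# Hypothetical sketch of the model-name rule described in the warning above.
# The allowed names come from the README; everything else is an assumption.
ALLOWED_WHISPER_SIZES = (
    "tiny",
    "base",
    "small",
    "medium",
    "large-v3",
    "large-v3-turbo",
)


def validate_whisper_model(value: str) -> str:
    """Return the value if it is a known size name, otherwise raise ValueError."""
    if "/" in value:
        # Old-style Hugging Face repo IDs always contain a slash.
        raise ValueError(
            f"{value!r} looks like a Hugging Face repo ID; "
            f"use one of {', '.join(ALLOWED_WHISPER_SIZES)}"
        )
    if value not in ALLOWED_WHISPER_SIZES:
        raise ValueError(
            f"unknown Whisper model size {value!r}; "
            f"use one of {', '.join(ALLOWED_WHISPER_SIZES)}"
        )
    return value
```

Checking for the slash first lets an upgrader see a message about the changed meaning rather than a generic "unknown size" error.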

Quick start (dev)

mise install
uv sync
uv run pytest

Works on Linux too: CPU-only torch is selected automatically (the fake backends need no GPU). Integration tests against real models are skipped by default; run them with uv run pytest --integration (Apple Silicon only).

Run locally (fake backends)

uv run python scripts/dev_run.py

The dev server uses the API key dev.

Run locally (real MLX backends)

uv run wyoming-mlx

By default it loads:

whisper-large-v3-turbo (MLX, streaming) on Wyoming port 10300 / HTTP /v1/audio/transcriptions
Kokoro-82M (PyTorch on Metal) on Wyoming port 10200 / HTTP /v1/audio/speech
HTTP on port 10400 with API-key auth

Models are stored in the Hugging Face cache. The STT model is loaded at startup (so a bad model name or download failure surfaces immediately); the Kokoro TTS model downloads on first synthesis.

API keys

HTTP endpoints require a bearer token. Keys are read at startup from ~/.config/wyoming-mlx/apikeys (override with --http-api-keys-file), one key per line; # comments are allowed. The file should be mode 0600. If the file is missing or empty, every authenticated request is rejected with 401.

Note that the HTTP API listens on all interfaces by default (set WYOMING_MLX_HTTP__HOST=127.0.0.1 to restrict it), and GET /v1/models is unauthenticated, matching OpenAI API behaviour.

mkdir -p ~/.config/wyoming-mlx
(umask 077; openssl rand -hex 32 > ~/.config/wyoming-mlx/apikeys)
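
To make the key-file format concrete, here is a minimal sketch of how such a file could be parsed and checked against an Authorization header. It is not the project's implementation: the names parse_api_keys and is_authorized are hypothetical, and it assumes that # comments occupy a whole line.

```python
import hmac


def parse_api_keys(text: str) -> set[str]:
    """Parse key-file contents: one key per line, blank lines and # lines skipped."""
    keys = set()
    for line in text.splitlines():
        stripped = line.strip()
        # Assumption: a comment is a line whose first non-blank character is '#'.
        if not stripped or stripped.startswith("#"):
            continue
        keys.add(stripped)
    return keys


def is_authorized(authorization_header, keys) -> bool:
    """Return True only for 'Bearer <key>' where <key> is in the configured set."""
    if not authorization_header or not keys:
        # A missing header or an empty key set rejects every request.
        return False
    scheme, _, token = authorization_header.partition(" ")
    if scheme.lower() != "bearer":
        return False
    candidate = token.strip().encode("utf-8")
    # compare_digest avoids leaking key contents through timing differences.
    return any(hmac.compare_digest(candidate, key.encode("utf-8")) for key in keys)
```

With an empty key set the check always fails, which mirrors the documented behaviour of answering 401 when the file is missing or empty.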

HTTP API

List models

curl http://localhost:10400/v1/models

Transcribe an audio file

curl http://localhost:10400/v1/audio/transcriptions \
  -H "Authorization: Bearer $API_KEY" \
  -F file=@some.wav
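
The same upload can be made from Python without third-party packages. The sketch below builds the multipart request by hand using only the standard library; it mirrors the curl call above (a single file field) and assumes the server accepts a part labelled audio/wav. The helper name build_transcription_request is hypothetical.

```python
import urllib.request
import uuid


def build_transcription_request(base_url, api_key, filename, audio_bytes):
    """Build a multipart/form-data POST carrying one 'file' field."""
    boundary = uuid.uuid4().hex
    head = (
        f"--{boundary}\r\n"
        f'Content-Disposition: form-data; name="file"; filename="{filename}"\r\n'
        "Content-Type: audio/wav\r\n"
        "\r\n"
    ).encode("utf-8")
    tail = f"\r\n--{boundary}--\r\n".encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/v1/audio/transcriptions",
        data=head + audio_bytes + tail,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": f"multipart/form-data; boundary={boundary}",
        },
        method="POST",
    )


def transcribe(request):
    """Send the request and return the raw response body as text."""
    with urllib.request.urlopen(request) as response:
        return response.read().decode("utf-8")
```

Keeping request construction separate from sending makes the request easy to inspect before it touches the network.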

Synthesize speech

curl http://localhost:10400/v1/audio/speech \
  -H "Authorization: Bearer $API_KEY" \
  -H "Content-Type: application/json" \
  -d '{"input":"Hello there.","voice":"af_heart"}' \
  --output /tmp/out.wav
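
An equivalent of the curl call above in Python, using only the standard library. The JSON fields (input, voice) and the af_heart voice come from the example above; the helper names build_speech_request and synthesize_to_file are hypothetical.

```python
import json
import urllib.request


def build_speech_request(base_url, api_key, text, voice="af_heart"):
    """Build a JSON POST for /v1/audio/speech."""
    payload = json.dumps({"input": text, "voice": voice}).encode("utf-8")
    return urllib.request.Request(
        f"{base_url}/v1/audio/speech",
        data=payload,
        headers={
            "Authorization": f"Bearer {api_key}",
            "Content-Type": "application/json",
        },
        method="POST",
    )


def synthesize_to_file(request, out_path):
    """Send the request and write the returned audio bytes to out_path."""
    with urllib.request.urlopen(request) as response, open(out_path, "wb") as out:
        out.write(response.read())
```

A call such as synthesize_to_file(build_speech_request("http://localhost:10400", api_key, "Hello there."), "/tmp/out.wav") would then correspond to the curl command, assuming the server is running and api_key holds a valid key.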

Home Assistant integration

Settings → Integrations → Wyoming Protocol → Add:

STT: <host>:10300
TTS: <host>:10200

The Wyoming ports use no API keys and no TLS, following the Home Assistant convention of a trusted LAN.

Streaming transcripts (live partial results) require Home Assistant 2025.7 or newer; older versions still receive the final transcript exactly as before.

Configuration

Pass --config /path/to/config.toml or set env vars with the WYOMING_MLX_ prefix and __ for nesting (e.g. WYOMING_MLX_HTTP__PORT=10401). See src/wyoming_mlx/config.py for the full schema.
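
The env-var naming rule can be illustrated with a small sketch that turns prefixed variables into a nested dictionary. This is not how the project loads its settings (see config.py for that); the function env_to_nested is hypothetical, and lowercasing the keys is an assumption made for the illustration.

```python
def env_to_nested(environ, prefix="WYOMING_MLX_"):
    """Map PREFIX_A__B=value entries to {"a": {"b": "value"}}."""
    result = {}
    for name, value in environ.items():
        if not name.startswith(prefix):
            continue  # unrelated variables are ignored
        # Double underscores separate nesting levels.
        path = name[len(prefix):].lower().split("__")
        node = result
        for key in path[:-1]:
            node = node.setdefault(key, {})
        # Values stay strings here; a real loader would coerce types.
        node[path[-1]] = value
    return result
```

For example, WYOMING_MLX_HTTP__PORT=10401 ends up under the http key as port, the same location a [http] table with a port entry would occupy in a TOML file under this assumed layout.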

License

Apache-2.0

Project details

Maintainers

rich.north

Release history

0.2.0 (this version): Jun 13, 2026
0.1.0: Jun 12, 2026

Download files

Source Distribution

wyoming_mlx-0.2.0.tar.gz (183.1 kB)

Uploaded Jun 13, 2026 Source

Built Distribution

wyoming_mlx-0.2.0-py3-none-any.whl (25.2 kB)

Uploaded Jun 13, 2026 Python 3

File details

Details for the file wyoming_mlx-0.2.0.tar.gz.

File metadata

Download URL: wyoming_mlx-0.2.0.tar.gz
Upload date: Jun 13, 2026
Size: 183.1 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.21

File hashes

Hashes for wyoming_mlx-0.2.0.tar.gz
Algorithm	Hash digest
SHA256	`336ce384fc38b867cc9e57abc0f8a118d0f7d1f861ebff20794d78ca5cb505fb`
MD5	`d9c41d917216decb77efdeea055c5b31`
BLAKE2b-256	`656bfad7ce65a2ea29a51fde8590d0f61b6c004bf8de51598627f47c67e34395`

File details

Details for the file wyoming_mlx-0.2.0-py3-none-any.whl.

File metadata

Download URL: wyoming_mlx-0.2.0-py3-none-any.whl
Upload date: Jun 13, 2026
Size: 25.2 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: uv/0.11.21

File hashes

Hashes for wyoming_mlx-0.2.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b02eec8e9a6ad1d59d304769b78c30b75600ffbea3c0326cb86219718a057827`
MD5	`a6f947ed40e724131bb022916fb5cb79`
BLAKE2b-256	`ae4be3aee05c3de792fa5d8580e8fcd66c26a881ac86d606fc9ebbecc3ff9d1b`
