CLI for Omi Med STT v1 medical speech-to-text

These details have not been verified by PyPI

Project description

Omi Med STT Runtime

Runtime CLI for Omi Med STT v1, an English medical speech-to-text model.

This repository contains runtime code only. It does not contain model weights, private benchmark data, or training data.

Runtimes

omi-med-stt supports three runtime paths:

Runtime	Best for	Artifact
`mlx`	Apple Silicon Macs	`omi-health/omi-med-stt-v1-mlx`
`cpp`	Windows, Mac Intel, Linux CPU, and ggml GPU backends	`omi-health/omi-med-stt-v1-gguf`
`nemo`	NVIDIA CUDA servers and canonical NeMo checkpoint use	`omi-health/omi-med-stt-v1`

The source-of-truth model is the NeMo checkpoint. MLX and GGUF are runtime exports.

Install

From PyPI:

pip install -U omi-med-stt

From this repository:

pip install git+https://github.com/Omi-Health/omi-med-stt-runtime.git

For Apple Silicon / MLX:

pip install "omi-med-stt[mlx] @ git+https://github.com/Omi-Health/omi-med-stt-runtime.git"

For CUDA/Linux NeMo:

pip install "omi-med-stt[nemo] @ git+https://github.com/Omi-Health/omi-med-stt-runtime.git"

Basic Usage

Simple path:

omi-med-stt audio.wav

Explicit MLX:

omi-med-stt audio.wav --runtime mlx

Explicit NeMo:

omi-med-stt audio.wav --runtime nemo

Explicit parakeet.cpp / GGUF:

omi-med-stt audio.wav --runtime cpp

JSON output:

omi-med-stt audio.wav --json

Dependency/runtime check:

omi-med-stt check

parakeet.cpp / GGUF Runtime

The cpp runtime is powered by parakeet.cpp, a C++/ggml inference engine for NVIDIA Parakeet ASR models.

Omi Med STT v1 includes a post-Conformer medical adapter. Until this adapter extension is upstreamed, omi-med-stt builds parakeet.cpp with the adapter patch included in this repository and caches the resulting libparakeet shared library plus parakeet-cli.

The default cpp path uses the parakeet.cpp C API directly. For long audio it keeps the model loaded once and sends in-memory 16 kHz PCM chunks to libparakeet, avoiding repeated model setup and temporary chunk WAV reads. parakeet-cli remains available as a fallback and for developer debugging.

Normal use:

omi-med-stt audio.wav --runtime cpp

Pre-build the C++ runtime explicitly:

omi-med-stt install-cpp

Choose a backend:

omi-med-stt install-cpp --cpp-backend cpu
omi-med-stt install-cpp --cpp-backend metal
omi-med-stt install-cpp --cpp-backend cuda

Manual override remains available for developers:

omi-med-stt audio.wav --runtime cpp --parakeet-cli /path/to/parakeet-cli

Runtime toggles:

# force the old subprocess path
OMI_MED_STT_CPP_DISABLE_CAPI=1 omi-med-stt audio.wav --runtime cpp

# require C API and fail instead of falling back
OMI_MED_STT_CPP_REQUIRE_CAPI=1 omi-med-stt audio.wav --runtime cpp

# portable build instead of native CPU flags
OMI_MED_STT_GGML_NATIVE=OFF omi-med-stt install-cpp --force --cpp-backend cpu

The cached runtime is built from parakeet.cpp and applies parakeet-cpp-omi-adapter.patch. You need git and cmake available for the first build.

The cpp runtime downloads only the selected GGUF file. It does not download the NeMo .nemo checkpoint or the MLX model.safetensors.

By default, the cpp runtime is pinned to a specific Hugging Face repository revision and verifies the SHA256 checksum for the official f16/q8_0 GGUF files. You can override this for experiments:

omi-med-stt audio.wav --runtime cpp --revision main
omi-med-stt audio.wav --runtime cpp --no-verify-checksum

Long Audio

Omi Med STT v1 is based on Parakeet and can handle long-form audio. Start with the simple path:

omi-med-stt consult.wav

An explicit chunked path is still available for constrained environments:

omi-med-stt transcribe-long consult.wav --chunk-seconds 25 --overlap 3

Model Access

The runtime defaults to these Hugging Face model repositories:

omi-health/omi-med-stt-v1
omi-health/omi-med-stt-v1-mlx
omi-health/omi-med-stt-v1-gguf

If the model repositories are private before launch, authenticate first:

huggingface-cli login

Attribution

This runtime uses or interoperates with:

NVIDIA NeMo / Parakeet, for the base ASR architecture.
parakeet-mlx, for Apple Silicon MLX inference.
parakeet.cpp, for GGUF / C++ / ggml inference.

See NOTICE.md.

License

Runtime code in this repository is MIT licensed.

Model weights are governed separately by the model repositories. Omi Med STT v1 is a derivative of nvidia/parakeet-tdt-0.6b-v2, whose model weights are licensed under CC-BY-4.0.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.20

Jun 7, 2026

0.1.19

Jun 7, 2026

0.1.18

Jun 7, 2026

0.1.17

Jun 7, 2026

0.1.16 yanked

Jun 7, 2026

Reason this release was yanked:

issues

0.1.15

Jun 7, 2026

0.1.14 yanked

Jun 6, 2026

Reason this release was yanked:

issues

0.1.13 yanked

Jun 6, 2026

Reason this release was yanked:

issues

0.1.12 yanked

Jun 6, 2026

Reason this release was yanked:

issues

0.1.11

Jun 6, 2026

This version

0.1.9

Jun 5, 2026

0.1.8

Jun 4, 2026

0.1.5

Jun 4, 2026

0.1.4

Jun 4, 2026

0.1.3

Jun 4, 2026

0.1.0

Jun 4, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

omi_med_stt-0.1.9.tar.gz (20.6 kB view details)

Uploaded Jun 5, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

omi_med_stt-0.1.9-py3-none-any.whl (20.6 kB view details)

Uploaded Jun 5, 2026 Python 3

File details

Details for the file omi_med_stt-0.1.9.tar.gz.

File metadata

Download URL: omi_med_stt-0.1.9.tar.gz
Upload date: Jun 5, 2026
Size: 20.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for omi_med_stt-0.1.9.tar.gz
Algorithm	Hash digest
SHA256	`778d37fb42a2c7d1f5f228120d315542cfaa967fd4f42524d7dc93101f787926`
MD5	`0c5d8fa155310ad7f29bc639ae1095f0`
BLAKE2b-256	`198cf9a4a63149353f1975ef0617b6a5e10b418860851c8f64c9b4aa6f76f965`

See more details on using hashes here.

File details

Details for the file omi_med_stt-0.1.9-py3-none-any.whl.

File metadata

Download URL: omi_med_stt-0.1.9-py3-none-any.whl
Upload date: Jun 5, 2026
Size: 20.6 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.2.0 CPython/3.12.9

File hashes

Hashes for omi_med_stt-0.1.9-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ba893b0b39046ca8a738aa4be65c6836d01dd7d3546cec893b3aca45123753f6`
MD5	`6075c90a262c235e5e2b74e6187be9e2`
BLAKE2b-256	`9bd6818f7ea564eff69be863a6b930d59e4aa20f79dab3b90e231053828344c5`

See more details on using hashes here.

omi-med-stt 0.1.9

Navigation

Verified details

Maintainers

Unverified details

Meta

Project description

Omi Med STT Runtime

Runtimes

Install

Basic Usage

parakeet.cpp / GGUF Runtime

Long Audio

Model Access

Attribution

License

Project details

Verified details

Maintainers

Unverified details

Meta

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes