4-layer anti-hallucination filter for Whisper speech-to-text output

whisper-guard

whisper-guard is a small Python package that removes common Whisper hallucinations before you ship subtitles or transcripts downstream.

Problem

Whisper output can degrade on silence-heavy or low-confidence clips:

repeated phrases
phantom subtitles on silence
short character loops such as 哈哈哈哈 or xyzxyzxyz

This package extracts the anti-hallucination logic from arkiv into a reusable package with a minimal API.

4-Layer Guard

Layer	What it does	Default
L1 Silence	Reject all-silence batches	avg `no_speech_prob` > `0.6`
L2 Segment	Filter weak segments	`no_speech_prob` > `0.8`, `avg_logprob` < `-1.5` (short <1.6s: `-1.7`), `compression_ratio` > `3.0`
L3 Repetition	Reject repetitive text blocks	unique chunk ratio < `0.35`
L4 Char loops	Remove looped patterns	2-4 chars repeated 3+ times
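
The L1, L3, and L4 rules in the table can be sketched in a few lines of standard-library Python. This is an illustration under stated assumptions, not the package's actual implementation: the function names are hypothetical, chunks are assumed to be whitespace-separated tokens, and "remove" is assumed to mean deleting the whole looped run. Only the thresholds come from the table.

```python
import re
from typing import Mapping, Sequence

# Thresholds copied from the defaults table; all names below are hypothetical.
SILENCE_AVG_THRESHOLD = 0.6
UNIQUE_CHUNK_RATIO_MIN = 0.35

# A unit of 2-4 characters followed by at least two more copies (3+ in total).
CHAR_LOOP = re.compile(r"(.{2,4}?)\1{2,}")


def is_silent_batch(segments: Sequence[Mapping[str, float]]) -> bool:
    """L1: reject the batch when the average no_speech_prob is too high."""
    probs = [seg["no_speech_prob"] for seg in segments]
    return bool(probs) and sum(probs) / len(probs) > SILENCE_AVG_THRESHOLD


def unique_chunk_ratio(text: str) -> float:
    """Share of distinct chunks; chunking by whitespace is an assumption."""
    chunks = text.split()
    if not chunks:
        return 1.0
    return len(set(chunks)) / len(chunks)


def is_repetitive(text: str) -> bool:
    """L3: reject text whose unique chunk ratio falls below the minimum."""
    return unique_chunk_ratio(text) < UNIQUE_CHUNK_RATIO_MIN


def strip_char_loops(text: str) -> str:
    """L4: delete looped runs (assumed behaviour: drop the whole run)."""
    return CHAR_LOOP.sub("", text)
```

Chunking by whitespace would not work for unsegmented scripts such as Chinese, so a real implementation presumably chunks differently.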

A/B Test Results

These numbers are from the arkiv guard benchmark set, measured in April 2026.

Config	Repetitions	Reduction	Time
Raw Whisper	16	baseline	47.7s
Guard only	2	-87.5%	46.5s
LLM polish only	16	0%	106.1s
Guard + LLM	2	-87.5%	119.1s
VAD + Guard + LLM	1	-93.8%	128.4s
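
The Reduction column follows from the counts in the second column, taken relative to the raw Whisper baseline of 16. A small sketch of that arithmetic (the function name is hypothetical):

```python
def reduction_percent(baseline: int, count: int) -> float:
    """Relative change versus the baseline, in percent (negative = fewer)."""
    return (count - baseline) / baseline * 100


BASELINE = 16  # raw Whisper row in the table above

for config, count in [("Guard only", 2), ("LLM polish only", 16), ("VAD + Guard + LLM", 1)]:
    print(f"{config}: {reduction_percent(BASELINE, count):.1f}%")
```

Dropping from 16 to 2 is a change of -14/16, or -87.5%; dropping to 1 is -15/16, or -93.75%, which the table rounds to -93.8%.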

Install

pip install whisper-guard

For local development:

pip install -e .

Quick Start

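from faster_whisper import WhisperModel
from whisper_guard import WhisperGuard

# Transcribe as usual; faster-whisper yields segments lazily.
model = WhisperModel("small")
segments, info = model.transcribe("sample.wav")

# Convert each segment to a plain dict and run the batch through the guard.
guard = WhisperGuard()
result = guard.process([segment._asdict() for segment in segments])

# Only use the text if the guard accepted the output.
if result.passed:
    print(result.text)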

API

from whisper_guard import WhisperGuard, GuardConfig, filter_hallucinations

WhisperGuard.process() expects segment dictionaries shaped like:

{
    "text": "hello world",
    "no_speech_prob": 0.12,
    "avg_logprob": -0.44,
    "compression_ratio": 1.2,
    "start": 0.0,   # optional — enables dynamic logprob threshold
    "end": 2.5,      # optional — enables dynamic logprob threshold
}

When `start` and `end` are provided, segments shorter than 1.6s are checked against a separate `avg_logprob` threshold of `-1.7` instead of the default `-1.5`, a rule aimed at hallucinations in brief audio gaps. Segments without timing info fall back to the default threshold.
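
The threshold selection described above, combined with the L2 defaults from the table, might look roughly like this. It is a sketch, not the package's code: the function and constant names are hypothetical, and it assumes that failing any single check is enough to drop a segment.

```python
from typing import Any, Mapping

# Defaults taken from the L2 row of the table; names are hypothetical.
NO_SPEECH_MAX = 0.8
LOGPROB_MIN = -1.5
LOGPROB_MIN_SHORT = -1.7
SHORT_SEGMENT_SECONDS = 1.6
COMPRESSION_RATIO_MAX = 3.0


def logprob_threshold(segment: Mapping[str, Any]) -> float:
    """Pick the avg_logprob threshold based on segment duration, if known."""
    start, end = segment.get("start"), segment.get("end")
    if start is None or end is None:
        return LOGPROB_MIN  # no timing info: use the default threshold
    if end - start < SHORT_SEGMENT_SECONDS:
        return LOGPROB_MIN_SHORT
    return LOGPROB_MIN


def is_weak_segment(segment: Mapping[str, Any]) -> bool:
    """Assumed rule: a segment is dropped if any one check fails."""
    return (
        segment["no_speech_prob"] > NO_SPEECH_MAX
        or segment["avg_logprob"] < logprob_threshold(segment)
        or segment["compression_ratio"] > COMPRESSION_RATIO_MAX
    )
```

With these values, a one-second segment with `avg_logprob` of -1.6 passes the logprob check, while the same segment without `start` and `end` does not.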

Compatible With

faster-whisper
openai-whisper
mlx-whisper
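
These backends return segments in slightly different forms: openai-whisper returns a result dict whose `"segments"` entry is a list of dicts, while faster-whisper yields segment objects with attributes. A small adapter can normalize both into the dictionary shape shown in the API section. The helper below is hypothetical and not part of whisper-guard; it assumes the field names are the same across backends.

```python
from typing import Any, Dict, Mapping

# Fields read by the guard, per the API section.
FIELDS = ("text", "no_speech_prob", "avg_logprob", "compression_ratio", "start", "end")


def to_guard_segment(segment: Any) -> Dict[str, Any]:
    """Copy the guard-relevant fields from a dict or an attribute-style object."""
    if isinstance(segment, Mapping):
        return {key: segment[key] for key in FIELDS if key in segment}
    return {key: getattr(segment, key) for key in FIELDS if hasattr(segment, key)}


# Usage with openai-whisper (not executed here):
#   import whisper
#   result = whisper.load_model("small").transcribe("sample.wav")
#   guarded = WhisperGuard().process([to_guard_segment(s) for s in result["segments"]])
```

Extra keys such as token ids are dropped, so only the fields the guard needs are passed along.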

Optional Vocab Helpers

from whisper_guard.vocab import build_hotwords_prompt, filter_filler_words

Part Of

Built for the arkiv transcription pipeline and split out as a standalone package for reuse.

License

MIT

Version 0.3.0, released Jun 3, 2026
