Local speaker diarization using MLX Whisper (macOS) or faster-whisper (Linux/CUDA) and Pyannote

These details have been verified by PyPI

Project links

Repository

GitHub Statistics

Maintainers

dparedesi

These details have not been verified by PyPI

Project description

VoxScriber

Professional speaker diarization running 100% locally. Supports MLX Whisper on Apple Silicon and faster-whisper on Linux/CUDA, combined with Pyannote 3.1.

VoxScriber Banner

Requirements

Python 3.10+
Hugging Face token (free, one-time model download)
For GPU: CUDA 12 + cuDNN 9 (optional, CPU works too)

That's it. No FFmpeg, no system packages, no sudo required.

Installation

pip install voxscriber

The right Whisper backend is installed automatically:

macOS Apple Silicon: MLX Whisper
Linux/other: faster-whisper (CUDA or CPU)

Setup Hugging Face Token

VoxScriber uses pyannote models which require a Hugging Face token.

Option 1: Interactive setup (recommended)

voxscriber-doctor

This will guide you through accepting the model terms and saving your token securely.

Option 2: Using huggingface-cli

# First, accept terms at https://huggingface.co/pyannote/speaker-diarization-3.1
huggingface-cli login

Your token will be saved to ~/.cache/huggingface/token and used automatically.

Option 3: Environment variable

export HF_TOKEN=your_token_here

Usage

# Basic
voxscriber meeting.m4a

# With known speaker count
voxscriber meeting.m4a --speakers 2

# All formats
voxscriber meeting.m4a --formats md,txt,json,srt,vtt

# Sentence-level subtitle segmentation for editing workflows (default for srt/vtt)
voxscriber meeting.m4a --formats srt,vtt

# Print to console
voxscriber meeting.m4a --print

Python API

from voxscriber import DiarizationPipeline, PipelineConfig

config = PipelineConfig(
    num_speakers=2,
    language="en",
)
pipeline = DiarizationPipeline(config)
transcript = pipeline.process("meeting.m4a")

for segment in transcript.segments:
    print(f"{segment.speaker}: {segment.text}")

Output Formats

Format	Description
`md`	Markdown with bold speaker names
`txt`	Timestamped plain text
`json`	Structured data with word-level timestamps
`srt`	SubRip subtitles
`vtt`	WebVTT subtitles

Options

voxscriber --help

  --speakers, -s    Number of speakers (if known)
  --language, -l    Force language (e.g., 'en', 'es')
  --model, -m       Whisper model (default: large-v3-turbo on GPU/MLX, small on CPU)
  --formats, -f     Output formats (default: md,txt)
  --output, -o      Output directory
  --device          auto (default), mps, cuda, or cpu
  --srt-mode        Subtitle segmentation mode for srt/vtt: speaker|sentence
  --srt-max-duration  Maximum subtitle duration in seconds for srt/vtt
  --quiet, -q       Suppress progress
  --print           Print transcript to console

Performance

~0.1-0.15x RTF on Apple Silicon (MLX). ~0.15-0.25x RTF on NVIDIA GPUs (faster-whisper). A 20-minute recording processes in ~2-4 minutes depending on hardware.

Troubleshooting

Run the diagnostic tool to check your setup:

voxscriber-doctor

This will check FFmpeg availability and HF_TOKEN, and offer to fix common issues automatically.

Other Issues

Issue	Solution
`requires Python >= 3.10`	Use Python 3.10+: `python3.10 -m venv .venv`
Installed wrong package	It's `voxscriber` (with 'r'), not `voxscribe`
`HF_TOKEN required`	Run `voxscriber-doctor` to set up authentication

Support

If you find VoxScriber useful, consider supporting its development:

License

MIT

Project details

These details have been verified by PyPI

Project links

Repository

GitHub Statistics

Maintainers

dparedesi

These details have not been verified by PyPI

Release history Release notifications | RSS feed

This version

0.2.8

May 29, 2026

0.2.6

Mar 2, 2026

0.2.5

Mar 2, 2026

0.2.4

Mar 2, 2026

0.2.3

Mar 2, 2026

0.2.2

Mar 2, 2026

0.2.1

Mar 2, 2026

0.2.0

Mar 2, 2026

0.1.19

Feb 15, 2026

0.1.18

Feb 3, 2026

0.1.17

Jan 13, 2026

0.1.16

Jan 13, 2026

0.1.15

Jan 10, 2026

0.1.14

Jan 3, 2026

0.1.13

Jan 3, 2026

0.1.12

Jan 3, 2026

0.1.11

Jan 2, 2026

0.1.10

Jan 2, 2026

0.1.9

Jan 2, 2026

0.1.8

Jan 2, 2026

0.1.7

Jan 2, 2026

0.1.6

Jan 2, 2026

0.1.5

Jan 2, 2026

0.1.4

Jan 2, 2026

0.1.3

Jan 2, 2026

0.1.2

Jan 2, 2026

0.1.1

Jan 2, 2026

0.1.0

Jan 2, 2026

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

voxscriber-0.2.8.tar.gz (869.2 kB view details)

Uploaded May 29, 2026 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

voxscriber-0.2.8-py3-none-any.whl (29.8 kB view details)

Uploaded May 29, 2026 Python 3

File details

Details for the file voxscriber-0.2.8.tar.gz.

File metadata

Download URL: voxscriber-0.2.8.tar.gz
Upload date: May 29, 2026
Size: 869.2 kB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for voxscriber-0.2.8.tar.gz
Algorithm	Hash digest
SHA256	`3f8f8c5d6a74acbe72067a6c77344d9ddd0c4d8ee2e9ae8d1a4db3533b547d56`
MD5	`200314f65a857d0e50892332b97f3fb7`
BLAKE2b-256	`2aea4c3aa6c86bd63cb8c76e7a015b9b40f16f260728ee95177ef081bfa0144b`

See more details on using hashes here.

Provenance

The following attestation bundles were made for voxscriber-0.2.8.tar.gz:

Publisher: publish.yml on dparedesi/voxscriber

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: voxscriber-0.2.8.tar.gz
- Subject digest: 3f8f8c5d6a74acbe72067a6c77344d9ddd0c4d8ee2e9ae8d1a4db3533b547d56
- Sigstore transparency entry: 1671647426
- Sigstore integration time: May 29, 2026
Source repository:
- Permalink: dparedesi/voxscriber@9bc447826f3d421a972d06ef5e54bda3fac6f5fb
- Branch / Tag: refs/tags/v0.2.8
- Owner: https://github.com/dparedesi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@9bc447826f3d421a972d06ef5e54bda3fac6f5fb
- Trigger Event: release

File details

Details for the file voxscriber-0.2.8-py3-none-any.whl.

File metadata

Download URL: voxscriber-0.2.8-py3-none-any.whl
Upload date: May 29, 2026
Size: 29.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for voxscriber-0.2.8-py3-none-any.whl
Algorithm	Hash digest
SHA256	`ec8421d3269a1bb336d40dc31e45f75ea79aefa5db900cfb7003f609d8bb586f`
MD5	`bf6a69c702363902ce9737057c92be12`
BLAKE2b-256	`964e0b2c65c19374cec5ad8ccfe046be2a3acea7fa24819ae6eb4f113debc6de`

See more details on using hashes here.

Provenance

The following attestation bundles were made for voxscriber-0.2.8-py3-none-any.whl:

Publisher: publish.yml on dparedesi/voxscriber

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: voxscriber-0.2.8-py3-none-any.whl
- Subject digest: ec8421d3269a1bb336d40dc31e45f75ea79aefa5db900cfb7003f609d8bb586f
- Sigstore transparency entry: 1671647432
- Sigstore integration time: May 29, 2026
Source repository:
- Permalink: dparedesi/voxscriber@9bc447826f3d421a972d06ef5e54bda3fac6f5fb
- Branch / Tag: refs/tags/v0.2.8
- Owner: https://github.com/dparedesi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: publish.yml@9bc447826f3d421a972d06ef5e54bda3fac6f5fb
- Trigger Event: release

voxscriber 0.2.8

Navigation

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Project description

VoxScriber

Requirements

Installation

Setup Hugging Face Token

Usage

Python API

Output Formats

Options

Performance

Troubleshooting

Other Issues

Support

License

Project details

Verified details

Project links

GitHub Statistics

Maintainers

Unverified details

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

Provenance

File details

File metadata

File hashes

Provenance