Skip to main content

VB Diarization with Eigenvoice and HMM Priors

Project description

VB Diarization with Eigenvoice and HMM Priors


This is a refactored version of the VB diarization software available at BUT Speech@FIT.

I am NOT the original author of the library.

Notes from original author

This python code implements speaker diarization algorithm described in:

This algorithm is based on a generalized version of the model described in:

Kenny, P. Bayesian Analysis of Speaker Diarization with Eigenvoice Priors, Montreal, CRIM, May 2008,

Kenny, P., Reynolds, D., and Castaldo, F. Diarization of Telephone Conversations using Factor Analysis IEEE Journal of Selected Topics in Signal Processing, December 2010,

The generalization introduced in this implementation lies in using an HMM instead of the simple mixture model when modeling generation of segments (or even frames) from speakers. HMM limits the probability of switching between speakers when changing frames, which makes it possible to use the model on frame-by-frame bases without any need to iterate between

  1. clustering speech segments and
  2. re-segmentation (i.e. as it was done in the paper above).

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for vbdiar, version 0.0.1
Filename, size File type Python version Upload date Hashes
Filename, size vbdiar-0.0.1.tar.gz (7.4 kB) File type Source Python version None Upload date Hashes View
Filename, size vbdiar-0.0.1-py3-none-any.whl (8.0 kB) File type Wheel Python version py3 Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Huawei Huawei PSF Sponsor Microsoft Microsoft PSF Sponsor NVIDIA NVIDIA PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page