Skip to main content

VB Diarization with Eigenvoice and HMM Priors

Project description

VB Diarization with Eigenvoice and HMM Priors

Overview

This is a refactored version of the VB diarization software available at BUT Speech@FIT.

I am NOT the original author of the library.

Notes from original author

This python code implements speaker diarization algorithm described in: http://www.fit.vutbr.cz/~burget/VB_diarization_slides.pdf

This algorithm is based on a generalized version of the model described in:

Kenny, P. Bayesian Analysis of Speaker Diarization with Eigenvoice Priors, Montreal, CRIM, May 2008, http://www.crim.ca/perso/patrick.kenny/BayesCluster.pdf

Kenny, P., Reynolds, D., and Castaldo, F. Diarization of Telephone Conversations using Factor Analysis IEEE Journal of Selected Topics in Signal Processing, December 2010, http://www.crim.ca/perso/patrick.kenny/Kenny_sdfa.pdf

The generalization introduced in this implementation lies in using an HMM instead of the simple mixture model when modeling generation of segments (or even frames) from speakers. HMM limits the probability of switching between speakers when changing frames, which makes it possible to use the model on frame-by-frame bases without any need to iterate between

  1. clustering speech segments and
  2. re-segmentation (i.e. as it was done in the paper above).

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

vbdiar-0.0.1.tar.gz (7.4 kB view details)

Uploaded Source

Built Distribution

vbdiar-0.0.1-py3-none-any.whl (8.0 kB view details)

Uploaded Python 3

File details

Details for the file vbdiar-0.0.1.tar.gz.

File metadata

  • Download URL: vbdiar-0.0.1.tar.gz
  • Upload date:
  • Size: 7.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.5.4rc1

File hashes

Hashes for vbdiar-0.0.1.tar.gz
Algorithm Hash digest
SHA256 e7e22e1ffe519b9f6252d4ea4730c5728670601c8b1caa0e2ca03590a0e90a9d
MD5 a80bf1e0e4dbf495180d95fb8f8b8b82
BLAKE2b-256 ff59c88a1928a24a45ae7cb30ee0ff56634caced698d1ae249ba8f700d1c66ee

See more details on using hashes here.

File details

Details for the file vbdiar-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: vbdiar-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 8.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.5.4rc1

File hashes

Hashes for vbdiar-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 bde47ba75b64cfd734efb9670e7887c28883505ab4ec870c617f75919f3280f4
MD5 45528ed05de503a3f1884e40a7e343ff
BLAKE2b-256 95fe8c5577bd07ea36759c721dc79079d57c26f359c53ab6bbc1acfefa64753d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page