VB Diarization with Eigenvoice and HMM Priors
Project description
VB Diarization with Eigenvoice and HMM Priors
Overview
This is a refactored version of the VB diarization software available at BUT Speech@FIT.
I am NOT the original author of the library.
Notes from original author
This python code implements speaker diarization algorithm described in: http://www.fit.vutbr.cz/~burget/VB_diarization_slides.pdf
This algorithm is based on a generalized version of the model described in:
Kenny, P. Bayesian Analysis of Speaker Diarization with Eigenvoice Priors, Montreal, CRIM, May 2008, http://www.crim.ca/perso/patrick.kenny/BayesCluster.pdf
Kenny, P., Reynolds, D., and Castaldo, F. Diarization of Telephone Conversations using Factor Analysis IEEE Journal of Selected Topics in Signal Processing, December 2010, http://www.crim.ca/perso/patrick.kenny/Kenny_sdfa.pdf
The generalization introduced in this implementation lies in using an HMM instead of the simple mixture model when modeling generation of segments (or even frames) from speakers. HMM limits the probability of switching between speakers when changing frames, which makes it possible to use the model on frame-by-frame bases without any need to iterate between
- clustering speech segments and
- re-segmentation (i.e. as it was done in the paper above).
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file vbdiar-0.0.1.tar.gz
.
File metadata
- Download URL: vbdiar-0.0.1.tar.gz
- Upload date:
- Size: 7.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.5.4rc1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | e7e22e1ffe519b9f6252d4ea4730c5728670601c8b1caa0e2ca03590a0e90a9d |
|
MD5 | a80bf1e0e4dbf495180d95fb8f8b8b82 |
|
BLAKE2b-256 | ff59c88a1928a24a45ae7cb30ee0ff56634caced698d1ae249ba8f700d1c66ee |
File details
Details for the file vbdiar-0.0.1-py3-none-any.whl
.
File metadata
- Download URL: vbdiar-0.0.1-py3-none-any.whl
- Upload date:
- Size: 8.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.21.0 setuptools/40.8.0 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.5.4rc1
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bde47ba75b64cfd734efb9670e7887c28883505ab4ec870c617f75919f3280f4 |
|
MD5 | 45528ed05de503a3f1884e40a7e343ff |
|
BLAKE2b-256 | 95fe8c5577bd07ea36759c721dc79079d57c26f359c53ab6bbc1acfefa64753d |