Lingvo utils for Google SVL team
Lingvo-based modules for speaker and language recognition
Overview
Here we open source some of the Lingvo-based modules used in our publications.
Disclaimer
This is NOT an official Google product.
GE2E and GE2E-XS losses
GE2E and GE2E-XS losses are implemented in lingvo/loss_layers.py.
GE2E was proposed in the paper "Generalized end-to-end loss for speaker verification" (Wan et al., ICASSP 2018).
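GE2E-XS was proposed in the paper "Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition" (Pelecanos et al., Interspeech 2021).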
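For readers who want a feel for what the GE2E softmax loss computes, here is a minimal pure-Python sketch. It is not the code in lingvo/loss_layers.py: the function names (`cosine`, `mean_vector`, `ge2e_softmax_loss`) are hypothetical, the defaults `w=10.0` and `b=-5.0` are the initial values suggested in the GE2E paper (where they are learned), and the loss is averaged over utterances here for readability rather than summed. GE2E-XS is not covered.

```python
import math


def cosine(u, v):
    """Cosine similarity between two non-zero vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    return dot / (norm_u * norm_v)


def mean_vector(vectors):
    """Element-wise mean of a list of equal-length vectors."""
    n = len(vectors)
    return [sum(col) / n for col in zip(*vectors)]


def ge2e_softmax_loss(embeddings, w=10.0, b=-5.0):
    """Illustrative GE2E softmax loss (assumed layout, not the Lingvo API).

    embeddings[j][i] is the embedding of utterance i from speaker j.
    Every speaker needs at least two utterances.
    """
    centroids = [mean_vector(utts) for utts in embeddings]
    total, count = 0.0, 0
    for j, utts in enumerate(embeddings):
        for i, e in enumerate(utts):
            sims = []
            for k, c in enumerate(centroids):
                if k == j:
                    # For the true speaker, leave the utterance itself
                    # out of the centroid.
                    c = mean_vector(utts[:i] + utts[i + 1:])
                sims.append(w * cosine(e, c) + b)
            # Numerically stable log-sum-exp over all speakers.
            m = max(sims)
            lse = m + math.log(sum(math.exp(s - m) for s in sims))
            # Softmax loss: -S(true speaker) + log sum_k exp(S(k)).
            total += lse - sims[j]
            count += 1
    return total / count
```

Calling `ge2e_softmax_loss` on a batch whose speakers form tight, well-separated clusters should give a value close to zero, while a batch in which utterances of different speakers are mixed gives a larger value.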
Attentive temporal pooling
Attentive temporal pooling is implemented in lingvo/cumulative_statistics_layer.py.
It is used by these papers:
- Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech
- Parameter-Free Attentive Scoring for Speaker Verification
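As a rough illustration of the general idea, the sketch below computes attention-weighted running statistics (mean and standard deviation) over a frame sequence, so that a pooled output is available after every frame. This is a generic attentive statistics pooling written in cumulative form, not the layer in lingvo/cumulative_statistics_layer.py: the function name `cumulative_attentive_stats`, the per-frame scalar `scores` input, and the choice of statistics are all assumptions made for the example.

```python
import math


def cumulative_attentive_stats(frames, scores):
    """Attention-weighted running mean/std (illustrative, assumed design).

    frames: list of feature vectors, one per time step.
    scores: list of scalar attention logits, one per time step.
    Returns a list of (mean, std) pairs; entry t pools frames[0..t].
    """
    dim = len(frames[0])
    # Shifting all logits by a constant leaves the weighted statistics
    # unchanged and keeps exp() from overflowing.
    max_score = max(scores)
    weight_sum = 0.0
    weighted_sum = [0.0] * dim
    weighted_sq_sum = [0.0] * dim
    outputs = []
    for frame, score in zip(frames, scores):
        w = math.exp(score - max_score)
        # Only three accumulators are carried from frame to frame.
        weight_sum += w
        for d in range(dim):
            weighted_sum[d] += w * frame[d]
            weighted_sq_sum[d] += w * frame[d] ** 2
        mean = [s / weight_sum for s in weighted_sum]
        # Clamp tiny negative variances caused by rounding.
        var = [max(sq / weight_sum - m * m, 0.0)
               for sq, m in zip(weighted_sq_sum, mean)]
        std = [math.sqrt(v) for v in var]
        outputs.append((mean, std))
    return outputs
```

With equal scores for every frame the result reduces to the plain running mean and standard deviation; a frame with a much higher score dominates the pooled statistics from that point on.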
Attentive scoring
Attentive scoring is implemented in lingvo/attentive_scoring_layer.py.
It was proposed in the paper "Parameter-Free Attentive Scoring for Speaker Verification" (Pelecanos et al., Odyssey 2022).
Citations
Our papers are cited as:
@inproceedings{wan2018generalized,
title={Generalized end-to-end loss for speaker verification},
author={Wan, Li and Wang, Quan and Papir, Alan and Moreno, Ignacio Lopez},
booktitle={International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
pages={4879--4883},
year={2018},
organization={IEEE}
}
@inproceedings{pelecanos2021drvectors,
title={{Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition}},
author={Jason Pelecanos and Quan Wang and Ignacio Lopez Moreno},
year={2021},
booktitle={Proc. Interspeech},
pages={4603--4607},
doi={10.21437/Interspeech.2021-641}
}
@inproceedings{pelecanos2022parameter,
title={Parameter-Free Attentive Scoring for Speaker Verification},
author={Jason Pelecanos and Quan Wang and Yiling Huang and Ignacio Lopez Moreno},
booktitle={Odyssey: The Speaker and Language Recognition Workshop},
year={2022}
}
@inproceedings{wang2022attentive,
title={Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech},
author={Quan Wang and Yang Yu and Jason Pelecanos and Yiling Huang and Ignacio Lopez Moreno},
booktitle={Odyssey: The Speaker and Language Recognition Workshop},
year={2022}
}
Download files
Source Distribution
sidlingvo-0.0.4.tar.gz (29.7 kB)
Built Distribution
sidlingvo-0.0.4-py3-none-any.whl (33.6 kB)
File details
Details for the file sidlingvo-0.0.4.tar.gz.
File metadata
- Download URL: sidlingvo-0.0.4.tar.gz
- Upload date:
- Size: 29.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.1
File hashes
Algorithm | Hash digest
---|---
SHA256 | b836ec948af7a7a1f89150b5644198c7b2c4d2d87e081b32af34046ddfa42bd9
MD5 | 7de49952e9d91f18476ec55cec9591b5
BLAKE2b-256 | 9251a7f8072f4ca3db0aec78d3da0ba91ff81c887a95cb5c7fe953ff837b3497
File details
Details for the file sidlingvo-0.0.4-py3-none-any.whl.
File metadata
- Download URL: sidlingvo-0.0.4-py3-none-any.whl
- Upload date:
- Size: 33.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.11.1
File hashes
Algorithm | Hash digest
---|---
SHA256 | de3ab6f0684837a1e623464c3d7918de6a5fe4d3b1d654096ca0050e9f9d81f5
MD5 | 3a7f925074478e1c7114f30c72ad0d32
BLAKE2b-256 | 5950dc79cb27a26afc7933a014f921545ed9c5c60d179e7168dc4a6d54d82e58
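A downloaded file can be checked against the SHA256 digests listed above. The following is a small sketch using only the Python standard library; the helper names `sha256_of_file` and `verify_sha256` are made up for this example and are not part of the package.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_sha256(path, expected_hex):
    """True if the file's SHA256 digest matches the expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For instance, assuming the source distribution was saved in the current directory, `verify_sha256("sidlingvo-0.0.4.tar.gz", "b836ec948af7a7a1f89150b5644198c7b2c4d2d87e081b32af34046ddfa42bd9")` should return `True` for an intact download.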