Lingvo utils for Google SVL team

Project description

Lingvo-based modules for speaker and language recognition

Overview

Here we open source some of the Lingvo-based modules used in our publications.

Disclaimer

This is NOT an official Google product.

GE2E and GE2E-XS losses

GE2E and GE2E-XS losses are implemented in lingvo/loss_layers.py.

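GE2E was proposed in "Generalized end-to-end loss for speaker verification" (wan2018generalized; see Citations).

GE2E-XS was proposed in "Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition" (pelecanos2021drvectors; see Citations).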
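
For readers new to GE2E, the following is a minimal NumPy sketch of the softmax variant of the loss as it is usually described: each utterance embedding is scored against every speaker centroid with a scaled cosine similarity, and the centroid of the utterance's own speaker is computed without that utterance. This is an illustration only, not the code in lingvo/loss_layers.py. The function name `ge2e_softmax_loss`, the `[speakers, utterances, dim]` input layout, the fixed `w` and `b` values (learnable in practice), and the averaging over utterances are all assumptions made for this example.

```python
import numpy as np


def _l2_normalize(x, eps=1e-12):
    """Scale vectors along the last axis to unit length."""
    return x / (np.linalg.norm(x, axis=-1, keepdims=True) + eps)


def ge2e_softmax_loss(embeddings, w=10.0, b=-5.0):
    """Illustrative GE2E softmax loss (hypothetical helper, not the package API).

    embeddings: array of shape [N, M, D] with N speakers and M >= 2
      utterances per speaker.
    w, b: scale and offset applied to the cosine similarity; fixed here,
      although they are normally trained.
    Returns the loss averaged over all N * M utterances.
    """
    e = _l2_normalize(np.asarray(embeddings, dtype=np.float64))
    n, m, _ = e.shape
    assert m >= 2, "need at least two utterances per speaker"

    # Centroid of every speaker, used when scoring other speakers' utterances.
    centroids = _l2_normalize(e.mean(axis=1))                     # [N, D]
    # Leave-one-out centroid: the speaker's mean without the utterance itself.
    loo = _l2_normalize((e.sum(axis=1, keepdims=True) - e) / (m - 1))  # [N, M, D]

    # sim[j, i, k] = cos(e[j, i], centroid[k])
    sim = np.einsum("jid,kd->jik", e, centroids)
    idx = np.arange(n)
    # For the true speaker (k == j), use the leave-one-out centroid instead.
    sim[idx, :, idx] = np.einsum("jid,jid->ji", e, loo)

    logits = w * sim + b                                          # [N, M, N]
    # Numerically stable log-sum-exp over the speaker axis.
    mx = logits.max(axis=-1, keepdims=True)
    lse = mx[..., 0] + np.log(np.exp(logits - mx).sum(axis=-1))   # [N, M]
    target = logits[idx, :, idx]                                  # [N, M]
    return float((lse - target).mean())
```

With well-separated speakers the value approaches zero, and it grows when utterances sit closer to another speaker's centroid than to their own.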

Attentive temporal pooling

Attentive temporal pooling is implemented in lingvo/cumulative_statistics_layer.py.

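It is used by papers listed under Citations, including "Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech" (wang2022attentive).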
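
As a rough intuition for what attention-weighted pooling over time can look like in a streaming setting, here is a small NumPy sketch that keeps a running weighted mean and standard deviation of frame features. It is a generic illustration, not the layer in lingvo/cumulative_statistics_layer.py. The function name `cumulative_attentive_stats`, the use of one scalar attention score per frame, and the exponential weighting are assumptions for this example.

```python
import numpy as np


def cumulative_attentive_stats(features, scores):
    """Running attention-weighted mean and std (hypothetical helper).

    features: [T, D] frame-level features.
    scores:   [T] unnormalized attention scores, one per frame (assumed).
    Returns (mean, std), each [T, D]; row t summarizes frames 0..t.
    """
    features = np.asarray(features, dtype=np.float64)
    scores = np.asarray(scores, dtype=np.float64)

    # Positive weights; subtracting the max only rescales them and
    # cancels in the ratios below, but avoids overflow in exp.
    weights = np.exp(scores - scores.max())[:, None]          # [T, 1]
    cum_w = np.cumsum(weights, axis=0)                        # [T, 1]

    # Running weighted first and second moments.
    mean = np.cumsum(weights * features, axis=0) / cum_w
    second = np.cumsum(weights * features ** 2, axis=0) / cum_w

    # Clip tiny negative values caused by floating-point round-off.
    std = np.sqrt(np.maximum(second - mean ** 2, 0.0))
    return mean, std
```

With equal scores this reduces to an ordinary running mean and standard deviation; larger scores make the corresponding frames count more in every later summary.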

Attentive scoring

Attentive scoring is implemented in lingvo/attentive_scoring_layer.py.

It was proposed in "Parameter-Free Attentive Scoring for Speaker Verification" (pelecanos2022parameter; see Citations).

Citations

Our papers are cited as:

@inproceedings{wan2018generalized,
  title={Generalized end-to-end loss for speaker verification},
  author={Wan, Li and Wang, Quan and Papir, Alan and Moreno, Ignacio Lopez},
  booktitle={International Conference on Acoustics, Speech and Signal Processing (ICASSP)},
  pages={4879--4883},
  year={2018},
  organization={IEEE}
}

@inproceedings{pelecanos2021drvectors,
  title={{Dr-Vectors: Decision Residual Networks and an Improved Loss for Speaker Recognition}},
  author={Jason Pelecanos and Quan Wang and Ignacio Lopez Moreno},
  year={2021},
  booktitle={Proc. Interspeech},
  pages={4603--4607},
  doi={10.21437/Interspeech.2021-641}
}

@inproceedings{pelecanos2022parameter,
  title={Parameter-Free Attentive Scoring for Speaker Verification},
  author={Jason Pelecanos and Quan Wang and Yiling Huang and Ignacio Lopez Moreno},
  booktitle={Odyssey: The Speaker and Language Recognition Workshop},
  year={2022}
}

@inproceedings{wang2022attentive,
  title={Attentive Temporal Pooling for Conformer-based Streaming Language Identification in Long-form Speech},
  author={Quan Wang and Yang Yu and Jason Pelecanos and Yiling Huang and Ignacio Lopez Moreno},
  booktitle={Odyssey: The Speaker and Language Recognition Workshop},
  year={2022}
}

Download files

Source Distribution

sidlingvo-0.0.4.tar.gz (29.7 kB)

Uploaded Source

Built Distribution

sidlingvo-0.0.4-py3-none-any.whl (33.6 kB)

Uploaded Python 3

File details

Details for the file sidlingvo-0.0.4.tar.gz.

File metadata

  • Download URL: sidlingvo-0.0.4.tar.gz
  • Upload date:
  • Size: 29.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.1

File hashes

Hashes for sidlingvo-0.0.4.tar.gz

Algorithm     Hash digest
SHA256        b836ec948af7a7a1f89150b5644198c7b2c4d2d87e081b32af34046ddfa42bd9
MD5           7de49952e9d91f18476ec55cec9591b5
BLAKE2b-256   9251a7f8072f4ca3db0aec78d3da0ba91ff81c887a95cb5c7fe953ff837b3497
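
One way to use a published digest is to hash the downloaded file locally and compare the result. The sketch below does this for SHA256 with Python's standard hashlib module. The helper names `sha256_of_file` and `matches_published_sha256` are made up for this example, and the file path in the usage comment assumes the archive was saved to the current directory.

```python
import hashlib


def sha256_of_file(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # Read fixed-size chunks until read() returns an empty bytes object.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_sha256(path, expected_hex):
    """True if the file's SHA256 equals the published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()


# Example usage (assumes the file is in the current directory):
# matches_published_sha256(
#     "sidlingvo-0.0.4.tar.gz",
#     "b836ec948af7a7a1f89150b5644198c7b2c4d2d87e081b32af34046ddfa42bd9",
# )
```

A mismatch means the local file differs from the one that was uploaded and should not be installed.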

File details

Details for the file sidlingvo-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: sidlingvo-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 33.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.1

File hashes

Hashes for sidlingvo-0.0.4-py3-none-any.whl

Algorithm     Hash digest
SHA256        de3ab6f0684837a1e623464c3d7918de6a5fe4d3b1d654096ca0050e9f9d81f5
MD5           3a7f925074478e1c7114f30c72ad0d32
BLAKE2b-256   5950dc79cb27a26afc7933a014f921545ed9c5c60d179e7168dc4a6d54d82e58
