Skip to main content

Spoken Language IDentification (LID) using multilingual Whisper model

Project description

This is a spoken language identification system that is based on the Whisper model. The system uses the Whisper-based algorithm to identify spoken languages or non-speech event. The Section 2.3 of the paper about Whisper (https://arxiv.org/abs/2212.04356) states that language tags or non-speech tags need to be predicted after the <|startoftranscript|> special token. Based on this information, the system estimates a probability distribution for the next token after the <|startoftranscript|> and selects the token with the highest probability as the final spoken language prediction. Since the predicted token can be either a language tag or a non-speech tag, the system combines the features of a spoken language identifier and a voice activity detector.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

whisper_lid-0.0.1.tar.gz (96.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

whisper_lid-0.0.1-py3-none-any.whl (8.4 kB view details)

Uploaded Python 3

File details

Details for the file whisper_lid-0.0.1.tar.gz.

File metadata

  • Download URL: whisper_lid-0.0.1.tar.gz
  • Upload date:
  • Size: 96.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for whisper_lid-0.0.1.tar.gz
Algorithm Hash digest
SHA256 ea3f3c4d787f28c2ccff5d5677d49774f0293a0cc9628c6f68b3695df466d64c
MD5 0fbcb64f63859281d6462c919476649c
BLAKE2b-256 ead2bc5257f7f97f512810b13f7497df0a12cbaae625a6e636bcf44c6b221828

See more details on using hashes here.

File details

Details for the file whisper_lid-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: whisper_lid-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 8.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.10.16

File hashes

Hashes for whisper_lid-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 15f75214411bdd75f03a38e196e545ae7365c1c0a7572b0d971b5b25f85549d7
MD5 690a1c49e08c99166679f9e69ddbe125
BLAKE2b-256 abc1fbc6b30a5b758b4e022fda38f7436899a722df4dcb3f9a8462d9849b870d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page