Skip to main content

The Phoneme Discovery Benchmark

Project description

The Phoneme Discovery benchmark

[💾 Website] [📜 Paper] [📖 BibTex]

Introduction

The last several years have seen revolutionary improvements in both speech processing and textual natural language processing. In both cases, unsupervised or self-supervised pre-training has been the key to models autonomously discovering representations that are tremendously useful for doing language tasks. Yet, central to the study of human speech processing is the phoneme inventory, a small set of discrete units that abstract away from massive pronunciation variability in the signal.

Discovering the correct set of phonemes for a language is crucial: encode the wrong categories, and contrasts between words are distorted or disappear; fail to categorize at all, and contrasts between words are hidden behind semantically irrelevant variation in the signal. While much attention has been paid to whether unsupervised speech models’ (continuous or discrete) representations are predictive of phonemes, this benchmark, for the first time, explicitly fixes the goal of learning a discrete set of categories that are in one-to-one correspondence with the phoneme inventory of a language.

Infants appear to learn the phoneme inventory of their language effortlessly, before they can speak. They benefit from millions of years of evolution of the human brain and body, giving them a learning architecture that allows them to thrive in the face of scarce and noisy language data, preparing them to learn the phoneme inventory of any human language.

The Phoneme Discovery benchmark is aimed at building models that discover phoneme inventories across various languages, using only small amounts of speech data, and without textual data during training.

Installation

pip install discophon

To be able to compute ABX discriminabilities: pip install discophon[abx].

If you want to run baselines and have access to the utility scripts, clone this repository:

git clone https://github.com/bootphon/phoneme_discovery
cd phoneme_discovery
uv sync
# uv sync --all-extras --all-groups # If you want all dependencies

Usage

Check out the documentation:

Citation


Contact: benchmarks [at] cognitive-ml [dot] fr

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

discophon-0.0.5.tar.gz (165.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

discophon-0.0.5-py3-none-any.whl (28.0 kB view details)

Uploaded Python 3

File details

Details for the file discophon-0.0.5.tar.gz.

File metadata

  • Download URL: discophon-0.0.5.tar.gz
  • Upload date:
  • Size: 165.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.30 {"installer":{"name":"uv","version":"0.9.30","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for discophon-0.0.5.tar.gz
Algorithm Hash digest
SHA256 2bcf2de0dd8f4285e1092c6397c50057bd4216637210b02845c60bcf05295e96
MD5 4cfe3b7ecd565af2153b0feac0bcf39c
BLAKE2b-256 ec9c8c9b2fcf5c2612503ef31b0943ac987ea0e61a3199591017331a371b486f

See more details on using hashes here.

File details

Details for the file discophon-0.0.5-py3-none-any.whl.

File metadata

  • Download URL: discophon-0.0.5-py3-none-any.whl
  • Upload date:
  • Size: 28.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: uv/0.9.30 {"installer":{"name":"uv","version":"0.9.30","subcommand":["publish"]},"python":null,"implementation":{"name":null,"version":null},"distro":{"name":"Ubuntu","version":"24.04","id":"noble","libc":null},"system":{"name":null,"release":null},"cpu":null,"openssl_version":null,"setuptools_version":null,"rustc_version":null,"ci":true}

File hashes

Hashes for discophon-0.0.5-py3-none-any.whl
Algorithm Hash digest
SHA256 ddb0e9bce24ab98e14f7f907de979011a4c0feb9c13aea5fe6b72ab45b9928e9
MD5 8d7f1eb68b312ccdee62547cc66e1ac2
BLAKE2b-256 d2d90ad57d51f6178f165592269faeafc150b0fa7ceb98c5833c2986d3f22476

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page