The Phoneme Discovery Benchmark

[💾 Website] [📜 Paper] [📖 BibTex]

Introduction

The last several years have seen revolutionary improvements in both speech processing and textual natural language processing. In both cases, unsupervised or self-supervised pre-training has been key to models autonomously discovering representations that are highly useful for language tasks. Yet, central to the study of human speech processing is the phoneme inventory, a small set of discrete units that abstract away from massive pronunciation variability in the signal.

Discovering the correct set of phonemes for a language is crucial: encode the wrong categories, and contrasts between words are distorted or disappear; fail to categorize at all, and contrasts between words are hidden behind semantically irrelevant variation in the signal. While much attention has been paid to whether unsupervised speech models’ (continuous or discrete) representations are predictive of phonemes, this benchmark, for the first time, explicitly fixes the goal of learning a discrete set of categories that are in one-to-one correspondence with the phoneme inventory of a language.
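To make the idea of a one-to-one correspondence concrete, here is a minimal sketch. It is not the benchmark's scoring code: it assumes frame-aligned lists of discovered unit IDs and reference phoneme labels (hypothetical toy data), maps each unit to the phoneme it co-occurs with most often, and checks whether that mapping covers the inventory with exactly one unit per phoneme. The function names are illustrative only.

```python
from collections import Counter, defaultdict


def unit_to_phoneme_map(units, phonemes):
    """Map each discovered unit to the phoneme it co-occurs with most often."""
    if len(units) != len(phonemes):
        raise ValueError("units and phonemes must be frame-aligned")
    counts = defaultdict(Counter)
    for unit, phoneme in zip(units, phonemes):
        counts[unit][phoneme] += 1
    return {unit: c.most_common(1)[0][0] for unit, c in counts.items()}


def is_one_to_one(mapping, inventory):
    """True when every phoneme in the inventory is claimed by exactly one unit."""
    claimed = Counter(mapping.values())
    return set(claimed) == set(inventory) and all(n == 1 for n in claimed.values())


# Hypothetical toy alignment: one label per frame.
phonemes = ["k", "k", "a", "a", "t", "t"]
good_units = [0, 0, 1, 1, 2, 2]    # one unit per phoneme
split_units = [0, 3, 1, 1, 2, 2]   # /k/ is split across two units
merged_units = [0, 0, 1, 1, 0, 0]  # /k/ and /t/ are merged into one unit

for name, units in [("good", good_units), ("split", split_units), ("merged", merged_units)]:
    mapping = unit_to_phoneme_map(units, phonemes)
    print(name, is_one_to_one(mapping, set(phonemes)))
```

The split case corresponds to failing to abstract away from irrelevant variation, and the merged case to a contrast between words disappearing.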

Infants appear to learn the phoneme inventory of their language effortlessly, before they can speak. They benefit from millions of years of evolution of the human brain and body, giving them a learning architecture that allows them to thrive in the face of scarce and noisy language data, preparing them to learn the phoneme inventory of any human language.

The Phoneme Discovery benchmark is aimed at building models that discover phoneme inventories across various languages, using only small amounts of speech data, and without textual data during training.

Installation

pip install discophon

To compute ABX discriminabilities, install the abx extra: pip install "discophon[abx]" (the quotes keep shells such as zsh from interpreting the square brackets).
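After installing, one way to confirm that the package is visible to the current interpreter is to query its distribution metadata. The sketch below uses only the Python standard library; the helper name installed_version is illustrative, and discophon is the distribution name taken from the pip command above.

```python
from importlib.metadata import PackageNotFoundError, version
from typing import Optional


def installed_version(dist_name: str) -> Optional[str]:
    """Return the installed version of a distribution, or None if it is absent."""
    try:
        return version(dist_name)
    except PackageNotFoundError:
        return None


# Prints a version string if discophon is installed in this environment, else None.
print(installed_version("discophon"))
```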

To run the baselines and access the utility scripts, clone this repository:

git clone https://github.com/bootphon/phoneme_discovery
cd phoneme_discovery
uv sync
# uv sync --all-extras --all-groups # If you want all dependencies

Usage

Check out the documentation:

Citation


Contact: benchmarks [at] cognitive-ml [dot] fr
