Skip to main content

A versioned python wrapper package for The CMU Pronouncing Dictionary data files.

Project description

CMUdict: Python wrapper for cmudict

Latest PyPI version GitHub Workflow Status

CMUdict is a versioned python wrapper package for The CMU Pronouncing Dictionary data files. The main purpose is to expose the data with little or no assumption on how it is to be used.

Installation

cmudict is available on PyPI. Simply install it with pip:

pip install cmudict

Usage

The cmudict data set includes 4 data files: cmudict.dict, cmudict.phones, cmudict.symbols, and cmudict.vp. See The CMU Pronouncing Dictionary for details on the data. Chances are, if you're here, you already know what's in the files.

Each file can be accessed through three functions, one which returns the raw (string) contents, one which returns a binary stream of the file, and one which does minimal processing of the file into an appropriate structure:

>>> import cmudict

>>> cmudict.dict() # Compatible with NLTK
>>> cmudict.dict_string()
>>> cmudict.dict_stream()

>>> cmudict.phones()
>>> cmudict.phones_string()
>>> cmudict.phones_stream()

>>> cmudict.symbols()
>>> cmudict.symbols_string()
>>> cmudict.symbols_stream()

>>> cmudict.vp()
>>> cmudict.vp_string()
>>> cmudict.vp_stream()

Three additional functions are included to maintain compatibility with NLTK: cmudict.entries(), cmudict.raw(), and cmudict.words(). See the nltk.corpus.reader.cmudict documentation for details:

>>> cmudict.entries() # Compatible with NLTK
>>> cmudict.raw() # Compatible with NLTK
>>> cmudict.words() # Compatible with NTLK

And finally, the license for the cmudict data set is available as well:

>>> cmudict.license_string() # Returns the cmudict license as a string

Credits

Built on or modeled after the following open source projects:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cmudict-1.0.32.tar.gz (936.0 kB view details)

Uploaded Source

Built Distribution

cmudict-1.0.32-py3-none-any.whl (939.4 kB view details)

Uploaded Python 3

File details

Details for the file cmudict-1.0.32.tar.gz.

File metadata

  • Download URL: cmudict-1.0.32.tar.gz
  • Upload date:
  • Size: 936.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.5 CPython/3.12.3 Linux/6.8.0-1017-azure

File hashes

Hashes for cmudict-1.0.32.tar.gz
Algorithm Hash digest
SHA256 e84a587bb610b3a837a93f07494e874860cf205ea7f23db652b871093a699f38
MD5 f3f133f8e7ac122a6dc2eaf787c403a4
BLAKE2b-256 1bd1c32478997451c1c7cbf07b663845972812ca9ce678d228f4cd2bac5d2e32

See more details on using hashes here.

File details

Details for the file cmudict-1.0.32-py3-none-any.whl.

File metadata

  • Download URL: cmudict-1.0.32-py3-none-any.whl
  • Upload date:
  • Size: 939.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: poetry/1.8.5 CPython/3.12.3 Linux/6.8.0-1017-azure

File hashes

Hashes for cmudict-1.0.32-py3-none-any.whl
Algorithm Hash digest
SHA256 b9323664d49d128193c480ec97a3270ab2162469289bb26e950d13b2ef661c41
MD5 3c8c4e125e0407a3c588c37db6879f70
BLAKE2b-256 faff617bdeab02c2ce0384e64a5bb00ec479322dbf019491def68af6a860733d

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page