Skip to main content

Python wrapper for phonetisaurus grapheme to phoneme tool

Project description

Phonetisaurus for Python

Python wrapper for the excellent phonetisaurus grapheme to phoneme tool (license).

Includes pre-built binaries for:

  • x86_64 - desktop/laptop/server (64-bit)
  • armv6l - Raspberry Pi 0/1
  • armv7l - Raspberry Pi 2/3/4 (32-bit)
  • aarch64 - Raspberry Pi 3/4 (64-bit)

Requirements

  • Python 3.7+
  • Linux
    • Tested with Debian Buster

Installing

$ pip install phonetisaurus

Training

Assuming you have a lexicon formatted like the CMU pronouncing dictionary:

word1 phoneme1 phoneme2 ...
word2 phoneme1 phoneme2 phoneme3 ...

saved to lexicon.dict run:

$ phonetisaurus train --model /path/to/write/g2p.fst /path/to/lexicon.dict

You may supply more than one lexicon.

See phonetisaurus train --help for more options.

Predicting

$ phonetisaurus predict --model /path/to/g2p.fst word1 word2 ...

If no words are provided on the command line, they will be read line-by-line from standard in.

You may optionally supply one or more --lexicon /path/to/lexicon.dict arguments to avoid guessing pronunciations for known words.

See phonetisaurus predict --help for more options.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page