Skip to main content

Simple text to phones converter for multiple languages

Project description

Tests Linux MacOS Windows Codecov
Documentation Doc
Release GitHub release (latest SemVer) PyPI downloads
Citation status DOI

Phonemizer -- foʊnmaɪzɚ

  • The phonemizer allows simple phonemization of words and texts in many languages.

  • Provides both the phonemize command-line tool and the Python function phonemizer.phonemize. See the package's documentation.

  • It is based on four backends: espeak, espeak-mbrola, festival and segments. The backends have different properties and capabilities resumed in table below. The backend choice is let to the user.

    • espeak-ng is a Text-to-Speech software supporting a lot of languages and IPA (International Phonetic Alphabet) output.

    • espeak-ng-mbrola uses the SAMPA phonetic alphabet instead of IPA but does not preserve word boundaries.

    • festival is another Tex-to-Speech engine. Its phonemizer backend currently supports only American English. It uses a custom phoneset, but it allows tokenization at the syllable level.

    • segments is a Unicode tokenizer that build a phonemization from a grapheme to phoneme mapping provided as a file by the user.

    espeak espeak-mbrola festival segments
    phone set IPA SAMPA custom user defined
    supported languages 100+ 35 US English user defined
    processing speed fast slow very slow fast
    phone tokens :heavy_check_mark: :heavy_check_mark: :heavy_check_mark: :heavy_check_mark:
    syllable tokens :x: :x: :heavy_check_mark: :x:
    word tokens :heavy_check_mark: :x: :heavy_check_mark: :heavy_check_mark:
    punctuation preservation :heavy_check_mark: :x: :heavy_check_mark: :heavy_check_mark:
    stressed phones :heavy_check_mark: :x: :x: :x:
    tie :heavy_check_mark: :x: :x: :x:

Citation

To refenrece the phonemizer in your own work, please cite the following JOSS paper.

@article{Bernard2021,
  doi = {10.21105/joss.03958},
  url = {https://doi.org/10.21105/joss.03958},
  year = {2021},
  publisher = {The Open Journal},
  volume = {6},
  number = {68},
  pages = {3958},
  author = {Mathieu Bernard and Hadrien Titeux},
  title = {Phonemizer: Text to Phones Transcription for Multiple Languages in Python},
  journal = {Journal of Open Source Software}
}

Licence

Copyright 2015-2021 Mathieu Bernard

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phonemizer-3.2.0.tar.gz (62.9 kB view details)

Uploaded Source

Built Distributions

If you're not sure about the file name format, learn more about wheel file names.

phonemizer-3.2.0-py3.10.egg (159.6 kB view details)

Uploaded Egg

phonemizer-3.2.0-py3-none-any.whl (90.4 kB view details)

Uploaded Python 3

File details

Details for the file phonemizer-3.2.0.tar.gz.

File metadata

  • Download URL: phonemizer-3.2.0.tar.gz
  • Upload date:
  • Size: 62.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.13

File hashes

Hashes for phonemizer-3.2.0.tar.gz
Algorithm Hash digest
SHA256 f5a6c9159bc7808016a5b03bdca7be42a72c8c84f4b2907fb502c46ceddc5fbf
MD5 77122b3a5433b3cef32156f96dbd82fd
BLAKE2b-256 8fe6693d442c7e83387133eb063712e44c5ef2e6e44c98071e2356b7eb9dee29

See more details on using hashes here.

File details

Details for the file phonemizer-3.2.0-py3.10.egg.

File metadata

  • Download URL: phonemizer-3.2.0-py3.10.egg
  • Upload date:
  • Size: 159.6 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.13

File hashes

Hashes for phonemizer-3.2.0-py3.10.egg
Algorithm Hash digest
SHA256 2a47dd5e77f33beb3c7b77205c676c07bd2e5444fcdac4f56dcc032cae24a8dd
MD5 cddda3972f2b3442a8322ebd3932ceca
BLAKE2b-256 42247cf515248e8f171561d1640d01e61cfe1264e38980fb05d0085f29614382

See more details on using hashes here.

File details

Details for the file phonemizer-3.2.0-py3-none-any.whl.

File metadata

  • Download URL: phonemizer-3.2.0-py3-none-any.whl
  • Upload date:
  • Size: 90.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.0 CPython/3.9.13

File hashes

Hashes for phonemizer-3.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 52c6cb20c5a2684e1b030bb6fa0db085fd62c95b885a3573821058ebd9ff5cf0
MD5 88d6896152aec7c68b1c4b775a5811ed
BLAKE2b-256 be6ff07e1f8e9c41ccb67798efaa7408f4c9e9f239cb0f4e6dc5597d3c448927

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page