Simple text to phones converter for multiple languages

These details have not been verified by PyPI

Project links

Project description

Tests
Documentation
Release
Citation

Phonemizer -- foʊnmaɪzɚ

The phonemizer allows simple phonemization of words and texts in many languages.
Provides both the phonemize command-line tool and the Python function phonemizer.phonemize. See the package's documentation.

It is based on four backends: espeak, espeak-mbrola, festival and segments. The backends have different properties and capabilities resumed in table below. The backend choice is let to the user.

espeak-ng is a Text-to-Speech software supporting a lot of languages and IPA (International Phonetic Alphabet) output.
espeak-ng-mbrola uses the SAMPA phonetic alphabet instead of IPA but does not preserve word boundaries.
festival is another Tex-to-Speech engine. Its phonemizer backend currently supports only American English. It uses a custom phoneset, but it allows tokenization at the syllable level.
segments is a Unicode tokenizer that build a phonemization from a grapheme to phoneme mapping provided as a file by the user.

	espeak	espeak-mbrola	festival	segments
phone set	IPA	SAMPA	custom	user defined
supported languages	100+	35	US English	user defined
processing speed	fast	slow	very slow	fast
phone tokens	:heavy_check_mark:	:heavy_check_mark:	:heavy_check_mark:	:heavy_check_mark:
syllable tokens	:x:	:x:	:heavy_check_mark:	:x:
word tokens	:heavy_check_mark:	:x:	:heavy_check_mark:	:heavy_check_mark:
punctuation preservation	:heavy_check_mark:	:x:	:heavy_check_mark:	:heavy_check_mark:
stressed phones	:heavy_check_mark:	:x:	:x:	:x:
tie	:heavy_check_mark:	:x:	:x:	:x:

Citation

To refenrece the phonemizer in your own work, please cite the following JOSS paper.

@article{Bernard2021,
  doi = {10.21105/joss.03958},
  url = {https://doi.org/10.21105/joss.03958},
  year = {2021},
  publisher = {The Open Journal},
  volume = {6},
  number = {68},
  pages = {3958},
  author = {Mathieu Bernard and Hadrien Titeux},
  title = {Phonemizer: Text to Phones Transcription for Multiple Languages in Python},
  journal = {Journal of Open Source Software}
}

Licence

This program is free software: you can redistribute it and/or modify it under the terms of the GNU General Public License as published by the Free Software Foundation, either version 3 of the License, or (at your option) any later version.

This program is distributed in the hope that it will be useful, but WITHOUT ANY WARRANTY; without even the implied warranty of MERCHANTABILITY or FITNESS FOR A PARTICULAR PURPOSE. See the GNU General Public License for more details.

You should have received a copy of the GNU General Public License along with this program. If not, see http://www.gnu.org/licenses/.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

This version

3.3.0

Aug 1, 2024

3.2.1

Jun 9, 2022

3.2.0

May 23, 2022

3.1.1

Mar 29, 2022

3.0.1

Dec 18, 2021

3.0

Oct 25, 2021

2.2.2

Jan 6, 2021

2.2.1

Jul 24, 2020

2.2

Feb 27, 2020

2.1

Jan 29, 2020

2.0.1

Nov 7, 2019

2.0.1.dev0 pre-release

Nov 4, 2019

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

phonemizer-3.3.0.tar.gz (88.6 kB view details)

Uploaded Aug 1, 2024 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

phonemizer-3.3.0-py3-none-any.whl (103.8 kB view details)

Uploaded Aug 1, 2024 Python 3

File details

Details for the file phonemizer-3.3.0.tar.gz.

File metadata

Download URL: phonemizer-3.3.0.tar.gz
Upload date: Aug 1, 2024
Size: 88.6 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.4

File hashes

Hashes for phonemizer-3.3.0.tar.gz
Algorithm	Hash digest
SHA256	`5e0c38122effe0b331a24e674aff256874ece169d70a9cf1120337b56f8e3d0c`
MD5	`b4e8fae495b394087083e53f522d8418`
BLAKE2b-256	`c4ff3574c55a71b42ad6944a5bf0a7d59f0251ea2ba47e51a5c4005e32e9145c`

See more details on using hashes here.

File details

Details for the file phonemizer-3.3.0-py3-none-any.whl.

File metadata

Download URL: phonemizer-3.3.0-py3-none-any.whl
Upload date: Aug 1, 2024
Size: 103.8 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/5.1.1 CPython/3.12.4

File hashes

Hashes for phonemizer-3.3.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`17afaa98691fe73b025dd8d8727b0e67cc376c5e7ee27590853e457fb3f43602`
MD5	`25a2ec347defdff8664cfd7cd97bcf2e`
BLAKE2b-256	`b623e8d67c2052e132181c4c9027c2d8ed9e37e8acb27acfc13ed2d0c41ed850`

See more details on using hashes here.

phonemizer 3.3.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Phonemizer -- foʊnmaɪzɚ

Citation

Licence

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes