A phonetic transcriber module for emtsv

emPhon

Phonetic transcriber for the xtsv framework.

Requirements

  • Python >= 3.6
  • make

Installation

  • Clone the repository: git clone https://github.com/levaid/emPhon
  • make build
  • pip install dist/*.whl

Usage

  • Use it like any other module built on the xtsv framework: either as part of the emtsv pipeline or as a standalone module.
  • The module requires the form and anas fields and produces the phon field. By default, it also prepends the phonetic form of the entire sentence as a comment before each sentence.

Configurations

The module takes command line arguments. By default, the module produces IPA output in strict xtsv format.

Command line arguments:

  • --ipaize or --no-ipaize toggles IPA-ization. With --no-ipaize, the module outputs its internal representation, which uses exactly one Unicode character for each phoneme. Default: on.
  • --opt-palatal-assim or --no-opt-palatal-assim toggles optional palatal assimilation for the t/d+ny clusters, e.g. lapátnyél -> lapátynyél. Default: off.
  • --include-sentence or --no-include-sentence toggles the inclusion of the entire phonetic form as a comment before each sentence. Default: on.
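
The paired on/off switches above can be pictured with a small argparse sketch. This is not emPhon's own argument-handling code: the helper names `build_parser` and `add_toggle` are hypothetical, and only the flag names and their defaults are taken from the list above.

```python
import argparse


def build_parser() -> argparse.ArgumentParser:
    """Illustrative parser with --flag / --no-flag pairs (not emPhon's real code)."""
    parser = argparse.ArgumentParser(description="emPhon-style toggle flags (sketch)")

    def add_toggle(name: str, default: bool, help_text: str) -> None:
        # Both switches write to the same destination; only one may be given.
        dest = name.replace("-", "_")
        group = parser.add_mutually_exclusive_group()
        group.add_argument(f"--{name}", dest=dest, action="store_true",
                           help=f"enable {help_text}")
        group.add_argument(f"--no-{name}", dest=dest, action="store_false",
                           help=f"disable {help_text}")
        # Parser-level default overrides the store_true/store_false defaults.
        parser.set_defaults(**{dest: default})

    # Defaults follow the documented behaviour: on, off, on.
    add_toggle("ipaize", True, "IPA output")
    add_toggle("opt-palatal-assim", False, "optional palatal assimilation")
    add_toggle("include-sentence", True, "the sentence-level phon comment")
    return parser


defaults = build_parser().parse_args([])
custom = build_parser().parse_args(["--no-ipaize", "--opt-palatal-assim"])
print(defaults)
print(custom)
```

Using two switches per option keeps the default visible in the help text and lets a pipeline configuration override it in either direction.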

Example output

# phon = ɒ mɛɡoldaːʒbɒ ɒkaːr moʃt meːɡ nɛm iʃ ɛŋɡɛdeːjɛzɛtː oltaːʃok iʃ bɛjaːt͡shɒtnɒk .
A       ɒ
megoldásba      mɛɡoldaːʒbɒ
akár    ɒkaːr
most    moʃt
még     meːɡ
nem     nɛm
is      iʃ
engedélyezett   ɛŋɡɛdeːjɛzɛtː
oltások oltaːʃok
is      iʃ
bejátszhatnak   bɛjaːt͡shɒtnɒk
.       .
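
For downstream processing, output like the above can be read back with a few lines of Python. The following is a minimal sketch, assuming two whitespace-separated columns (form and phon) exactly as shown in the example and a `# phon = ` comment line per sentence; real xtsv output is tab-separated, may carry a header row and further columns, and the function name `parse_phon_output` is hypothetical.

```python
from typing import List, Tuple

Sentence = Tuple[str, List[Tuple[str, str]]]

SAMPLE = """\
# phon = ɒ mɛɡoldaːʒbɒ ɒkaːr moʃt meːɡ nɛm iʃ ɛŋɡɛdeːjɛzɛtː oltaːʃok iʃ bɛjaːt͡shɒtnɒk .
A       ɒ
megoldásba      mɛɡoldaːʒbɒ
akár    ɒkaːr
most    moʃt
még     meːɡ
nem     nɛm
is      iʃ
engedélyezett   ɛŋɡɛdeːjɛzɛtː
oltások oltaːʃok
is      iʃ
bejátszhatnak   bɛjaːt͡shɒtnɒk
.       .
"""


def parse_phon_output(text: str) -> List[Sentence]:
    """Return one (sentence_phon, [(form, phon), ...]) pair per sentence."""
    prefix = "# phon = "
    sentences: List[Sentence] = []
    sentence_phon = ""
    tokens: List[Tuple[str, str]] = []
    for raw_line in text.splitlines():
        line = raw_line.strip()
        if not line:
            # A blank line closes the current sentence, if any.
            if tokens:
                sentences.append((sentence_phon, tokens))
                sentence_phon, tokens = "", []
            continue
        if line.startswith(prefix):
            sentence_phon = line[len(prefix):]
            continue
        # Assumption: exactly two columns, form first and phon second.
        form, phon = line.split(None, 1)
        tokens.append((form, phon))
    if tokens:
        sentences.append((sentence_phon, tokens))
    return sentences


for sentence_phon, tokens in parse_phon_output(SAMPLE):
    print(sentence_phon)
    for form, phon in tokens:
        print(f"{form}\t{phon}")
```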

Paper

The paper can be downloaded from here. Please cite the following paper when using this module:

@InProceedings{   Kulcsar:2021,
  author        = {Virág Kulcsár and Dániel Lévai},
  title         = {em{P}hon: Morphologically sensitive open-source phonetic transcriber},
  booktitle     = {{XVII}. Conference on Hungarian Computational Linguistics  ({MSZNY}2021)},
  year          = 2021,
  address       = {Szeged}
}

Miscellaneous

If a different IPA mapping (or another transcription scheme) is needed, edit the emphon/ipa_key.json file, which contains the human-modifiable key.
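
Because the internal representation uses exactly one Unicode character per phoneme, a key of this kind can be applied character by character. The sketch below illustrates the idea only: the structure of ipa_key.json is not described here, so the flat character-to-string mapping, the placeholder symbols in `HYPOTHETICAL_KEY_JSON`, and the function `apply_key` are all assumptions rather than emPhon's actual data or API.

```python
import json
from typing import Dict

# Hypothetical stand-in for a key file; the real emphon/ipa_key.json may differ.
HYPOTHETICAL_KEY_JSON = '{"S": "ʃ", "Z": "ʒ", "A": "aː"}'


def apply_key(internal: str, key: Dict[str, str]) -> str:
    """Map each internal character through the key; unknown characters pass through."""
    return "".join(key.get(char, char) for char in internal)


key = json.loads(HYPOTHETICAL_KEY_JSON)
print(apply_key("moSt", key))

# Swapping one entry is enough to switch to a different transcription symbol.
key["S"] = "š"
print(apply_key("moSt", key))
```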

License

This module is licensed under the LGPL 3.0 license.
