A phonetic transcriber module for emtsv
Project description
emPhon
Phonetic transcriber for the xtsv framework.
Requirements
- Python >= 3.6
- make
Installation
- Clone the repository:
git clone https://github.com/levaid/emPhon make buildpip install dist/*.whl
Usage
- Same as any other module using the xtsv framework - either as part of the emtsv framework or as separate module.
- The module needs
formandanasfields and produces thephonfield and by default, it prepends the phonetic form of the entire sentence
Configurations
The module takes command line arguments. By default, the module produces IPA output in strict xtsv format.
Command line arguments:
--ipaizeor--no-ipaizetoggles IPA-ization, it produces the inner representation which uses exactly one unicode character for each phoneme. Default: on.--opt-palatal-assimor--no-opt-palatal-assimtoggles optional palatal assimilation for the t/d+ny clusters, e.g. lapátnyél -> lapátynyél. Default: off.--include-sentenceor--no-include-sentencetoggles the inclusion of the entire phonetic form as a comment before each sentence. Default: on.
Example output
# phon = ɒ mɛɡoldaːʒbɒ ɒkaːr moʃt meːɡ nɛm iʃ ɛŋɡɛdeːjɛzɛtː oltaːʃok iʃ bɛjaːt͡shɒtnɒk .
A ɒ
megoldásba mɛɡoldaːʒbɒ
akár ɒkaːr
most moʃt
még meːɡ
nem nɛm
is iʃ
engedélyezett ɛŋɡɛdeːjɛzɛtː
oltások oltaːʃok
is iʃ
bejátszhatnak bɛjaːt͡shɒtnɒk
. .
Paper
Can be downloaded from here. Please cite the following paper using this module:
@InProceedings{ Kulcsar:2021,
author = {Virág Kulcsár and Dániel Lévai},
title = {em{P}hon: Morphologically sensitive open-source phonetic transcriber},
booktitle = {{XVII}. Conference on Hungarian Computational Linguistics ({MSZNY}2021)},
year = 2021,
address = {Szeged}
}
Miscellaneous
If different IPA (or transcription) is needed, the emphon/ipa_key.json file contains the key, which is human-modifiable.
License
This module is licensed under the LGPL 3.0 license.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file emphon-1.3.1.tar.gz.
File metadata
- Download URL: emphon-1.3.1.tar.gz
- Upload date:
- Size: 10.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.7.0 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
cce20eba3b4a3d98c07112993d9b4dbc5b35117c1f009371c151416343c4ca2d
|
|
| MD5 |
bdb108613df5f6ac0827d02f89c55b32
|
|
| BLAKE2b-256 |
e37048b910c9a90ae0a391777592fe7ef686c3ab91e6e168d0392357d913c902
|
File details
Details for the file emphon-1.3.1-py3-none-any.whl.
File metadata
- Download URL: emphon-1.3.1-py3-none-any.whl
- Upload date:
- Size: 22.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.1 importlib_metadata/3.10.1 pkginfo/1.7.0 requests/2.22.0 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
20c40120b9f57a946644f53fbfdc6b457d28f03eac031509cbd40973f2ae9097
|
|
| MD5 |
3ea987e72b9f91f85889c042f1b0896a
|
|
| BLAKE2b-256 |
5624ad7964196b7497473665118bdcdd4bf97e6eb976262c47d18052cdfb927c
|