mlphon

Malayalam phonetic analyser

These details have not been verified by PyPI

Project links

Homepage

Project description

This is python interface for the Malayalam phonetic analyser - mlphon.

Installation

Using Virtual Environment (https://docs.python.org/3/library/venv.html) is recommended.

$ pip install mlphon

Syllablize a Malayalam Word

The following python snippet will split a word in Malayalam script into syllables.

from mlphon import PhoneticAnalyser
mlphon = PhoneticAnalyser()
mlphon.split_to_syllables('കേരളം')

It will give the result

[‘കേ’, ‘ര’, ‘ളം’]

Phonetically analyse a Malayalam Word

from mlphon import PhoneticAnalyser
mlphon = PhoneticAnalyser()
mlphon.analyse('കേരളം')

It gives the result as a sequence of ipa and associated phonetic tags.

[{‘phonemes’: [{‘ipa’: ‘k’, ‘tags’: [‘plosive’, ‘voiceless’, ‘unaspirated’, ‘velar’]}, {‘ipa’: ‘eː’, ‘tags’: [‘v_sign’]}]}, {‘phonemes’: [{‘ipa’: ‘ɾ’, ‘tags’: [‘flapped’, ‘alveolar’]}, {‘ipa’: ‘a’, ‘tags’: [‘inherentvowel’]}]}, {‘phonemes’: [{‘ipa’: ‘ɭ’, ‘tags’: [‘lateral’, ‘retroflex’]}, {‘ipa’: ‘a’, ‘tags’: [‘inherentvowel’]}, {‘ipa’: ‘m’, ‘tags’: [‘anuswara’]}]}]

Malayalam g2p : Grapheme to Phoneme conversion

from mlphon import PhoneticAnalyser
mlphon = PhoneticAnalyser()
mlphon.grapheme_to_phoneme('കാറ്റ്')

It gives the ipa sequence as output.

[‘kaːṯṯə’]

Malayalam p2g : Phoneme to Grapheme conversion

from mlphon import PhoneticAnalyser
mlphon = PhoneticAnalyser()
mlphon.phoneme_to_grapheme('paːlə')

It gives the corresponding grapheme sequences as output.

[പാല്’]

Command Line Interface for the above operations: mlphon

usage:

mlphon [-h] [-s] [-a] [-p] [-pe string] [-se string] [-g] [-i INFILE]
        [-o OUTFILE] [-v]

optional arguments:
-h, --help            show this help message and exit
-s, --syllablize      Syllablize the input Malayalam string
-a, --analyse         Phonetically analyse the input Malayalam string
-p, --tophoneme       Transcribe the input Malayalam grapheme to phoneme
                        sequence
-pe string, --phoneme_end string
                        String to be inserted at end of phoneme
-se string, --syllable_end string
                        String to be inserted at end of syllable
-g, --tographeme      Transcribe the input phoneme sequence to Malayalam
                        grapheme
-i INFILE, --input INFILE
                        source of analysis data
-o OUTFILE, --output OUTFILE
                        target of generated strings
-v, --verbose         print verbosely while processing

For example to perform g2p operation on a set of words stored in input.txt with one Malayalam word per line,

mlphon -p -pe " " -se "." -i path/to/inputfile.txt -o path/to/outputfile.txt

Inputfile contents:

cat path/to/inputfile.txt
അകത്തുള്ളത്
അകപ്പെട്ടത്
അകലെ

Outputfile contents:

അകത്തുള്ളത് a .k a .t̪ t̪ u .ɭ ɭ a .t̪ ə .
അകപ്പെട്ടത്        a .k a .p p e .ʈ ʈ a .t̪ ə .
അകലെ    a .k a .l e .

Application: Using mlphon to create a phonetic lexicon

A typical use case of phonetic analysis is to create a phonetic lexicon to be used in Automatic Speech Recognition or Text to Speech Synthesis. The phonetic representation with each phoneme separated by a space can be obtained as below:

from mlphon import PhoneticAnalyser, split_as_phonemes
mlphon = PhoneticAnalyser()
analysis = mlphon.analyse('എന്നാൽ')
for result in analysis:
  split_as_phonemes(result)

It results in the output, two different valid phoneme sequences:

‘e n̪ n̪ aː l’

‘e n n aː l’

The phonetic representation with each syllable separated by a space can be obtained as below:

from mlphon import PhoneticAnalyser, split_as_syllables
mlphon = PhoneticAnalyser()
analysis = mlphon.analyse('ഇന്ത്യയുടെ')
for result in analysis:
  split_as_syllables(result)

It results in the output:

‘i n̪t̪ja ju ʈe’

To get phonemes and syllables with user defined end-marker strings as below:

from mlphon import PhoneticAnalyser, phonemize
mlphon = PhoneticAnalyser()
analysis = mlphon.analyse('ഇന്ത്യയുടെ')
for result in analysis:
  phonemize(result, " ", ".")

It results in the output with a ‘space’ after every phoneme and a ‘period’ after every syllable

‘i .n̪ t̪ j a .j u .ʈ e .’

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

This version

3.1.2

Sep 15, 2021

3.1.1

Jul 31, 2021

3.1.0

Jul 9, 2021

3.0.7

May 13, 2021

3.0.6

Nov 21, 2020

3.0.5

Nov 21, 2020

3.0.4

Nov 6, 2020

3.0.3

Nov 6, 2020

3.0.2

Oct 11, 2020

3.0.1

Oct 9, 2020

3.0.0

Oct 9, 2020

2.0.0

May 18, 2020

1.0.4

Jan 8, 2019

1.0.3

Dec 26, 2018

1.0.2

Dec 25, 2018

1.0.1

Dec 25, 2018

1.0.0

Dec 25, 2018

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

mlphon-3.1.2.tar.gz (21.4 kB view details)

Uploaded Sep 15, 2021 Source

File details

Details for the file mlphon-3.1.2.tar.gz.

File metadata

Download URL: mlphon-3.1.2.tar.gz
Upload date: Sep 15, 2021
Size: 21.4 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/47.1.0 requests-toolbelt/0.9.1 tqdm/4.50.2 CPython/3.7.9

File hashes

Hashes for mlphon-3.1.2.tar.gz
Algorithm	Hash digest
SHA256	`4054782f2916e963cb36a110f235be7082f725be633a8e0436c8ea7696f59906`
MD5	`9ff97f4e887279c7608d2264f7f1d21a`
BLAKE2b-256	`3467c0dfd51226a9f9157e6237b651b21dea3efc57a7cfdbd70778fec9a0a512`

See more details on using hashes here.

mlphon 3.1.2

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Installation

Syllablize a Malayalam Word

Phonetically analyse a Malayalam Word

Malayalam g2p : Grapheme to Phoneme conversion

Malayalam p2g : Phoneme to Grapheme conversion

Command Line Interface for the above operations: mlphon

Application: Using mlphon to create a phonetic lexicon

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

File details

File metadata

File hashes