Skip to main content

Tools for working with the tunic language

Project description

Tunic_language

This tool uses the CMUdict (the Carnegie Mellon Pronouncing Dictionary) to retreive the phones for English words ultizing the nltk corpus.

Symbols are provided using the Tunic Language Reference Sheet created by Reddit user u/oposdeo.

Tunic language

The Tunic langauge is a written language, which when read aloud is simply spoken English. As such it is identical to English in terms of grammar and vocab. Understanding the langauge is therefore simply a matter of recognizing the characters.

The characters which are comprised of 12 segments, 5 of which are used to identify a vowel, 6 for a consonant and the 12th used to determine the order of vowel and consonant in a character. Each character represents up to 1 vowel and 1 consonant, and thus, can also represent a single vowel or consonant. Multiple characters make up a word, and charaters in a word are connected by a continous horizontal line through the centre.

Below is a single character and the numbering used to refer to the segments.

Segments

Dependencies

NLTK

pip install nltk

NLTK Data

import nltk

# Download CMUDict corpus (~4mb)
nltk.download('cmudict')
# Download punkt tokenizer (~35mb)
nltk.download('punkt')

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

tunic-language-YI-JIAHE-0.0.5.tar.gz (4.8 kB view hashes)

Uploaded Source

Built Distribution

tunic_language_YI_JIAHE-0.0.5-py3-none-any.whl (5.3 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page