KoParadigm: A Korean Conjugation Paradigm Generator
This is the official repo for our paper: KoParadigm: A Korean Conjugation Paradigm Generator
An (inflectional) paradigm is the set of all inflected forms of a word. For example, the English verb "look" has the inflected forms "look", "look-s", "look-ed", and "look-ing". Paradigms are widely used in corpus linguistics and search engines. Creating the full paradigm set for a language can be tricky, particularly for a morphologically rich language like Korean. The inflection of Korean verbs is notoriously complicated. Typically, a Korean verb can combine with more than 100 endings. What is worse, the combination rules are far from simple: they depend on the sound of the verb and the ending, and on the part of speech of the verb (action or descriptive). That is probably why, as far as I know, there has been no open-source Korean paradigm generator so far. This is the first one. With KoParadigm, you can easily get the full paradigm of a Korean verb.
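To make "determined by the sound of the verb" a little more concrete, here is a minimal sketch of one such sound-based property: whether a stem ends in a consonant. It relies only on the layout of precomposed Hangul syllables in Unicode. The function name `ends_in_consonant` is hypothetical and is not part of KoParadigm; the library's actual rules are richer than this single check.

```python
# Illustrative only: this is NOT KoParadigm's implementation.
HANGUL_BASE = 0xAC00  # first precomposed Hangul syllable, 가
HANGUL_LAST = 0xD7A3  # last precomposed Hangul syllable, 힣
NUM_FINALS = 28       # 27 final consonants plus the "no final" slot


def ends_in_consonant(stem: str) -> bool:
    """Return True if the last syllable of `stem` has a final consonant."""
    if not stem:
        raise ValueError("stem must not be empty")
    code = ord(stem[-1])
    if not HANGUL_BASE <= code <= HANGUL_LAST:
        raise ValueError(f"not a precomposed Hangul syllable: {stem[-1]!r}")
    # Final-consonant index 0 means the syllable ends in a vowel.
    return (code - HANGUL_BASE) % NUM_FINALS != 0
```

For instance, `ends_in_consonant("곱")` is true while `ends_in_consonant("하")` is false, and a property like this is one of the inputs that can decide which shape of an ending attaches to a stem.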
Dependencies
- python >=3.6
- jamo >=0.4.1
- xlrd == 1.2.0
Installation
pip install koparadigm
Usage
>>> from koparadigm import Paradigm, prettify
>>> p = Paradigm()
>>> verb = "곱" # Note that you must drop the final ending 다
>>> paradigms = p.conjugate(verb) # returns a list of [POS, [(ending, form), ...]] lists
>>> print(paradigms)
[['Action Verb', [('거나', '곱거나'), ('거늘', '곱거늘'), ('거니', '곱거니') ...]]]
>>> prettify(paradigms)
POS = Action Verb
• ending = 거나 form = 곱거나
• ending = 거늘 form = 곱거늘
• ending = 거니 form = 곱거니
...
==================== 2 ====================
POS = Descriptive Verb
• ending = 거나 form = 곱거나
• ending = 거늘 form = 곱거늘
• ending = 거니 form = 곱거니
• ending = 거니와 form = 곱거니와
...
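The nested list returned by `conjugate` can be awkward to query directly. The sketch below shows one possible way to post-process it, assuming the result has exactly the shape printed above (a list of `[POS, [(ending, form), ...]]` entries). The helpers `strip_da` and `paradigms_to_dict` are hypothetical names and are not part of KoParadigm; the sample data is copied from the truncated output shown above so the snippet runs without the library installed.

```python
def strip_da(dictionary_form: str) -> str:
    """Drop the dictionary-form ending 다, e.g. 곱다 -> 곱.

    Pass dictionary forms only: a bare stem that happens to end in 다
    would also be shortened.
    """
    if len(dictionary_form) > 1 and dictionary_form.endswith("다"):
        return dictionary_form[:-1]
    return dictionary_form


def paradigms_to_dict(paradigms):
    """Map each POS label to an {ending: form} dictionary."""
    return {pos: dict(pairs) for pos, pairs in paradigms}


# Sample shaped like the (truncated) output shown in the usage example.
sample = [["Action Verb", [("거나", "곱거나"), ("거늘", "곱거늘"), ("거니", "곱거니")]]]

lookup = paradigms_to_dict(sample)
print(strip_da("곱다"))                # stem to pass to conjugate()
print(lookup["Action Verb"]["거나"])   # form for the ending 거나
```

With the real library, `sample` would be replaced by `Paradigm().conjugate(strip_da("곱다"))`. A dictionary keyed by POS and ending suits lookups of single forms, while the original list preserves the order of endings.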
References
If you use our software for research, please cite:
@article{park2020KoParadigm,
  author={Park, Kyubyong},
  title={KoParadigm: A Korean Conjugation Paradigm Generator},
  journal={arXiv preprint arXiv:2004.13221},
  year={2020}
}