Morphological Analyser for Tatar language
Project description
Morphological Parser of Tatar language. Uses HFST-tool. Web form which uses this tool: http://tatmorphan.pythonanywhere.com/
To install:
$ pip install py_tat_morphan
To use lookup:
$ tat_morphan_lookup
To process text:
$ tat_morphan_process_text <filename>
To process whole folder:
$ tat_morphan_process_folder <path_from>
or
$ tat_morphan_process_folder <path_from> <path_to>
Note: if you do not provide <path_to>, programm puts analyzed texts into folder near initial with ‘_analyzed’ postfix. Eg, if <path_from>=’/home/ramil/mytexts/’, then <path_to>=’/home/ramil/mytexts_analyzed/’.
To use as python module:
>>> from py_tat_morphan.morphan import Morphan
>>> morphan = Morphan()
>>> print(morphan.analyse('урманнарга'))
>>> print(morphan.lemma('урманнарга'))
>>> print(morphan.pos('урманнарга'))
>>> print(morphan.process_text('Без урманга барабыз.'))
For feedback:
Versions:
1.2.1 | Uses HFST python package
1.2.2 | Add tat_morphan_lookup and tat_morphan_process_text scripts to bin/
1.2.3 | Fixed exception dictionary
1.2.4 | Fixed to use C HFST package | Added tat_morphan_process_folder script to bin/ | Added Russain Morphological Analyser (pymorphy2 package) to detect russian words in text
1.2.5 | Fixed morphophonetic and morphotacktic rules | Added tat_morphan_stats_of_folder script to bin/
1.2.6 | Fixed dictioray collection
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file py_tat_morphan-1.2.6.tar.gz.
File metadata
- Download URL: py_tat_morphan-1.2.6.tar.gz
- Upload date:
- Size: 3.4 MB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3428c29c9619ebca8c0d77f9490e61ec66b5aaa824df7ed1a65ed6733749fdbb
|
|
| MD5 |
f576195ad2b5b0fd92417ab17fe33643
|
|
| BLAKE2b-256 |
aa1d3526dd7344e065f7e9ae87391503c97c5d610ab144660dee6644f7d10430
|