A selection of open source tools for Nahuatl NLP.
Project description
nahuatl_tools
: a selection of tools for Nahuatl NLP, designed to be fast, simple, and easy to use. It has no library dependencies other than those included in base Python. All code files have fully documented functions for ease of use. Download with pip
: pip install nahuatl_tools
.
For all features of nahuatl_tools
, it is expected that the text be in an orthography designed for a one-to-one phoneme to grapheme conversion. For automatic orthography conversion, see nahuatl_tools/orthography_converter.py
. The wordlists are also written with this orthography.
Features:
- Morphological segmentation (see
nahuatl_tools/parse.py
) - Stemming (see
nahuatl_tools/parse.py
) - Part-of-speech tagging (see
nahuatl_tools/pos_tagger.py
) - Full text tokenization/morpheme segmentation pipeline, including optional orthography conversion (see
nahuatl_tools/tokenizer.py
) - Wordlists and lists of stems (see
wordlists/
) - Morpheme glosser (see
nahuatl_tools/gloss.py
)
For questions, suggestions, or anything else, email nicocloutier1@gmail.com
.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file nahuatl_tools-0.0.2.tar.gz
.
File metadata
- Download URL: nahuatl_tools-0.0.2.tar.gz
- Upload date:
- Size: 12.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f79dd9392602e2dc099b5527fe41c2223521e57c6202fdbfbfddd236917cd13b |
|
MD5 | 605cb658e0fcad612bd635e7bf04f3ff |
|
BLAKE2b-256 | 6dee5d272051836cdc28e73aa4e2a7da18ddee0ae814b8f39ad09e054c346177 |
File details
Details for the file nahuatl_tools-0.0.2-py3-none-any.whl
.
File metadata
- Download URL: nahuatl_tools-0.0.2-py3-none-any.whl
- Upload date:
- Size: 13.5 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.9
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 30dc03a172ed95461ce6dd6c85dc1290ef375610dcbb75401e5c91308ea2cf7f |
|
MD5 | 0e9406d1968575bde05fc28920a3a868 |
|
BLAKE2b-256 | a70fe2d4169df44463f041ab1f7940964489418b806302663a8db838603315f6 |