Skip to main content

UzLemmatizer: A Stemmer and Lemmatizer Tool for Uzbek Language

Project description

UzLemmatizer

https://pypi.org/project/UzLemmatizer
https://github.com/UlugbekSalaev/UzLemmatizer

UzLemmatizer tool is focused to identify a stem and lemma of Uzbek words with its POS tag based on a morphemes. It is created as a python library and uploaded to PyPI. It is simply easy to use in your python project or other programming language projects via the API.

About project

UzLemmatizer project involves Uzbek word morphology, which is the study of word forms. The tool is focused to extract stem and lemma of Uzbek word based on morphemes. Additionally, the result contain a predicted POS tag of the given token.

Quick links

Demo

You can use web interface.

Features

  • Stemmer
  • Lemmatizer
  • Lemmatizer with POS tag
  • Extract Morphemes list
  • Predict POS tag

Usage

Three options to run UzLemmatizer:

  • pip
  • API
  • Web interface

pip installation

To install UzLemmatizer, simply run:

pip install UzLemmatizer

After installation, use in python like following:

# import the library
from UzLemmatizer import UzLemmatizer
# create an object 
analyzer = UzLemmatizer.UzLemmatizer()
# call stem method
analyzer.stem('maktabimda')
# call lemmatize method
analyzer.lemmatize('maktabimda')
# call lemmatize method with POS tag
analyzer.lemmatize('maktabimda', analyzer.POS.NOUN)

API

API configurations:

Web-UI

The web interface created to use easily the library: You can use web interface here.

Demo image

Options

When you use PyPI or API, you should use following options as POS tag of a word which is optional parameter of lemmatize() metods:
NOUN Noun
VERB Verb
ADJ Adjective
NUM Numerical
PRN Pronoun
ADV Adverb

pos parameters is optional for lemmatize metods.

Result Explaining

It returns single word in a string type from each method, stem and lemmatize, that is stem and lemma of given word, respectively.

Documentation

See here.

Citation

@misc{UzLemmatizer,
  title={{UzLemmatizer}: Stemmer and Lemmatizer Tool for Uzbek Language},
  url={https://github.com/UlugbekSalaev/UzLemmatizer},
  note={Software available from https://github.com/UlugbekSalaev/UzLemmatizer},
  author={
    Ulugbek Salaev},
  year={2022},
}

Contact

For help and feedback, please feel free to contact the author.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

UzLemmatizer-0.0.1-py3-none-any.whl (18.9 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page