Skip to main content

A Python package to get stem of any inflected Bangla words.

Project description

Fatick Stemmer

Description

Bengali is one of the most morphologically rich languages and it has lots of inflectional and derivational variant forms of a word. Because of that it is quite complicated to determine the stem of word.

Therefore our main focus was to program a light-weight library to determine a identical word for same type of inflected word as stem to run on any model.

Installation

Run the following to install:

  • using pip
pip install bangla-stemmer
  • using git
git clone https://github.com/Fatick-DevStudio/Bangla-stemmer.git
cd Bangla-stemmer
python setup.py install

Usages

Example 01:

from bangla_stemmer.stemmer import stemmer
wordlist = ['কবিরগুলিকে', 'আমাকে', 'নামাবার']
stmr = stemmer.BanglaStemmer()
stm = stmr.stem(word)
print(stm)

output: ['কবির', 'আমা', 'নামা']

Example 01:

from bangla_stemmer.stemmer.stemmer import BanglaStemmer
word = 'কবিরগুলিকে'
stm = BanglaStemmer().stem(word)
print(stm)

output: ['কবির', 'আমা', 'নামা']

Grammar Rule

The grammar rules applied here are based on the algorithm of Rafi Kamal. However some major and required modification has been made in this library in order to make it compatible with any model.  

Reference:    BanglaKit

Project details


Release history Release notifications

This version

1.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for bangla-stemmer, version 1.0
Filename, size File type Python version Upload date Hashes
Filename, size bangla_stemmer-1.0-py3-none-any.whl (9.1 kB) File type Wheel Python version py3 Upload date Hashes View hashes
Filename, size bangla-stemmer-1.0.tar.gz (8.2 kB) File type Source Python version None Upload date Hashes View hashes

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN DigiCert DigiCert EV certificate StatusPage StatusPage Status page