A Python package that returns the stem of any inflected Bangla word.
Fatick Stemmer
Description
Bengali is one of the most morphologically rich languages: a single word can take many inflectional and derivational forms, which makes it hard to determine the stem of a word.
Our main goal was therefore a lightweight library that maps every inflected form of a word to one identical stem, so that its output can be used with any model.
Installation
Run the following to install:
- using pip
pip install bangla-stemmer
- using git
git clone https://github.com/Fatick-DevStudio/Bangla-stemmer.git
cd Bangla-stemmer
python setup.py install
Usage
Example 01:
from bangla_stemmer.stemmer import stemmer

wordlist = ['কবিরগুলিকে', 'আমাকে', 'নামাবার']
stmr = stemmer.BanglaStemmer()
stm = stmr.stem(wordlist)  # stem every word in the list
print(stm)
output: ['কবির', 'আমা', 'নামা']
Example 02:
from bangla_stemmer.stemmer.stemmer import BanglaStemmer

word = 'কবিরগুলিকে'
stm = BanglaStemmer().stem(word)
print(stm)
output: কবির
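A common next step is to stem every token of a sentence before handing it to a model. The following is a minimal sketch of that idea, not part of the package: `stem_tokens` and `demo_stem` are hypothetical names, and `demo_stem` is only a stand-in so the snippet runs without the package installed. In practice you would pass `BanglaStemmer().stem` as `stem_fn`, assuming it accepts a single word as in the single-word example above.

```python
from typing import Callable, Dict, List


def stem_tokens(text: str, stem_fn: Callable[[str], str]) -> List[str]:
    """Split text on whitespace and stem each token, caching repeated words."""
    cache: Dict[str, str] = {}
    stems: List[str] = []
    for token in text.split():
        if token not in cache:
            cache[token] = stem_fn(token)
        stems.append(cache[token])
    return stems


def demo_stem(word: str) -> str:
    """Hypothetical stand-in stemmer: drops a trailing 'কে' and nothing else."""
    return word[:-2] if word.endswith('কে') else word


print(stem_tokens('আমাকে বই দাও আমাকে', demo_stem))
# ['আমা', 'বই', 'দাও', 'আমা']
```

The cache means each distinct word is stemmed once, which helps on large corpora where the same inflected forms recur often.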
Grammar Rule
The grammar rules applied here are based on Rafi Kamal's algorithm. However, some major modifications were required to make the library compatible with any model.
Reference: ⮑ BanglaKit
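To give a feel for rule-based stemming in general, here is a minimal suffix-stripping sketch. It is not the library's implementation and not Rafi Kamal's rule set: the suffix list `SUFFIXES`, the function `strip_suffix`, and the `min_stem_len` guard are all assumptions chosen for illustration. It removes the longest matching suffix in a single pass, as long as the remaining stem is not too short.

```python
from typing import Iterable

# Hypothetical suffix list for illustration only; the library's real rules differ.
SUFFIXES = ('গুলিকে', 'গুলি', 'দের', 'বার', 'কে', 'রা')


def strip_suffix(word: str,
                 suffixes: Iterable[str] = SUFFIXES,
                 min_stem_len: int = 2) -> str:
    """Remove the longest matching suffix, keeping at least min_stem_len characters."""
    # Try longer suffixes first so 'গুলিকে' wins over 'কে'.
    for suffix in sorted(suffixes, key=len, reverse=True):
        stem = word[: len(word) - len(suffix)]
        if word.endswith(suffix) and len(stem) >= min_stem_len:
            return stem
    # No rule applied: return the word unchanged.
    return word


print([strip_suffix(w) for w in ['কবিরগুলিকে', 'আমাকে', 'নামাবার']])
# ['কবির', 'আমা', 'নামা']
```

Trying the longest suffix first avoids leaving a partial ending behind, and the minimum stem length keeps very short words from being stripped down to nothing.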