Skip to main content

A Python package to get stem of any inflected Bangla words.

Project description

Fatick Stemmer

Description

Bengali is one of the most morphologically rich languages and it has lots of inflectional and derivational variant forms of a word. Because of that it is quite complicated to determine the stem of word.

Therefore our main focus was to program a light-weight library to determine a identical word for same type of inflected word as stem to run on any model.

Installation

Run the following to install:

  • using pip
pip install bangla-stemmer
  • using git
git clone https://github.com/Fatick-DevStudio/Bangla-stemmer.git
cd Bangla-stemmer
python setup.py install

Usages

Example 01:

from bangla_stemmer.stemmer import stemmer
wordlist = ['কবিরগুলিকে', 'আমাকে', 'নামাবার']
stmr = stemmer.BanglaStemmer()
stm = stmr.stem(word)
print(stm)

output: ['কবির', 'আমা', 'নামা']

Example 01:

from bangla_stemmer.stemmer.stemmer import BanglaStemmer
word = 'কবিরগুলিকে'
stm = BanglaStemmer().stem(word)
print(stm)

output: ['কবির', 'আমা', 'নামা']

Grammar Rule

The grammar rules applied here are based on the algorithm of Rafi Kamal. However some major and required modification has been made in this library in order to make it compatible with any model.  

Reference:    BanglaKit

Project details


Release history Release notifications | RSS feed

This version

1.0

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

bangla-stemmer-1.0.tar.gz (8.2 kB view details)

Uploaded Source

Built Distribution

bangla_stemmer-1.0-py3-none-any.whl (9.1 kB view details)

Uploaded Python 3

File details

Details for the file bangla-stemmer-1.0.tar.gz.

File metadata

  • Download URL: bangla-stemmer-1.0.tar.gz
  • Upload date:
  • Size: 8.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.3

File hashes

Hashes for bangla-stemmer-1.0.tar.gz
Algorithm Hash digest
SHA256 71b01c3b0a4306f68b37db7a4535e4ecf6407fc69e1346edd57da5c06b00ef71
MD5 7081451a39ac1ea8f9559351c5264df5
BLAKE2b-256 35ff957fb33e6f7083e92a6dc05959ba4ce61ce8461959f383bdb02c8ad37162

See more details on using hashes here.

File details

Details for the file bangla_stemmer-1.0-py3-none-any.whl.

File metadata

  • Download URL: bangla_stemmer-1.0-py3-none-any.whl
  • Upload date:
  • Size: 9.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/40.8.0 requests-toolbelt/0.9.1 tqdm/4.41.1 CPython/3.7.3

File hashes

Hashes for bangla_stemmer-1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 77b55acb8bafe18fb9984e333dde9107d58496a801b92c572d9c523b3dd0bfd4
MD5 c55f85455da0abf15f6be529a5e3eb90
BLAKE2b-256 0b9443286b4f9948268e0468d6203a1a1ad6f2ccb2273b7d952daf99413d67e5

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page