Project description

Bangla NLTK

banglanltk is a python package for Bengali Natural Language Processing Toolkit. It includes modules for Cleaning Text, Word Tokenization, Sentence Tokenization, Parts of speech tagging and Synonym.

Documents

Installation

pip install banglanltk

Usage

Cleaning Text

import banglanltk as bn
s = 'আজ আকাশ পরিষ্কার!!! মনে হয় আজ আর বৃষ্টি হবে না .........!'

print(bn.clean_text(s))

Word Tokenization

import banglanltk as bn
s = 'প্রাচীন কালে মানুষ একসময় সংখ্যা বুঝানোর জন্য ঝিনুক, নুড়ি, দড়ির গিট ইত্যাদি ব্যবহার করত।'

print(bn.word_tokenize(s))

Sentence Tokenization

import banglanltk as bn
s = ''' কম্পিউটার শব্দটি গ্রিক "কম্পিউট" শব্দ থেকে এসেছে। Compute শব্দের অর্থ গণনা করা। আর কম্পিউটার শব্দের অর্থ গণনাকারী যন্ত্র। '''

print(bn.sent_tokenize(s))

POS Tagging

import banglanltk as bn

print(bn.pos_tag('কম্পিউটার'))

Synonym

import banglanltk as bn

print(bn.synonym('হাত'))

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.0.4

Aug 4, 2020

This version

0.0.3

Jul 29, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

banglanltk-0.0.3-py3-none-any.whl (481.0 kB view hashes)

Uploaded Jul 29, 2020 Python 3

Hashes for banglanltk-0.0.3-py3-none-any.whl

Hashes for banglanltk-0.0.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`8709e54f78e3b822010059e03595e2401a5ecfc3133107aa5a2f512c4ac44551`
MD5	`409aa3bcf8a0c0667abcf9f040dafe60`
BLAKE2b-256	`d06fd51a5be41a4bd8fad26325186ed0e4ce613fda6c279846c4b5fa8d8ad6db`