A Python NLP library for the Persian language, by PartDP AI
PartNLP Project
Overview
This documentation describes the PartNLP package. PartNLP is designed to help developers preprocess their text automatically. It also has many useful features that make preprocessing more fun. This is not an exhaustive description, but it should show you how to use the package effortlessly.
Introduction
PartNLP is an integrated package that builds on several well-known packages. It also supports multiple languages. The table below lists every operation PartNLP supports, its keyword, and the corresponding packages.
Operations | Keyword | Packages |
---|---|---|
normalize | NORMALIZE | HAZM, PARSIVAR |
sent tokenize | S_TOKENIZE | HAZM, PARSIVAR, STANZA |
word tokenize | W_TOKENIZE | HAZM, PARSIVAR, STANZA |
lemmatize | LEMMATIZE | HAZM, STANZA |
stem | STEM | HAZM, PARSIVAR, STANZA |
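The operation table can also be read as a lookup. The following is a minimal sketch, not part of PartNLP: the dictionary `SUPPORTED` and the helpers `packages_for` and `unsupported` are hypothetical names introduced here, and the data is copied from the table. It may help when checking a package and processor combination before building a pipeline.

```python
# Hypothetical lookup built from the operations table; not PartNLP's own API.
SUPPORTED = {
    "NORMALIZE": {"HAZM", "PARSIVAR"},
    "S_TOKENIZE": {"HAZM", "PARSIVAR", "STANZA"},
    "W_TOKENIZE": {"HAZM", "PARSIVAR", "STANZA"},
    "LEMMATIZE": {"HAZM", "STANZA"},
    "STEM": {"HAZM", "PARSIVAR", "STANZA"},
}


def packages_for(keyword):
    """Return the sorted package names the table lists for an operation keyword."""
    return sorted(SUPPORTED[keyword.upper()])


def unsupported(package, processors):
    """Return the processors that the table does not list for the given package."""
    pkg = package.upper()
    return [p for p in processors if pkg not in SUPPORTED.get(p.upper(), set())]


print(packages_for("lemmatize"))
print(unsupported("parsivar", ["W_TOKENIZE", "LEMMATIZE"]))
```

Going by the table, `LEMMATIZE` is not listed for PARSIVAR, so the second call reports it as unsupported.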
Installation
To install the package, use pip:
$ pip install PartNLP
Pipeline Usage Example
>>> from PartNLP import Pipeline
>>> Pipeline(lang='persian', package='hazm', processors=['W_TOKENIZE', 'LEMMATIZE'], text='این متن، جهت بررسی عملکرد بسته نوشته شده است')
Output of Hazm word tokenizer:
['این', 'متن', '،', 'جهت', 'بررسی', 'عملکرد', 'بسته', 'نوشته_شده_است']
Output of Hazm Lemmatizer :
['این', 'متن', '،', 'جهت', 'بررسی', 'عملکرد', 'بسته', 'نویس']
Pipeline can also handle missing required arguments. In the example below, no package, language, or processors are passed, so Pipeline prompts you to enter each of them.
>>> Pipeline(text='این متن، جهت بررسی عملکرد بسته نوشته شده است')
Warning: no package selected. List of supported packages:['HAZM', 'PARSIVAR', 'STANZA']
please enter a valid value: 'hazm'
Warning: no language selected. List of supported languages:['ENGLISH', 'PERSIAN']
please enter a valid value: 'persian'
Warning: no operator selected. List of supported operations for 'hazm' package :['NORMALIZE', 'S_TOKENIZE', 'STEM', 'W_TOKENIZE', 'LEMMATIZE']
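The prompting behaviour above can be approximated without interactive input. The following is a rough sketch only, assuming nothing about how PartNLP implements it: `find_missing` is a hypothetical helper, and the supported values are copied from the warnings shown in the session above.

```python
# Hypothetical, non-interactive sketch; not PartNLP's actual implementation.
SUPPORTED_PACKAGES = ["HAZM", "PARSIVAR", "STANZA"]
SUPPORTED_LANGUAGES = ["ENGLISH", "PERSIAN"]


def find_missing(package=None, lang=None, processors=None):
    """Return one warning message for each required argument that was not given."""
    messages = []
    if package is None:
        messages.append(
            f"Warning: no package selected. List of supported packages:{SUPPORTED_PACKAGES}"
        )
    if lang is None:
        messages.append(
            f"Warning: no language selected. List of supported languages:{SUPPORTED_LANGUAGES}"
        )
    if not processors:
        messages.append("Warning: no operator selected.")
    return messages


# Only the text is given, as in the example above, so three warnings are produced.
for message in find_missing():
    print(message)
```

Collecting the warnings in a list, instead of prompting, keeps the check usable in scripts and tests where no user is available to answer.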
Interface Usage Example
This section shows simple usage of the PartNLP package.