Text preprocessing
Overview
This documentation describes the PParser package. PParser is designed to help developers preprocess their text automatically, and it has many useful features that make preprocessing more convenient. This is not an exhaustive description, but it should show you how to use the package with little effort.
Introduction
PParser is an integrated package that builds on several well-known packages, and it supports multiple languages. The table below lists the operations PParser can perform, the keyword for each operation, and the corresponding packages.
Operations | Keyword | Packages |
---|---|---|
normalize | NORMALIZE | HAZM, PARSIVAR |
sent tokenize | S_TOKENIZE | HAZM, PARSIVAR |
word tokenize | W_TOKENIZE | HAZM, PARSIVAR |
lemmatize | LEMMATIZE | HAZM |
stem | STEM | HAZM, PARSIVAR |
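One way to picture the keyword column is as a lookup from keyword to operation. The sketch below is only an illustration of that idea using the standard library. The functions `normalize`, `sent_tokenize`, `word_tokenize`, and `run`, and the `OPERATIONS` dictionary, are hypothetical stand-ins and are not PParser's actual API or the HAZM/PARSIVAR implementations.

```python
import re

# Hypothetical stand-ins for the real backends (HAZM, PARSIVAR).
# They only mimic the shape of each operation, not its quality.

def normalize(text):
    """Collapse runs of whitespace and trim both ends."""
    return re.sub(r"\s+", " ", text).strip()

def sent_tokenize(text):
    """Split after '.', '!', '?' or the Persian question mark."""
    return [s for s in re.split(r"(?<=[.!?؟])\s+", text) if s]

def word_tokenize(text):
    """Split on whitespace."""
    return text.split()

# Keyword -> operation, mirroring the Keyword column of the table.
OPERATIONS = {
    "NORMALIZE": normalize,
    "S_TOKENIZE": sent_tokenize,
    "W_TOKENIZE": word_tokenize,
}

def run(keyword, text):
    """Look up an operation by keyword and apply it to the text."""
    try:
        operation = OPERATIONS[keyword]
    except KeyError:
        raise ValueError(f"unknown operation keyword: {keyword!r}") from None
    return operation(text)
```

With this layout, supporting another operation such as LEMMATIZE or STEM would mean adding one more entry to the dictionary.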
Features
This section lists the features supported by PParser. It is able to:
- Use GPU
- Use multiple threads
- Add custom stopwords
- Use multiple processors
- Separate files for using GPU
- Remove a specified range of characters
- Remove digits and non-Persian letters
- Convert Finglish letters to Persian letters
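To make a few of these features concrete, here is a minimal standard-library sketch of removing digits and non-Persian letters, filtering custom stopwords, and spreading the work over several threads. It is not PParser's implementation. The names `clean`, `clean_many`, and `NON_PERSIAN` are hypothetical, and the character set treated as "Persian" is a simplified assumption.

```python
import re
from concurrent.futures import ThreadPoolExecutor

# Assumption: "Persian letters" means the basic Arabic-script letter block
# plus the Persian-specific letters and the zero-width non-joiner.
# Everything else (Latin letters, Latin and Persian digits, punctuation)
# is replaced by a space.
NON_PERSIAN = re.compile(
    r"[^\u0621-\u064A\u067E\u0686\u0698\u06A9\u06AF\u06CC\u200C\s]"
)

def clean(text, stopwords=frozenset()):
    """Drop non-Persian characters, then drop any custom stopwords."""
    text = NON_PERSIAN.sub(" ", text)
    words = [w for w in text.split() if w not in stopwords]
    return " ".join(words)

def clean_many(texts, stopwords=frozenset(), workers=4):
    """Clean several documents with a thread pool, preserving input order."""
    with ThreadPoolExecutor(max_workers=workers) as pool:
        return list(pool.map(lambda t: clean(t, stopwords), texts))
```

A call such as `clean_many(documents, stopwords={"این", "است"})` would return one cleaned string per input document. In CPython, threads mainly help when the work is I/O-bound (for example, reading files), so for CPU-heavy cleaning a process pool is likely the better fit for the multi-processor feature.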
Installation
To install the package, you can simply use pip:
>>> pip install -i https://test.pypi.org/simple/ mPPars.
Usage
This section shows basic usage of the PParser package.
Examples