Skip to main content

Text preprocesssing

Project description

Overview

documentation

This documentation is all about PParser package. PParser designes to help developers to perprocessing their text automatically! Also it has many useful features that makes perprocessing more fun! However, This is not an exhaustive description but it should show you how use the package effortlessly.

Introduction

PParser is an integrated package uses many famous packages. Moreover, PParser supports multi languages. In the below table you can see all valid operations accomplishing by PParser and their corresponder packages.

Operations

Keyword

Packages

normalize

NORMALIZE

HAZM, PARSIVAR

sent tokenize

S_TOKENIZE

HAZM, PARSIVAR

word tokenize

W_TOKENIZE

HAZM, PARSIVAR

lemmatize

LEMMATIZE

HAZM

stem

STEM

HAZM, PARSIVAR

Features

This section provides a list of possible features supported by PParser. It able to:

  • Use GPU

  • Use multi thread

  • Add custom stopwords

  • Use multi processors

  • Separate files for using GPU

  • Remove specify range of characters

  • Remove digits and Non-Persian letters

  • Convert fnglish letters to persian letters

Installation

for installing, you can simpley use pip to install the package.

>>> pip install -i https://test.pypi.org/simple/ mPPars.

Usage

In this section we are going to see the simple usage of PParser package.

https://gitlab.com/mostafarahgouy/pparser/-/raw/mostafa-dev/images/guideline.gif

Examples

https://gitlab.com/mostafarahgouy/pparser/-/raw/mostafa-dev/images/guideline.gif https://gitlab.com/mostafarahgouy/pparser/-/blob/mostafa-dev/images/example_of_validation.png

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PartNLP-0.0.1.tar.gz (11.6 kB view details)

Uploaded Source

Built Distribution

PartNLP-0.0.1-py3-none-any.whl (53.4 kB view details)

Uploaded Python 3

File details

Details for the file PartNLP-0.0.1.tar.gz.

File metadata

  • Download URL: PartNLP-0.0.1.tar.gz
  • Upload date:
  • Size: 11.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for PartNLP-0.0.1.tar.gz
Algorithm Hash digest
SHA256 7decfd7bd3835de69d4fd0420e70a47993035072a637446fd69a24f05d74dc79
MD5 41565c7b564ab8f45df0178d3295e0ea
BLAKE2b-256 b57345786120549509fe4e48379a635765ac3330a3835a28f6fe6f25b2bde8e1

See more details on using hashes here.

File details

Details for the file PartNLP-0.0.1-py3-none-any.whl.

File metadata

  • Download URL: PartNLP-0.0.1-py3-none-any.whl
  • Upload date:
  • Size: 53.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for PartNLP-0.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 453f9cf19fa5a6a7ffff2ce53a97be4fe4495a0495d6fcde14fdec0dbf11e7f0
MD5 1af81fdbc6935a9e4af50042be31f43a
BLAKE2b-256 639a3664a08b10af8fe9b6d5986e5e7f56a2a72b0639812dfea510d6238ab981

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page