Skip to main content

Text pre-processing

Project description

Overview

documentation

This documentation is all about PartNLP package. PartNLP designes to help developers to perprocessing their text automatically! Also it has many useful features that makes perprocessing more fun! However, This is not an exhaustive description but it should show you how use the package effortlessly.

Introduction

PartNLP is an integrated package uses many famous packages. Moreover, PartNLP supports multi languages. In the below table you can see all valid operations accomplishing by PartNLP and their corresponder packages.

Operations

Keyword

Packages

normalize

NORMALIZE

HAZM, PARSIVAR

sent tokenize

S_TOKENIZE

HAZM, PARSIVAR, STANZA

word tokenize

W_TOKENIZE

HAZM, PARSIVAR, STANZA

lemmatize

LEMMATIZE

HAZM, STANZA

stem

STEM

HAZM, PARSIVAR, STANZA

Features

This section provides a list of possible features supported by PartNLP. It able to:

  • Use GPU

  • Use multi thread

  • Use multi processors

  • Add custom stopwords

  • Separate files for using GPU

  • Remove specify range of characters

  • Remove digits and Non-Persian letters

  • Convert fnglish letters to persian letters

Installation

for installing, you can simpley use pip to install the package.

>>> pip install -i https://test.pypi.org/simple/PartNLP

Usage

In this section we are going to see the simple usage of PartNLP package.

https://gitlab.com/mostafarahgouy/pparser/-/raw/mostafa-dev/images/demo.gif

Examples

Simple example:

>>> from PartNLP import Pipeline
https://gitlab.com/mostafarahgouy/pparser/-/raw/mostafa-dev/images/usage_example_scale.png
https://gitlab.com/mostafarahgouy/pparser/-/raw/mostafa-dev/images/validation_example_scale.png

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PartNLP-0.0.2.tar.gz (11.7 kB view details)

Uploaded Source

Built Distribution

PartNLP-0.0.2-py3-none-any.whl (53.4 kB view details)

Uploaded Python 3

File details

Details for the file PartNLP-0.0.2.tar.gz.

File metadata

  • Download URL: PartNLP-0.0.2.tar.gz
  • Upload date:
  • Size: 11.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for PartNLP-0.0.2.tar.gz
Algorithm Hash digest
SHA256 71535907d877e105dd3a27a8f5d409823ddbee90b5558392cb5778869a61aadf
MD5 ba0ecad9a3f84ea5dda9eb9bfcd9f1be
BLAKE2b-256 8f777405f1bb0c4d9a197bf4c08d91fb2ac788233a0a162903a023f6d5383ee4

See more details on using hashes here.

File details

Details for the file PartNLP-0.0.2-py3-none-any.whl.

File metadata

  • Download URL: PartNLP-0.0.2-py3-none-any.whl
  • Upload date:
  • Size: 53.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for PartNLP-0.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 435e996c3460516defb718c2f4227d5e68f59a5059ffaac320160d5fc9aa404c
MD5 738e69cc1d506ca4633209365dc3ab4f
BLAKE2b-256 9817f3e9259b4d1af7ec9ef383ccec6b4aaa3f086cd6a870cbc435cc5839a2e9

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page