Skip to main content

Text pre-processing

Project description

Overview

documentation

This documentation is all about PartNLP package. PartNLP designes to help developers to perprocessing their text automatically! Also it has many useful features that makes perprocessing more fun! However, This is not an exhaustive description but it should show you how use the package effortlessly.

Introduction

PartNLP is an integrated package uses many famous packages. Moreover, PartNLP supports multi languages. In the below table you can see all valid operations accomplishing by PartNLP and their corresponder packages.

Operations

Keyword

Packages

normalize

NORMALIZE

HAZM, PARSIVAR

sent tokenize

S_TOKENIZE

HAZM, PARSIVAR, STANZA

word tokenize

W_TOKENIZE

HAZM, PARSIVAR, STANZA

lemmatize

LEMMATIZE

HAZM, STANZA

stem

STEM

HAZM, PARSIVAR, STANZA

Features

This section provides a list of possible features supported by PartNLP. It able to:

  • Use GPU

  • Use multi thread

  • Use multi processors

  • Add custom stopwords

  • Separate files for using GPU

  • Remove specify range of characters

  • Remove digits and Non-Persian letters

  • Convert fnglish letters to persian letters

Installation

for installing, you can simpley use pip to install the package.

>>> pip install -i https://test.pypi.org/simple/PartNLP

Usage

In this section we are going to see the simple usage of PartNLP package.

https://gitlab.com/mostafarahgouy/pparser/-/raw/mostafa-dev/images/demo.gif

Examples

Simple example:

>>> from PartNLP import Pipeline
https://gitlab.com/mostafarahgouy/pparser/-/raw/mostafa-dev/images/usage_example_scale.png
https://gitlab.com/mostafarahgouy/pparser/-/raw/mostafa-dev/images/validation_example_scale.png

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

PartNLP-0.1.0.tar.gz (11.7 kB view details)

Uploaded Source

Built Distribution

PartNLP-0.1.0-py3-none-any.whl (53.5 kB view details)

Uploaded Python 3

File details

Details for the file PartNLP-0.1.0.tar.gz.

File metadata

  • Download URL: PartNLP-0.1.0.tar.gz
  • Upload date:
  • Size: 11.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for PartNLP-0.1.0.tar.gz
Algorithm Hash digest
SHA256 4f290d1af23aaeb2be3644005a48000114dcd03a91ba40fe6f8b110accdbfd27
MD5 8b948c1d241036f8ff893ce5b7bf176d
BLAKE2b-256 3a8b48d1c24667c89e73ae9437cd94e1a698943e714bb5b68f14a4aa767ddc17

See more details on using hashes here.

File details

Details for the file PartNLP-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: PartNLP-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 53.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.22.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.42.1 CPython/3.7.6

File hashes

Hashes for PartNLP-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 40f44b704e87f374006123ca9b6bffe69e9c665583d60027da37c8bd707314f1
MD5 6e3df9fde8b88914677dc4b29ab55206
BLAKE2b-256 da890d41d208abe87e5f01fb3792ee09e3b751fae409dd8550a5957231034903

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page