Skip to main content

This package contains preprocessing functions

Project description

NLPPREPROCESS

NLPPREPROCESS is a preprocessing package for NLP task. The main objective of the package is to reduce time consumed for preprocessing by using ready made functions.

Requirements

  • Python 3.4 or higher

Installation

Using PIP via PyPI

$ pip install nlppreprocess

Manually via GIT

$ git clone git://github.com/gaganmanku96/nlppreprocess
$ cd nlppreprocess
$ python setup.py install

Functionalities

  1. Replaces words
  2. Remove stopwords
  3. Remove numbers
  4. Remove HTML tags
  5. Remove punctations

Usage

>>> from nlpuitls import NLP
>>> obj = NLP()

Parameters

>>> obj = NLP(
       replace_words=True,
       remove_stopwords=True,
       remove_numbers=True,
       remove_HTML_tags=True,
       remove_punctation=True
       )

Using with Pandas Library

>>> dataFrame['text'] = dataFrame['text].apply(obj.process)

Using with plain textx

>>> print(obj.process("Pass a text here"))

Add more stopwords

>>> obj = NLP()
>>> obj.add_stopword(['this', 'and this'])

Add more replace words

>>> obj = NLP()
>>> obj.add_replacement([this="by this", this="by this"])

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

nlppreprocess-1.0.1.tar.gz (4.5 kB view details)

Uploaded Source

File details

Details for the file nlppreprocess-1.0.1.tar.gz.

File metadata

  • Download URL: nlppreprocess-1.0.1.tar.gz
  • Upload date:
  • Size: 4.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.21.0 setuptools/41.0.1 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.7

File hashes

Hashes for nlppreprocess-1.0.1.tar.gz
Algorithm Hash digest
SHA256 57e168533b21a1e9f65dea2a7c08a7ca1420890dc0f735280f5190152bb31382
MD5 d530f9427d91763c0dec57f18f8f14fd
BLAKE2b-256 84c223535011b283941983aa04e20a5acf30cf3f52347f4533b2a32ebf4201c1

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page