This package contains preprocessing functions

NLPPREPROCESS

NLPPREPROCESS is a preprocessing package for NLP tasks. Its main objective is to reduce the time spent on preprocessing by providing ready-made functions.

Requirements

  • Python 3.4 or higher

Installation

Using pip via PyPI

$ pip install nlppreprocess

Manually via Git

$ git clone https://github.com/gaganmanku96/nlppreprocess
$ cd nlppreprocess
$ python setup.py install

Functionalities

  1. Replace words
  2. Remove stopwords
  3. Remove numbers
  4. Remove HTML tags
  5. Remove punctuation
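
To make the five steps concrete, here is a minimal standard-library sketch of what each one typically does. It is not the package's implementation: the function names, the STOPWORDS and REPLACEMENTS values, and the order of the steps are all assumptions chosen for illustration.

```python
import re
import string

# Illustrative values only (assumption); the package ships its own lists.
STOPWORDS = {"a", "an", "the", "is", "this"}
REPLACEMENTS = {"can't": "cannot", "won't": "will not"}


def replace_words(text):
    """Swap each whitespace-separated word found in REPLACEMENTS."""
    return " ".join(REPLACEMENTS.get(word, word) for word in text.split())


def remove_html_tags(text):
    """Replace anything that looks like an HTML tag with a space."""
    return re.sub(r"<[^>]+>", " ", text)


def remove_numbers(text):
    """Replace runs of digits with a space."""
    return re.sub(r"\d+", " ", text)


def remove_punctuation(text):
    """Delete ASCII punctuation characters."""
    return text.translate(str.maketrans("", "", string.punctuation))


def remove_stopwords(text):
    """Drop words that appear in STOPWORDS (case-insensitive)."""
    return " ".join(w for w in text.split() if w.lower() not in STOPWORDS)


def preprocess(text):
    """Apply the five steps in one fixed, illustrative order."""
    text = remove_html_tags(text)
    text = replace_words(text)       # before punctuation removal, so "won't" still matches
    text = remove_numbers(text)
    text = remove_punctuation(text)
    return remove_stopwords(text)


print(preprocess("<p>This is an example, it won't cost 100 dollars!</p>"))
# example it will not cost dollars
```

The order matters in this sketch: replacing words after stripping punctuation would turn "won't" into "wont" and the replacement would no longer match.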

Usage

>>> from nlppreprocess import NLP
>>> obj = NLP()

Parameters

>>> obj = NLP(
       replace_words=True,
       remove_stopwords=True,
       remove_numbers=True,
       remove_HTML_tags=True,
       remove_punctation=True
       )

Using with Pandas Library

>>> dataFrame['text'] = dataFrame['text'].apply(obj.process)

Using with plain text

>>> print(obj.process("Pass a text here"))

Add more stopwords

>>> obj = NLP()
>>> obj.add_stopword(['this', 'and this'])

Add more replacement words

>>> obj = NLP()
>>> obj.add_replacement(this="by this", that="by that")
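
As a rough picture of how extra stopwords and replacements could feed into processing, the sketch below uses a hypothetical TinyPreprocessor class. It is a stand-in written for this example, not the package's NLP class; the keyword-argument form of add_replacement and the default word lists are assumptions.

```python
class TinyPreprocessor:
    """Hypothetical stand-in for illustration; not the package's NLP class."""

    def __init__(self, replace_words=True, remove_stopwords=True):
        self.replace_words = replace_words
        self.remove_stopwords = remove_stopwords
        self.stopwords = {"a", "the"}       # illustrative defaults (assumption)
        self.replacements = {"u": "you"}    # illustrative defaults (assumption)

    def add_stopword(self, words):
        """Extend the stopword set with a list of extra words."""
        self.stopwords.update(words)

    def add_replacement(self, **pairs):
        """Register extra word -> replacement pairs given as keywords."""
        self.replacements.update(pairs)

    def process(self, text):
        """Run the enabled steps over whitespace-separated words."""
        words = text.split()
        if self.replace_words:
            words = [self.replacements.get(w, w) for w in words]
        if self.remove_stopwords:
            words = [w for w in words if w.lower() not in self.stopwords]
        return " ".join(words)


p = TinyPreprocessor()
p.add_stopword(["please"])
p.add_replacement(thx="thanks")
print(p.process("thx u please read the docs"))
# thanks you read docs
```

Because this sketch matches one whitespace-separated word at a time, a multi-word entry such as 'and this' would never match here; whether the package handles such entries is not stated above.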
