
This package contains preprocessing functions


NLPPREPROCESS

NLPPREPROCESS is a preprocessing package for NLP tasks. Its main objective is to reduce the time spent on preprocessing by providing ready-made functions.

Requirements

  • Python 3.4 or higher

Installation

Using pip via PyPI

$ pip install nlppreprocess

Manually via Git

$ git clone https://github.com/gaganmanku96/nlppreprocess
$ cd nlppreprocess
$ python setup.py install

Functionalities

  1. Replace words
  2. Remove stopwords
  3. Remove numbers
  4. Remove HTML tags
  5. Remove punctuation
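
To make the five steps above concrete, here is a minimal standalone sketch that uses only the Python standard library. It is not the package's implementation: the function name `preprocess`, the tiny `STOPWORDS` and `REPLACEMENTS` tables, and the order in which the steps run are all assumptions made for illustration.

```python
import re
import string

# Hypothetical defaults for illustration only; the package ships its own lists.
STOPWORDS = {"a", "an", "the", "is", "are"}
REPLACEMENTS = {"can't": "can not", "won't": "will not"}


def preprocess(text, stopwords=STOPWORDS, replacements=REPLACEMENTS):
    """Apply the five listed steps in one fixed order (the order is an assumption)."""
    # Remove HTML tags first so tag names are never treated as words.
    text = re.sub(r"<[^>]+>", " ", text)
    # Replace words, matching whole whitespace-separated tokens case-insensitively.
    tokens = [replacements.get(tok.lower(), tok) for tok in text.split()]
    text = " ".join(tokens)
    # Remove numbers (any run of digits).
    text = re.sub(r"\d+", " ", text)
    # Remove punctuation (ASCII punctuation only).
    text = text.translate(str.maketrans("", "", string.punctuation))
    # Remove stopwords and collapse the leftover whitespace.
    tokens = [tok for tok in text.split() if tok.lower() not in stopwords]
    return " ".join(tokens)


print(preprocess("<p>The cat can't eat 3 fish!</p>"))
```

Word replacement runs before punctuation removal in this sketch so that contractions such as "can't" are expanded while the apostrophe is still present.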

Usage

>>> from nlppreprocess import NLP
>>> obj = NLP()

Parameters

>>> obj = NLP(
       replace_words=True,
       remove_stopwords=True,
       remove_numbers=True,
       remove_HTML_tags=True,
       remove_punctation=True
       )

Using with the pandas library

>>> dataFrame['text'] = dataFrame['text'].apply(obj.process)

Using with plain text

>>> print(obj.process("Pass a text here"))

Add more stopwords

>>> obj = NLP()
>>> obj.add_stopword(['this', 'and this'])
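
As a rough illustration of what extending the stopword list does, the sketch below keeps stopwords in a set and filters whitespace-separated tokens against it. The `StopwordFilter` class is hypothetical and is not taken from the package; it only mirrors the `add_stopword` and `process` method names used above.

```python
class StopwordFilter:
    """Hypothetical stand-in used only to illustrate stopword extension."""

    def __init__(self, stopwords=None):
        # Store stopwords lowercased so matching is case-insensitive.
        self.stopwords = {w.lower() for w in (stopwords or [])}

    def add_stopword(self, words):
        # Accept a list of extra stopwords, as in the call shown above.
        self.stopwords.update(w.lower() for w in words)

    def process(self, text):
        # Keep only the tokens that are not in the stopword set.
        return " ".join(t for t in text.split() if t.lower() not in self.stopwords)


f = StopwordFilter(["the"])
f.add_stopword(["this", "that"])
print(f.process("This is the thing"))
```

Because this sketch matches one token at a time, a multi-word entry such as 'and this' would never match here. Whether the package itself handles multi-word stopwords is not stated in this description.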

Add more replace words

>>> obj = NLP()
>>> obj.add_replacement(this="by this", that="by that")
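
One plausible reading of the call above is that each keyword argument maps a word to its replacement. The sketch below shows that idea with a hypothetical `Replacer` class; the keyword-argument signature and the whole-word, case-sensitive matching are assumptions, not confirmed behavior of the package.

```python
import re


class Replacer:
    """Hypothetical stand-in used only to illustrate word replacement."""

    def __init__(self):
        self.replacements = {}

    def add_replacement(self, **pairs):
        # Each keyword is a word to find; its value is the replacement text.
        self.replacements.update(pairs)

    def process(self, text):
        if not self.replacements:
            return text
        # Match whole words only, so "u" does not match inside "under".
        alternatives = "|".join(re.escape(word) for word in self.replacements)
        pattern = re.compile(r"\b(" + alternatives + r")\b")
        return pattern.sub(lambda m: self.replacements[m.group(1)], text)


r = Replacer()
r.add_replacement(u="you", gr8="great")
print(r.process("u are gr8 under pressure"))
```

Keyword arguments restrict the words to valid Python identifiers, so an entry such as "can't" could not be passed this way; a dictionary argument would be needed for that case.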

