NLP Text preprocessor

Project description

TextPreprocessing

This is the beta release of the TextPreprocessing library. The library can currently cleanse text data for model training.

The TextPreprocessing library can perform the following actions:

* Expand general abbreviations

* Clear email addresses from the text data

* Clear web URLs

* Clear HTML tags present in the text dataset

* Clear gibberish charsets

* Lemmatize the text

* Correct spelling errors
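The library's own API is not documented on this page, so the following is only a minimal sketch of what several of the listed steps could look like, written with the Python standard library alone. The function name `clean_text`, the `ABBREVIATIONS` table, and all of the regular expressions are assumptions made for illustration and are not taken from TextPreprocessing. Lemmatization and spelling correction are left out because they need external language resources.

```python
import html
import re

# Hypothetical abbreviation table (illustration only, not the library's list).
ABBREVIATIONS = {
    "can't": "cannot",
    "won't": "will not",
    "i'm": "i am",
    "it's": "it is",
}

# Simplified patterns; real-world emails and URLs are more varied than this.
EMAIL_RE = re.compile(r"\b[\w.+-]+@[\w-]+(?:\.[\w-]+)+\b")
URL_RE = re.compile(r"(?:https?://|www\.)\S+", re.IGNORECASE)
TAG_RE = re.compile(r"<[^>]+>")
NON_TEXT_RE = re.compile(r"[^a-z0-9\s']")
ABBREV_RE = re.compile(
    r"\b(" + "|".join(re.escape(key) for key in ABBREVIATIONS) + r")\b"
)


def clean_text(text: str) -> str:
    """Apply a few of the cleaning steps listed above, in a fixed order."""
    text = TAG_RE.sub(" ", text)        # clear HTML tags
    text = html.unescape(text)          # decode entities such as &amp;
    text = EMAIL_RE.sub(" ", text)      # clear email addresses
    text = URL_RE.sub(" ", text)        # clear web URLs
    text = text.lower()
    # Expand abbreviations found in the hypothetical table.
    text = ABBREV_RE.sub(lambda m: ABBREVIATIONS[m.group(1)], text)
    text = NON_TEXT_RE.sub(" ", text)   # drop remaining stray characters
    return " ".join(text.split())       # collapse repeated whitespace


if __name__ == "__main__":
    sample = "<p>I can't open https://example.com, mail me at bob@example.com!</p>"
    print(clean_text(sample))
```

The order matters in this sketch: tags are stripped before entities are decoded so that decoded `<` or `>` characters are not mistaken for markup, and emails and URLs are removed before punctuation is dropped so the patterns can still recognize them.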

We enhance this package regularly and will add more flexible components in upcoming releases. Please update the package frequently.

Download files

Source Distribution

TextPreprocessing-1.2.0.tar.gz (5.9 kB)

Uploaded Source

Built Distribution

TextPreprocessing-1.2.0-py3-none-any.whl (6.3 kB)

Uploaded Python 3

File details

Details for the file TextPreprocessing-1.2.0.tar.gz.

File metadata

  • Download URL: TextPreprocessing-1.2.0.tar.gz
  • Size: 5.9 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.5.0 pkginfo/1.8.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.9.7

File hashes

Hashes for TextPreprocessing-1.2.0.tar.gz
Algorithm Hash digest
SHA256 85ccb7668f0584f43a2f8128cd0b4a8bf129409c4417d91222c06641144e5f8c
MD5 5c3b9c74d08ec5010cb43a88efb441a6
BLAKE2b-256 cf45b5a57049e9c68348be8ff925dda17ebde5dba7fa85ee46a1cea566a34464

File details

Details for the file TextPreprocessing-1.2.0-py3-none-any.whl.

File metadata

  • Download URL: TextPreprocessing-1.2.0-py3-none-any.whl
  • Size: 6.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.5.0 pkginfo/1.8.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.9.7

File hashes

Hashes for TextPreprocessing-1.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 0a43fe6e31b2dd7e31c9083581be5d21783b27dbc53d482a3234d7ced8617913
MD5 708dc1c9fd2988645fd3234ebf77617b
BLAKE2b-256 eba2d15e84c8e824c7b005804383a61ed9bfa70418a3f8d2f063173eae1439aa
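
As a rough illustration of how the digests listed above might be used, the sketch below compares the SHA256 of a downloaded file against the values given on this page, using only `hashlib` from the Python standard library. The helper names `sha256_of` and `verify` are hypothetical, and the sketch assumes the file keeps its original name and sits at a local path you supply.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the file details on this page.
EXPECTED_SHA256 = {
    "TextPreprocessing-1.2.0.tar.gz":
        "85ccb7668f0584f43a2f8128cd0b4a8bf129409c4417d91222c06641144e5f8c",
    "TextPreprocessing-1.2.0-py3-none-any.whl":
        "0a43fe6e31b2dd7e31c9083581be5d21783b27dbc53d482a3234d7ced8617913",
}


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify(path):
    """Return True if the file's digest matches the one listed for its name."""
    expected = EXPECTED_SHA256[Path(path).name]
    return sha256_of(path) == expected
```

Calling `verify("TextPreprocessing-1.2.0.tar.gz")` on a downloaded copy would return `False` if the file was altered or truncated in transit; a file name that is not in the table raises `KeyError`.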
