Skip to main content

NLP Text perprocessor

Project description

TextPreprocessing

This is the beta release of TextPreprocessing library. This library currenly capable of cleansing your text data for modal training.

TextPreprocessing library can do the below actions:

* Expand general abbreviations

* Clear email ids in the text data

* Clear web URLs

* Clear html tags present in the text dataset

* Clear gibberish charsets

* Lemetize the text

* Correct spelling errors.

We are enhancing this package on a regular basis and adding more flexible components to it in the upcoming releases. Please do update this package on frequently.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

TextPreprocessing-1.0.0.tar.gz (2.8 kB view details)

Uploaded Source

Built Distribution

TextPreprocessing-1.0.0-py3-none-any.whl (3.3 kB view details)

Uploaded Python 3

File details

Details for the file TextPreprocessing-1.0.0.tar.gz.

File metadata

  • Download URL: TextPreprocessing-1.0.0.tar.gz
  • Upload date:
  • Size: 2.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.5.0 pkginfo/1.8.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.9.7

File hashes

Hashes for TextPreprocessing-1.0.0.tar.gz
Algorithm Hash digest
SHA256 2fef66001ebe140f6eace537f2f975ee74bbc1076a0bf7668f10027abaca5589
MD5 fd6e1a2b3d0c323b1ea91c249cd4955a
BLAKE2b-256 83afb566a2cfeb309b877e68a7be61c6a22a867bb9be3dfba2cc2177fab01c5b

See more details on using hashes here.

File details

Details for the file TextPreprocessing-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: TextPreprocessing-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 3.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.5.0 pkginfo/1.8.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.9.7

File hashes

Hashes for TextPreprocessing-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 a44cb488933fa37935d3e148092b2168b9ee9d270607e255d6f1ef389809efb5
MD5 be66dfd551d62142a5728364fe64b3ea
BLAKE2b-256 16735dc4ef0b5f10ca31b527853b0b1ddcf6568a258126cbcb071d7aa4f46959

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page