Skip to main content

NLP Text perprocessor

Project description

TextPreprocessing

This is the beta release of TextPreprocessing library. This library currenly capable of cleansing your text data for modal training.

TextPreprocessing library can do the below actions:

* Expand general abbreviations

* Clear email ids in the text data

* Clear web URLs

* Clear html tags present in the text dataset

* Clear gibberish charsets

* Lemetize the text

* Correct spelling errors.

We are enhancing this package on a regular basis and adding more flexible components to it in the upcoming releases. Please do update this package on frequently.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

TextPreprocessing-1.0.1.tar.gz (2.8 kB view details)

Uploaded Source

Built Distribution

TextPreprocessing-1.0.1-py3-none-any.whl (3.3 kB view details)

Uploaded Python 3

File details

Details for the file TextPreprocessing-1.0.1.tar.gz.

File metadata

  • Download URL: TextPreprocessing-1.0.1.tar.gz
  • Upload date:
  • Size: 2.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.5.0 pkginfo/1.8.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.9.7

File hashes

Hashes for TextPreprocessing-1.0.1.tar.gz
Algorithm Hash digest
SHA256 f195230c15aa5c241d13202ab09743cd2ea7446ba0c4f9b420250f7cede3d606
MD5 45b434322d54a402f4d69e9de02f1d95
BLAKE2b-256 19fecda1cfeed01da2f636a41969caa9cf4e5fd0f65f7ef9c5a2e8add327f40f

See more details on using hashes here.

File details

Details for the file TextPreprocessing-1.0.1-py3-none-any.whl.

File metadata

  • Download URL: TextPreprocessing-1.0.1-py3-none-any.whl
  • Upload date:
  • Size: 3.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.6.0 importlib_metadata/4.5.0 pkginfo/1.8.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.61.1 CPython/3.9.7

File hashes

Hashes for TextPreprocessing-1.0.1-py3-none-any.whl
Algorithm Hash digest
SHA256 88306fd8088b3e9fbb07f272e5299014570ff80877135a65e7a926145fa55f80
MD5 adb50b06975b76a2f82f65f5adc96c0d
BLAKE2b-256 8d844f4649d8bf7454dc52023e33439fa6f9368c1bd2657ed00a18d02b2a1f25

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page