One step Python package for NLP friendly data preprocessing.
Project description
nlptextprep
One Step Python Package to preprocess text data for NLP tasks. This package is designed to clean, transform, and standardize text data, making it an ideal choice for applications in natural language processing, text analysis, and data cleaning.
Installation
pip install nlptextprep
Usage
from nlptextprep import preprocess_text
text = "This is a sample text containing a #hashtag, 100+ @mention in 70% , (4 - 2)/7 and line breaks in https://botpenguin.com .\nCheck it out!"
cleaned_text = preprocess_text(text)
print(cleaned_text)
Note
This project has been set up using PyScaffold 4.6. For details and usage information on PyScaffold see https://pyscaffold.org/.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file nlptextprep-0.0.3.tar.gz.
File metadata
- Download URL: nlptextprep-0.0.3.tar.gz
- Upload date:
- Size: 23.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
3aa5d2bd778175f4b8460b90015f4a73b676787e06be5e8b55c713f6307b3f12
|
|
| MD5 |
7401ee8ca296bc2679e6dabb573ebc47
|
|
| BLAKE2b-256 |
c08e0900af6aa419e60c071b1b3e5df0a5e4b645dcfa8bcd79747872d578874a
|
File details
Details for the file nlptextprep-0.0.3-py3-none-any.whl.
File metadata
- Download URL: nlptextprep-0.0.3-py3-none-any.whl
- Upload date:
- Size: 6.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.1.1 CPython/3.12.7
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
034874b7ab0d7d6c76cab8dfe2ab04499f5cda9d6c786d716bd3ce28ec2f8c83
|
|
| MD5 |
849c16eab83e54c6818b4a4d101cd291
|
|
| BLAKE2b-256 |
206d3b7d24d0bb3158984c97868c6a8a9d57276ecb8d55bbbef8a1a50f17f0b4
|