A small package
Project description
simager
Tools for cleaning or preprocessing text data
Features
- Remove Hastag, mention, nonascii, emoticons, html tag, punctuation,
character, repeated character, repeated word, number, whitespace
- Stopword Operation
- Text normalizer Operation
Instalation
pip install simager
Getting Started
from simager import text_cleaner, stop_word, normalizer
text_cleaner.rm_url("your text data")
text_cleaner.rm_all("your text data")
text_cleaner.rm_set([
"rm_hastag",
"rm_mention",
"rm_url",
"rm_punct",
"rm_whitespace"
])
text_cleaner.rm("your text data")
stop_word.remove("your text data")
normalizer.text("your text data")
Full Example Usage Here
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
simager-0.0.1.tar.gz
(241.0 kB
view hashes)
Built Distribution
simager-0.0.1-py3-none-any.whl
(241.4 kB
view hashes)