Skip to main content

A small package

Project description

simager

Tools for cleaning or preprocessing text data

Features

- Remove Hastag, mention, nonascii, emoticons, html tag, punctuation,
  character, repeated character, repeated word, number, whitespace
- Stopword Operation
- Text normalizer Operation

Instalation

pip install simager

Getting Started

from simager import text_cleaner, stop_word, normalizer


text_cleaner.rm_url("your text data")

text_cleaner.rm_all("your text data")

text_cleaner.rm_set([
    "rm_hastag",
    "rm_mention",
    "rm_url",
    "rm_punct",
    "rm_whitespace"
])
text_cleaner.rm("your text data")

stop_word.remove("your text data")

normalizer.text("your text data")

Full Example Usage Here

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

simager-0.0.1.tar.gz (241.0 kB view hashes)

Uploaded Source

Built Distribution

simager-0.0.1-py3-none-any.whl (241.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page