Skip to main content

A small text processing package

Project description

✨ Shiny Text

This package aims to be a one stop all package for all text processing tasks. The package was developed using Python3.10.

💻 Installation

pip install -U shinytext

🚀 Usage

# To clean text
from shinytext.cleantext import <method>

# To clean dataframe
from shinytext.cleandata import <method>

# To download/fetch a spacy language model
from shinytext.utils.spacy_loader import get_spacy_object

List of text cleaning methods

  • remove_punctuation: To remove punctuations
  • remove_stopwords: To remove stopwords
  • remove_frequent_words: To remove frequently occuring words
  • remove_rare_words: To remove rarely occuring words
  • stemmer: To perform stemming
  • lemmatizer: To perform lemmatizing
  • remove_emojis: To remove emoji's
  • convert_emojis: To convert emoji's to text
  • remove_emoticons: To remove emoticons
  • convert_emoticons: To convert emoticons to text
  • remove_urls: To remove URLs
  • remove_html: To remove html
  • convert_chatwords: To convert chat words to text

List of dataframe cleaning methods

  • filter_english_records: To extract data with only english text

✍ Contribution Guidelines

Please ensure that you adhere to the following guidelines while making a pull request.

  1. Fork the project on to your repo. Create a branch under your name and create PR from your branch.

  2. USE proper commit messages and give detailed descriptions for your commit including the file name, function name and the changes made.

Customize your commit titles according to below given instructions

* "ADD: <your task title>" for new additions  
* "MOD: <your task title>" for modifications 
* "DEL: <your task title>" for deletions
* "FIX: <your task title>" for fixing bugs
  1. Ensure that your code adheres to PEP-8 guidelines. Click here to know more about it.

💼 Current Requirements

  1. More text processing techniques

  2. Test cases for existing functions

🙌 Contributors

  1. Retin P Kumar

❤ Credits

  1. Emot library by Neel Shah
  2. Getting started with Text Preprocessing by Sudalai Rajkumar

⚙ Dependancies

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

shinytext-0.0.3.tar.gz (51.0 kB view details)

Uploaded Source

Built Distribution

shinytext-0.0.3-py3-none-any.whl (51.2 kB view details)

Uploaded Python 3

File details

Details for the file shinytext-0.0.3.tar.gz.

File metadata

  • Download URL: shinytext-0.0.3.tar.gz
  • Upload date:
  • Size: 51.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.0

File hashes

Hashes for shinytext-0.0.3.tar.gz
Algorithm Hash digest
SHA256 f97821e69fe0312188c8aa788fe492b02008faa812dabe21aad9e519f77ae9bf
MD5 eca3c5230dbcffe7043887980f356606
BLAKE2b-256 dcc3a1c4eed6795a16ede96a0ca875d8102ffaa49d0e03e3ca8f411d016517f0

See more details on using hashes here.

File details

Details for the file shinytext-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: shinytext-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 51.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.0

File hashes

Hashes for shinytext-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 11954d1533484af6db54a493ba88d56105e303f4151ae95758ced2cfb9eb32aa
MD5 25a316b9499b8cb5f10609ca42e9657d
BLAKE2b-256 7f04390421cb59455da64fce4ba8537bf6ba05375af4a14c874dbc585dffb573

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page