Skip to main content

Play with text data

Project description

textslack

A text cleaning pipeline to perform text cleaning, along with additional functionalities for sentiment, pos extraction, and word count.

After pip install, please follow the below step to access the functionalities:

  • from textslack.textslack import TextSlack
  • slack = TextSlack()

Below are the key functionalities currently available in the all the versions:

  1. transform(text): normalises and cleans unstructured text
  2. extract_nouns(text): cleans text to keep only nouns
  3. extract_verbs(text): cleans text to keep only verbs
  4. extract_adjectives(text): cleans text to keep only adjectives
  5. extract_adverbs(text): cleans text to keep only adverbs
  6. sentiment(text): returns a string as sentiment
  7. word_occurances(word, text): returns frequency of a word mentioned in the text

v.0.1.5 updates:

  1. transform method can now be used on a list and an entire pandas dataframe column.

  2. textslack can perform basic text cleaning for some non-english languages as well, just pass the language while creating the object as shown below.

    slack = TextSlack(lang='spanish')

Please refer the below medium article for a detailed explanation of textslack functionalities. https://medium.com/analytics-vidhya/text-processing-made-easy-with-textslack-4214ae6bc67a

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

textslack-0.1.5.tar.gz (3.4 kB view hashes)

Uploaded Source

Built Distribution

textslack-0.1.5-py3-none-any.whl (4.4 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page