Play with text data
Project description
textslack
A text cleaning pipeline to perform text cleaning, along with additional functionalities for sentiment, pos extraction, and word count.
After pip install, please follow the below step to access the functionalities:
- from textslack.textslack import TextSlack
- slack = TextSlack()
Below are the key functionalities currently available in the all the versions:
- transform(text): normalises and cleans unstructured text
- extract_nouns(text): cleans text to keep only nouns
- extract_verbs(text): cleans text to keep only verbs
- extract_adjectives(text): cleans text to keep only adjectives
- extract_adverbs(text): cleans text to keep only adverbs
- sentiment(text): returns a string as sentiment
- word_occurances(word, text): returns frequency of a word mentioned in the text
v.0.1.3 update: transform method can now be used on an entire pandas dataframe column.
Please refer the below medium article for a detailed explanation of textslack functionalities. https://medium.com/analytics-vidhya/text-processing-made-easy-with-textslack-4214ae6bc67a
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
textslack-0.1.3.tar.gz
(2.9 kB
view hashes)
Built Distribution
Close
Hashes for textslack-0.1.3-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | ad65dbd7f4c60da68fb811e93e0a5b9be2d18135318bbe67dc9219a26a5af88e |
|
MD5 | 5a13e419d76da61ebd7ef91fb8e8f9ab |
|
BLAKE2b-256 | 30346e9d46b379bc3aa7e043901c34cbc7cee354555bd5558dec392975bfdd62 |