Play with text data
Project description
textslack
A text cleaning pipeline to perform text cleaning, along with additional functionalities for sentiment, pos extraction, and word count.
After pip install, please follow the below step to access the functionalities:
- from textslack.textslack import TextSlack
- slack = TextSlack()
Below are the key functionalities currently available in the all the versions:
- transform(text): normalises and cleans unstructured text
- extract_nouns(text): cleans text to keep only nouns
- extract_verbs(text): cleans text to keep only verbs
- extract_adjectives(text): cleans text to keep only adjectives
- extract_adverbs(text): cleans text to keep only adverbs
- sentiment(text): returns a string as sentiment
- word_occurances(word, text): returns frequency of a word mentioned in the text
v.0.1.4 updates:
- transform method can now be used on a list and an entire pandas dataframe column.
- textslack can perform basic text cleaning for some non-english languages as well, just pass the language while creating the object as shown below. slack = TextSlack(lang='spanish')
Please refer the below medium article for a detailed explanation of textslack functionalities. https://medium.com/analytics-vidhya/text-processing-made-easy-with-textslack-4214ae6bc67a
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
textslack-0.1.4.tar.gz
(3.2 kB
view details)
Built Distribution
File details
Details for the file textslack-0.1.4.tar.gz
.
File metadata
- Download URL: textslack-0.1.4.tar.gz
- Upload date:
- Size: 3.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4f380bf45aeba565198651e9041ed6d9cae5b2365b139865cebba4e8d897857f |
|
MD5 | 0594791c57d8c0f185134809c1856ee6 |
|
BLAKE2b-256 | ed1a90a466488ce617970fb15bba1bce4836c1a0ae543464b37bdc7a4da27eab |
File details
Details for the file textslack-0.1.4-py3-none-any.whl
.
File metadata
- Download URL: textslack-0.1.4-py3-none-any.whl
- Upload date:
- Size: 4.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/50.3.0 requests-toolbelt/0.9.1 tqdm/4.50.0 CPython/3.7.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4e724e1dd9a275b4bc1fa2ad5714090acb7ceceb1b137bcc9e40671d17790d41 |
|
MD5 | 43cce75af515a3920fbf1fe1e63c87af |
|
BLAKE2b-256 | 049865acb4a427e465a4c2fc606b5ac119cd03abb09ebe82807baaa4cd6814e1 |