Skip to main content

The goal is to provide a unified interface to interact with various social analysis tools

Project description

Social Tools

Overview

The Social Tools library provides a unified interface for interacting with various social analysis tools, including sentiment analysis, toxicity detection, emotion detection, and other natural language processing (NLP) models. With this library, developers can quickly analyze social media text, chat messages, or other forms of unstructured data for toxicity, sentiment, emotions, and more.

Key Features

  • Sentiment Analysis: Determine whether the text expresses positive, negative, or neutral sentiment.
  • Toxicity Detection: Identify toxic language, hate speech, offensive comments, and inappropriate content.
  • Emotion Detection: Recognize emotions such as happiness, sadness, anger, and more.
  • Custom NLP Models: Integrate additional NLP models for detecting bias, misinformation, and other social signals.

Installation

To install Social Tools, run:

pip install social-tools

Usage

Import the Modules

from social_tools import EmotionDetection, SentimentAnalysis, ToxicityDetection

Toxicity Detection

The ToxicityDetection module allows you to analyze text for toxic comments using pre-trained models like HuggingFace's unitary/toxic-bert.

# Using the unitary/toxic-bert transformer model
tox_detector = ToxicityDetection(tool='transformer', model='unitary/toxic-bert')
result = tox_detector.analyze("I hate you.")
print(result)

This returns:

[{'label': 'toxic', 'score': 0.9475088119506836}]

You can also analyze multiple texts at once:

tox_detector.analyze(["I hate you.", "This is harsh"])

Output:

[
    {'label': 'toxic', 'score': 0.9475088119506836},
    {'label': 'toxic', 'score': 0.002488125581294298}
]

Sentiment Analysis

The SentimentAnalysis module offers several options for analyzing the sentiment of text, including NLTK, SpaCy, and HuggingFace models.

# Using NLTK
sa = SentimentAnalysis(tool='nltk')
result = sa.analyze("This is awesome!")
print(result)

Output:

[{'neg': 0.0, 'neu': 0.313, 'pos': 0.687, 'compound': 0.6588}]

Using SpaCy:

sa = SentimentAnalysis(tool='spacy')
result = sa.analyze("This is awesome!")
print(result)

Output:

[{
    'polarity': 1.0, 
    'subjectivity': 1.0, 
    'sentiment_assessments': [(['awesome', '!'], 1.0, 1.0, None)]
}]

Using HuggingFace Transformer:

sa = SentimentAnalysis(tool='huggingface')
result = sa.analyze("This is awesome!")
print(result)

Output:

[{'label': 'POSITIVE', 'score': 0.9998669624328613}]

Custom HuggingFace Models

You can specify a custom HuggingFace transformer model by passing the model name during initialization:

sa = SentimentAnalysis(tool='huggingface', transformer_model="cardiffnlp/twitter-roberta-base-sentiment-latest")
result = sa.analyze("This is awesome!")
print(result)

Output:

[{'label': 'positive', 'score': 0.9813949465751648}]

Emotion Detection

The EmotionDetection module allows you to detect emotions such as happiness, sadness, and anger. For example, using HuggingFace models:

emotion_detector = EmotionDetection(tool='huggingface')
result = emotion_detector.analyze("I am so happy today!")
print(result)

This will return:

[{'label': 'joy', 'score': 0.95}]

Flexible Input Handling

All analyze functions in each module accept both single strings (str) and lists of strings (List[str]) as input:

# Single input
result = sa.analyze("I love this!")

# Multiple inputs
result = sa.analyze(["I love this!", "This is terrible."])

HuggingFace Transformer Parameters

When using a HuggingFace transformer model, you can pass additional parameters during initialization, such as return_all_scores:

sa = SentimentAnalysis(tool='huggingface', transformer_model="bert-base-uncased", return_all_scores=True)
result = sa.analyze("This is fantastic!")
print(result)

Conclusion

The Social Tools library simplifies the process of analyzing social data by providing multiple sentiment, emotion, and toxicity detection tools in a unified interface. You can integrate popular NLP libraries like NLTK, SpaCy, and HuggingFace models into your workflow seamlessly.

For more information about the supported HuggingFace models and additional parameters, refer to the HuggingFace documentation.

Acknowledgements

This project would not have been possible without the contributions of the following open-source projects:

  • Detoxify: For providing pre-trained models to detect toxic content in text.
  • HuggingFace: For providing a wide variety of pre-trained transformer models and their powerful transformers library.
  • NLTK (Natural Language Toolkit): For providing robust tools for text processing and sentiment analysis.
  • SpaCy: For offering fast and efficient NLP capabilities, along with the spacytextblob extension for sentiment analysis.
  • TextBlob: For providing an easy-to-use interface for text processing, sentiment analysis, and other NLP tasks.

A huge thank you to these projects and their respective communities for building the foundational tools that made this library possible.

Citation

@misc{socialtool,
  title={socialtool},
  author={Ridwan Amure},
  howpublished={Github. https://github.com/instabaines/social_tools_lib/},
  year={2024}
}

if you use the detoxify module in this tool, kindly cite

@misc{Detoxify,
  title={Detoxify},
  author={Hanu, Laura and {Unitary team}},
  howpublished={Github. https://github.com/unitaryai/detoxify},
  year={2020}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

social_tools-0.13.dev0.tar.gz (10.8 kB view details)

Uploaded Source

File details

Details for the file social_tools-0.13.dev0.tar.gz.

File metadata

  • Download URL: social_tools-0.13.dev0.tar.gz
  • Upload date:
  • Size: 10.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.0

File hashes

Hashes for social_tools-0.13.dev0.tar.gz
Algorithm Hash digest
SHA256 a645967d086d46ded91c890eb6ac0045e8239c96322c6fbcb0c1e7fca810c168
MD5 9864ed17be50e1f7d0bb30b19b63eb27
BLAKE2b-256 d4842a7aef3f833e52f48cf4dbe16d894789395341604a721990643f6fa92e12

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page