Skip to main content

The goal is to provide a unified interface to interact with various social analysis tools

Project description

Social Tools

Overview

The Social Tools library provides a unified interface for interacting with various social analysis tools, including sentiment analysis, toxicity detection, emotion detection, and other natural language processing (NLP) models. With this library, developers can quickly analyze social media text, chat messages, or other forms of unstructured data for toxicity, sentiment, emotions, and more.

Key Features

  • Sentiment Analysis: Determine whether the text expresses positive, negative, or neutral sentiment.
  • Toxicity Detection: Identify toxic language, hate speech, offensive comments, and inappropriate content.
  • Emotion Detection: Recognize emotions such as happiness, sadness, anger, and more.
  • Custom NLP Models: Integrate additional NLP models for detecting bias, misinformation, and other social signals.

Installation

To install Social Tools, run:

pip install social-tools

Usage

Import the Modules

from social_tools import EmotionDetection, SentimentAnalysis, ToxicityDetection

Toxicity Detection

The ToxicityDetection module allows you to analyze text for toxic comments using pre-trained models like HuggingFace's unitary/toxic-bert.

# Using the unitary/toxic-bert transformer model
tox_detector = ToxicityDetection(tool='transformer', model='unitary/toxic-bert')
result = tox_detector.analyze("I hate you.")
print(result)

This returns:

[{'label': 'toxic', 'score': 0.9475088119506836}]

You can also analyze multiple texts at once:

tox_detector.analyze(["I hate you.", "This is harsh"])

Output:

[
    {'label': 'toxic', 'score': 0.9475088119506836},
    {'label': 'toxic', 'score': 0.002488125581294298}
]

Sentiment Analysis

The SentimentAnalysis module offers several options for analyzing the sentiment of text, including NLTK, SpaCy, and HuggingFace models.

# Using NLTK
sa = SentimentAnalysis(tool='nltk')
result = sa.analyze("This is awesome!")
print(result)

Output:

[{'neg': 0.0, 'neu': 0.313, 'pos': 0.687, 'compound': 0.6588}]

Using SpaCy:

sa = SentimentAnalysis(tool='spacy')
result = sa.analyze("This is awesome!")
print(result)

Output:

[{
    'polarity': 1.0, 
    'subjectivity': 1.0, 
    'sentiment_assessments': [(['awesome', '!'], 1.0, 1.0, None)]
}]

Using HuggingFace Transformer:

sa = SentimentAnalysis(tool='huggingface')
result = sa.analyze("This is awesome!")
print(result)

Output:

[{'label': 'POSITIVE', 'score': 0.9998669624328613}]

Custom HuggingFace Models

You can specify a custom HuggingFace transformer model by passing the model name during initialization:

sa = SentimentAnalysis(tool='huggingface', transformer_model="cardiffnlp/twitter-roberta-base-sentiment-latest")
result = sa.analyze("This is awesome!")
print(result)

Output:

[{'label': 'positive', 'score': 0.9813949465751648}]

Emotion Detection

The EmotionDetection module allows you to detect emotions such as happiness, sadness, and anger. For example, using HuggingFace models:

emotion_detector = EmotionDetection(tool='huggingface')
result = emotion_detector.analyze("I am so happy today!")
print(result)

This will return:

[{'label': 'joy', 'score': 0.95}]

Flexible Input Handling

All analyze functions in each module accept both single strings (str) and lists of strings (List[str]) as input:

# Single input
result = sa.analyze("I love this!")

# Multiple inputs
result = sa.analyze(["I love this!", "This is terrible."])

HuggingFace Transformer Parameters

When using a HuggingFace transformer model, you can pass additional parameters during initialization, such as return_all_scores:

sa = SentimentAnalysis(tool='huggingface', transformer_model="bert-base-uncased", return_all_scores=True)
result = sa.analyze("This is fantastic!")
print(result)

Conclusion

The Social Tools library simplifies the process of analyzing social data by providing multiple sentiment, emotion, and toxicity detection tools in a unified interface. You can integrate popular NLP libraries like NLTK, SpaCy, and HuggingFace models into your workflow seamlessly.

For more information about the supported HuggingFace models and additional parameters, refer to the HuggingFace documentation.

Acknowledgements

This project would not have been possible without the contributions of the following open-source projects:

  • Detoxify: For providing pre-trained models to detect toxic content in text.
  • HuggingFace: For providing a wide variety of pre-trained transformer models and their powerful transformers library.
  • NLTK (Natural Language Toolkit): For providing robust tools for text processing and sentiment analysis.
  • SpaCy: For offering fast and efficient NLP capabilities, along with the spacytextblob extension for sentiment analysis.
  • TextBlob: For providing an easy-to-use interface for text processing, sentiment analysis, and other NLP tasks.

A huge thank you to these projects and their respective communities for building the foundational tools that made this library possible.

Citation

@misc{socialtool,
  title={socialtool},
  author={Ridwan Amure},
  howpublished={Github. https://github.com/instabaines/social_tools_lib/},
  year={2024}
}

if you use the detoxify module in this tool, kindly cite

@misc{Detoxify,
  title={Detoxify},
  author={Hanu, Laura and {Unitary team}},
  howpublished={Github. https://github.com/unitaryai/detoxify},
  year={2020}
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

social_tools-1.0.0.tar.gz (10.8 kB view details)

Uploaded Source

File details

Details for the file social_tools-1.0.0.tar.gz.

File metadata

  • Download URL: social_tools-1.0.0.tar.gz
  • Upload date:
  • Size: 10.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.11.0

File hashes

Hashes for social_tools-1.0.0.tar.gz
Algorithm Hash digest
SHA256 67b717b6df6a8501c06a866c741e5c9fc19a14a8ecee22c9d46f1cd9be275e1e
MD5 888ed48d515fd7b26062eecf30f32d25
BLAKE2b-256 adcf32b560de4e976b8eb5266726d54a17a4fe60ae7d71ff0e87031545b2ee0f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page