Skip to main content

PEACE method Augmentation package

Project description

peace-augmentation

PEACE stands for Providing Explanation and Analysis for Combating Hate Expressions. This framework is engineered to effectively identify, visualize, and elucidate implicit and subtle forms of hate speech. It aims to address the complex nature of modern hate speech by integrating three pivotal components:

  • Data Analysis: In-depth examination of cutting-edge data on Implicit and Subtle Hate Speech to understand its nuances and variations.
  • Machine Learning Identification: Employment of sophisticated machine learning classifiers to accurately detect instances of hate speech.
  • Explanation Generation: Articulation of clear, understandable explanations concerning why particular messages are classified as hateful.

This repository is only including the augmentation extension. The code of the PEACE augmentation web app can be found here

Installation Instructions

To set up the peace-augmentation app on your local system, you can install it using pip:

pip install peace-augmentation

## Usage

To use it in one of your project, you have to use the Augmentation class:

from peace_augmentation import Augmentation

my_str = "This string contains hate speech that I want to augment usong PEACE"
augmentation_args = ...
augmentation_method = ...

aug = Augmentation(my_str, *augmentation_args)
result = aug.apply(augmentation_method)

print(result)

Args reference

Depending on the method, you can specify a few arguments listed below:

TO DO

Method reference

Here is a list of the different augmentation methods available with their description:

Identifier Name Description
easy_data_augmentation Easy Data Augmentation Given an input sentence: - it randomly replaces a non-stopword expression with a synonym using Word-net; it inserts a synonym of a non-stopword word in a random position; it chooses two words of the sentence and swap their positions; it removes each word in the sentence with a certain probability. Only one of the four operations at a time is applied to a sentence.
replace_adjectives Replace Adjectives It takes all the adjectives or nouns in a sentence and replaces each of them nof_repl_p_cand times for a synonym. In particular, if an adjective/noun is selected, then the synonym/hyponym will be an adjective/noun.
replace_in_domain_expressions Replace In-domain Expressions It replaces a list of manually-crafted expressions often used in HS messages with other semantically similar expressions. That is, it checks all the ocurrences of in-domain expressions in sentence, and generates nof_repl_p_cand sentences per each candidate by changing it for another manually-collected expression.
replace_scalar_adverbs Replace Scalar Adverbs It selects all the adverbs of pos type 'RB' in the sentence. For each candidate, it generates nof_repl_p_cand by replacing the candidate with an scalar adverb.
add_adverbs_to_verbs Add adverbs to verbs It selects all the words in a sentence that are POS labeled with cand_pos_name. We called these words as candidates. Each candidate is selected and new sentences are generated by adding an adverb before the candidate. In this case, speculative adverbs are used like certainly, likely, and clearly from a previously collected list. cand_pos_name can indicate verbs or adjectives. That is, the following values: - VB Verb, base form. - VBD Verb, past tense. - VBG Verb, gerund or present participle. - VBN Verb, past participle. - VBP Verb, non-3rd person singular present. - VBZ Verb, 3rd person singular present. - JJ, Adjective.
replace_named_entities Replace Named Entities It replaces a named entity (PER, LOC, ORG, and MISC) in the input sentence. A candidate NE in a sentence is replaced by another one according to a previously collected list of NEs. Then, the most similar NE is selected by using pre-trained FastText embeddings. Per each candidate it creates nof_repl_p_cand sentences. For example, if ner_name = 'PER', nof_repl_p_cand = 5 and input sentence has 2 named entities PER, thid method generates 2 * 5 sentences.
back-translation Back Translation TO DO

Contact Information

For inquiries or additional details, please feel free to reach out to any of the following individuals:

Please consider reaching out with respect to the specific areas of the project you are interested in, as this will help direct your queries more effectively.

Contributing

Your contributions are welcome! If you have suggestions or want to contribute to the PEACE framework, please feel free to open an issue or create a pull request on our repository.

License

This project is licensed under the Boost Software License 1.0 (BSL-1.0) - see the LICENSE file in the project repository for detailed information and terms.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

peace_augmentation-0.0.28.tar.gz (97.2 kB view details)

Uploaded Source

Built Distribution

peace_augmentation-0.0.28-py3-none-any.whl (98.7 kB view details)

Uploaded Python 3

File details

Details for the file peace_augmentation-0.0.28.tar.gz.

File metadata

  • Download URL: peace_augmentation-0.0.28.tar.gz
  • Upload date:
  • Size: 97.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.12

File hashes

Hashes for peace_augmentation-0.0.28.tar.gz
Algorithm Hash digest
SHA256 92a3ec1659b8ee9c7c4a3cf1b7b191b1cf9abf1b2ff9f98b64304cca0be89338
MD5 04c003c9a8188cc9e2905e413a588958
BLAKE2b-256 6edcb777343a222894b571749e21fc25ed5c8b1201d3d2e210653742cc1a90ed

See more details on using hashes here.

File details

Details for the file peace_augmentation-0.0.28-py3-none-any.whl.

File metadata

File hashes

Hashes for peace_augmentation-0.0.28-py3-none-any.whl
Algorithm Hash digest
SHA256 97aaace25004d1d30312b137a4cedb23af4d11ad06fb933c9105e06846dd626c
MD5 5221026f7f431d2867a2633db8ed16f2
BLAKE2b-256 71f6f9a001b01a1db0c84d22d78a33c15243323aeb72078564fee45d0c239771

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page