Project description

Unwanted Content Detector

A library to detect undesired, unbranded, or harmful content

Usage

In python:

pip install unwanted-content-detector

With Pandas

import pandas as pd
from unwanted_content_detector import Detector

detector = Detector()
df = pd.DataFrame({"text": [
    "this is hate speech",
    "We should all do our part to protect the environment.",
    'Everyone has the right to love who they want.'
]})

df['is_unwanted'] = df['text'].apply(lambda x: detector.is_unwanted(x))

In the terminal:

unwanted_detector

To get the manual

Models

Model name	size (mb)
distilbert-finetuned	3 gb

Training

unwanted_detector train

Target Architecture / Features

multiple Swappable models
multiple evaluation datasets
possibility of configuring a custom personal dataset to fine tune
Single performance evaluation criteria

Use cases it could be applied to

detecting the generation of harmful content from LLMs
preventing harmful prompts to be injected into LLMs
using it as a validator of content being generated according to the brand guidelines

Liability

This tool aims to help you to detect harmful content but it is not meant to be used as the final decision maker alone.

Project details

These details have not been verified by PyPI

Release history Release notifications | RSS feed

0.1.8

Apr 24, 2023

0.1.7

Apr 22, 2023

0.1.6

Apr 22, 2023

0.1.3

Apr 22, 2023

0.1.2

Apr 22, 2023

This version

0.1.1

Apr 22, 2023

0.1.0

Apr 13, 2023

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unwanted_content_detector-0.1.1.tar.gz (4.0 kB view hashes)

Uploaded Apr 22, 2023 Source

Built Distribution

unwanted_content_detector-0.1.1-py3-none-any.whl (6.4 kB view hashes)

Uploaded Apr 22, 2023 Python 3

Hashes for unwanted_content_detector-0.1.1.tar.gz

Hashes for unwanted_content_detector-0.1.1.tar.gz
Algorithm	Hash digest
SHA256	`223c57d93df671d35c2753d54a100a199a856acf4e68d202713b38a37ccd845c`
MD5	`a4a55ac7acb3df62c3a3d44d88567ea6`
BLAKE2b-256	`d06e3042759b5ef7365dae47d4b8c4f0e6760f3aa9dad23f3f11951cbdea1c2c`

Hashes for unwanted_content_detector-0.1.1-py3-none-any.whl

Hashes for unwanted_content_detector-0.1.1-py3-none-any.whl
Algorithm	Hash digest
SHA256	`c8cff5b51aaf9ecfc53c6487867ee8307631792cc8d29a4f5d033aa301b5b9ca`
MD5	`4d8693e13519a93d131d29ff2ef2f70e`
BLAKE2b-256	`300874680c7d81bcd29385b16f87095377dcc200cdbdd35a24356fdb158b8f7c`