Skip to main content

A Python module for detecting and filtering profanity using Google's Gemini API

Project description

Gemini Profanity Filter

A Python module for detecting and filtering profanity in text using Google's Gemini API. The module can identify standard and obfuscated profanity across multiple languages.

Features

  • Detects standard profanity in multiple languages
  • Identifies obfuscated profanity (character substitutions, leetspeak, etc.)
  • Handles stretched or repeated characters
  • Detects deliberately fragmented words
  • Works with mixed case patterns
  • Processes context-sensitive profanity
  • Language-specific profanity patterns including Cyrillic, Asian scripts, etc.

Installation

pip install gemini-profanity-filter

Usage

from profanity_filter import ProfanityFilter
import pandas as pd
import traceback

def run_batch_tests(pf):
    test_inputs = [
        "Ebl@NiShe nahui ti che delaesh?",
        "kon4ennyj ublYd0k idi nax",
        "ty dur@k ili 4to?",
        "suka bly@t idi v sr@ku",
        "govn0ed pozornyj",
        "pid0rasina ti proklyataya",
        "naxui idi s etim",
        "zh0p0lik bl9t",
        "mraz0ta gnid@",
        "hohol eb@n@y",
        "4mo bly@dsk@ya",
        "d@un ubly@dnij",
        "pizd@nutij shelobok",
        "ebanut@ya ovca",
        "k@kazec ebanij",
        "g0vni4e ty prost@k",
        "nax d@unish4e",
        "ni4ego ti ne ponim@esh",
        "s0sni d@",
        "nahu9 provalis"
    ]

    rows = []
    for i, text in enumerate(test_inputs, 1):
        try:
            result = pf.filter_text(text)
            if result and result.detected_profanity:
                for entry in result.detected_profanity:
                    rows.append({
                        "Тест #": i,
                        "Текст": text,
                        "Оригинал": entry.original_form,
                        "Нормализовано": entry.normalized_form,
                        "Метод": entry.detection_method,
                        "Уверенность": round(entry.confidence_score, 2),
                        "Объяснение": entry.reasoning[:100] + "..." if entry.reasoning else ""
                    })
            else:
                rows.append({
                    "Тест #": i,
                    "Текст": text,
                    "Оригинал": "-",
                    "Нормализовано": "-",
                    "Метод": "-",
                    "Уверенность": 0,
                    "Объяснение": "Нецензурная лексика не найдена"
                })
        except Exception as e:
            rows.append({
                "Тест #": i,
                "Текст": text,
                "Оригинал": "❌ Ошибка",
                "Нормализовано": "-",
                "Метод": "-",
                "Уверенность": 0,
                "Объяснение": str(e)
            })
    
    df = pd.DataFrame(rows)
    print("\n=== Сводка по результатам ===\n")
    print(df.to_string(index=False))

def main():
    try:
        print("=== Запуск авто-тестов ===")
        api_key = "AIzaSyCs3Bbjq_7PeGTAEHJkkSsUtYZvj4Fbbwo"
        pf = ProfanityFilter(api_key)
        run_batch_tests(pf)
    except Exception as e:
        print(f"❌ Ошибка: {e}")
        traceback.print_exc()

if __name__ == "__main__":
    main()

Requirements

  • Python 3.7 or higher
  • Google GenerativeAI Python library
  • A valid Google Gemini API key

License

MIT

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gemini_profanity_filter-0.5.0.tar.gz (13.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

gemini_profanity_filter-0.5.0-py3-none-any.whl (12.8 kB view details)

Uploaded Python 3

File details

Details for the file gemini_profanity_filter-0.5.0.tar.gz.

File metadata

  • Download URL: gemini_profanity_filter-0.5.0.tar.gz
  • Upload date:
  • Size: 13.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.4

File hashes

Hashes for gemini_profanity_filter-0.5.0.tar.gz
Algorithm Hash digest
SHA256 7db68d9bb9e69a8245352c8aa510d6d67bf29bedded51a477ca20e2511ec97e2
MD5 de86478eff720673fe61b139a922664b
BLAKE2b-256 59c2284cef3a96261520ba89a55dae34f1dc6aaac49456c6881cc7e61ffb7d76

See more details on using hashes here.

File details

Details for the file gemini_profanity_filter-0.5.0-py3-none-any.whl.

File metadata

File hashes

Hashes for gemini_profanity_filter-0.5.0-py3-none-any.whl
Algorithm Hash digest
SHA256 e47cac31774d44583686f61c3238eda180d32bd8ed28fa5fdba56fe67a823f4a
MD5 95efeb7b421896d80d6011295724c3da
BLAKE2b-256 95647ef78406e470a57dca3982b21d3b3587a290b66e2ae7fda4ad173f0c67b5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page