Skip to main content

A fast, memory-efficient multilingual profanity filter using Parquet.

Project description

Multilingual Profanity Filter


GitHub Actions Workflow Status License Code style: black

Introduction

This project provides a profanity filter that supports multiple languages. It is designed to be easy to use and integrate into various applications. The filter uses a comprehensive list of profane words for each supported language, allowing it to effectively identify and filter out inappropriate content.

It uses the List of Dirty, Naughty, Obscene, and Otherwise Bad Words as its primary source for profane words, which is a widely used repository of such words in multiple languages.

Table of Contents 🗂

Supported Languages 🌐

The profanity filter currently supports the same languages as the LDNOOBWV2 repository, you can find it here.

Installation ⚙️

This package is available on PyPI, so you can install it using pip:

pip install multilingual-profanity

Usage ⌨️

Here's a simple example of how to use the ProfanityFilter class:

from multilingual_profanity import ProfanityFilter

pf = ProfanityFilter("nl")
print(pf.is_clean("fiets")) # Output: True
print(pf.is_clean("kanker")) # Output: False

Citation ✍️

If you use this project in your research, please cite as follows:

@misc{profanity_filter,
  author  = {Stephan Akkerman},
  title   = {Profanity Filter},
  year    = {2026},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/StephanAkkerman/profanity-filter}}
}

Contributing 🛠

Contributions are welcome! If you have a feature request, bug report, or proposal for code refactoring, please feel free to open an issue on GitHub. We appreciate your help in improving this project.
https://github.com/StephanAkkerman/profanity-filter/graphs/contributors

License 📜

This project is licensed under the MIT License. See the LICENSE file for details.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

multilingual_profanity-0.1.1.tar.gz (474.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

multilingual_profanity-0.1.1-py3-none-any.whl (470.7 kB view details)

Uploaded Python 3

File details

Details for the file multilingual_profanity-0.1.1.tar.gz.

File metadata

  • Download URL: multilingual_profanity-0.1.1.tar.gz
  • Upload date:
  • Size: 474.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for multilingual_profanity-0.1.1.tar.gz
Algorithm Hash digest
SHA256 2ee857427f8568b26d9312995745cf63bb08cb49010859a7f207c1f2fa3a78e7
MD5 61d8ac575302b9687c69ceb1d5d5b492
BLAKE2b-256 435c5eb2ac4b550cd876c7e2bda1c9400db6f5356626a999e66995de28cc5b75

See more details on using hashes here.

File details

Details for the file multilingual_profanity-0.1.1-py3-none-any.whl.

File metadata

File hashes

Hashes for multilingual_profanity-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 1ba4453898155997c2ca8e4143111cedffa5a40d31d25995be5c055eaac3ee55
MD5 7320201e30e90f022d0ddf9d30a7114c
BLAKE2b-256 ba114bd64c42b22fd756d0f36f85657371de32a9b9cc75c5a2b1360a02ebdd27

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page