A fast, memory-efficient multilingual profanity filter using Parquet.

Multilingual Profanity Filter


Introduction

This project provides a profanity filter that supports multiple languages and is designed to be easy to use and to integrate into other applications. For each supported language, the filter checks text against a comprehensive list of profane words, which lets it identify and filter out inappropriate content.

Its primary word source is the List of Dirty, Naughty, Obscene, and Otherwise Bad Words, a widely used repository of profane words in multiple languages.

Supported Languages 🌐

The profanity filter currently supports the same languages as the LDNOOBWV2 repository; see that repository for the full list.

Installation ⚙️

This package is available on PyPI, so you can install it using pip:

pip install multilingual-profanity

Usage ⌨️

Here's a simple example of how to use the ProfanityFilter class:

from multilingual_profanity import ProfanityFilter

pf = ProfanityFilter("nl")
print(pf.is_clean("fiets")) # Output: True
print(pf.is_clean("kanker")) # Output: False
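Building on the example above, here is a minimal sketch of using `is_clean` to keep only the clean entries of a list. The helper `keep_clean`, the `CleanChecker` protocol, and `StubFilter` are hypothetical names introduced for illustration and are not part of the package; the only thing assumed about `ProfanityFilter` is the `is_clean(text)` method shown above. The stub exists only so the sketch runs without the package installed.

```python
from typing import Iterable, List, Protocol


class CleanChecker(Protocol):
    """Anything exposing is_clean(text) -> bool, such as ProfanityFilter."""

    def is_clean(self, text: str) -> bool: ...


def keep_clean(texts: Iterable[str], checker: CleanChecker) -> List[str]:
    """Return only the texts that the checker reports as clean."""
    return [text for text in texts if checker.is_clean(text)]


class StubFilter:
    """Hypothetical stand-in for ProfanityFilter, used for demonstration only."""

    def __init__(self, blocked: Iterable[str]) -> None:
        # Store the blocked words in lowercase for case-insensitive lookup.
        self._blocked = {word.lower() for word in blocked}

    def is_clean(self, text: str) -> bool:
        # Exact whole-string match; the real filter may behave differently.
        return text.lower() not in self._blocked


if __name__ == "__main__":
    # With the package installed, pass ProfanityFilter("nl") instead of the stub.
    checker = StubFilter({"kanker"})
    print(keep_clean(["fiets", "kanker", "boom"], checker))
```

Because `keep_clean` depends only on the `is_clean` method, the same helper should work unchanged with a real `ProfanityFilter` instance for any supported language.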

Citation ✍️

If you use this project in your research, please cite as follows:

@misc{profanity_filter,
  author  = {Stephan Akkerman},
  title   = {Profanity Filter},
  year    = {2026},
  publisher = {GitHub},
  journal = {GitHub repository},
  howpublished = {\url{https://github.com/StephanAkkerman/profanity-filter}}
}

Contributing 🛠

Contributions are welcome! If you have a feature request, bug report, or proposal for code refactoring, please feel free to open an issue on GitHub. We appreciate your help in improving this project.
https://github.com/StephanAkkerman/profanity-filter/graphs/contributors

License 📜

This project is licensed under the MIT License. See the LICENSE file for details.

Source Distribution

multilingual_profanity-0.1.2.tar.gz (45.3 kB view details)

Uploaded Source

Built Distribution

multilingual_profanity-0.1.2-py3-none-any.whl (42.3 kB view details)

Uploaded Python 3

File details

Details for the file multilingual_profanity-0.1.2.tar.gz.

File metadata

  • Download URL: multilingual_profanity-0.1.2.tar.gz
  • Upload date:
  • Size: 45.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.13.12

File hashes

Hashes for multilingual_profanity-0.1.2.tar.gz
Algorithm Hash digest
SHA256 127d0f7109890faa26e5bc40450be7282ec4436f63bea9094d578c63c49cb722
MD5 d6df16941b14d21aae2a170bbb8d66b7
BLAKE2b-256 8e5b74e877cec9236a77a4d767c99ba6d03420c1692a60f944aef318f4692098
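The SHA256 digest listed above can be checked locally after downloading the source distribution. The following is a minimal sketch using only the Python standard library; the function names `sha256_of` and `matches` are hypothetical, and the archive is assumed to sit in the current working directory.

```python
import hashlib
import hmac
from pathlib import Path

# SHA256 digest listed above for multilingual_profanity-0.1.2.tar.gz.
EXPECTED_SHA256 = "127d0f7109890faa26e5bc40450be7282ec4436f63bea9094d578c63c49cb722"


def sha256_of(path: Path, chunk_size: int = 1 << 16) -> str:
    """Compute the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with path.open("rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches(path: Path, expected_hex: str) -> bool:
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return hmac.compare_digest(sha256_of(path), expected_hex.lower())


if __name__ == "__main__":
    # Assumed location of the downloaded archive.
    archive = Path("multilingual_profanity-0.1.2.tar.gz")
    if archive.exists():
        print("digest matches:", matches(archive, EXPECTED_SHA256))
    else:
        print("archive not found:", archive)
```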

File details

Details for the file multilingual_profanity-0.1.2-py3-none-any.whl.

File hashes

Hashes for multilingual_profanity-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 cb14cdb3ae3efec0e5034f076f6c934b31ee88914e39ddad084ebb1ea57591ee
MD5 a33459f50dd5747c5783554b36320866
BLAKE2b-256 76f7306eebf885ead35484b6c377ada12ab8abecbbaabb3360334e74405397c7
