Skip to main content

The Python package docfilter is used to detect and remove inappropriate information from text.

Project description

Python Package: Filter

The Python package filter is used to detect and remove inappropriate information from text.

Installation

pip install filter

Features

  1. Regular expression filtering This method is used to find and filter out specific string patterns, such as personal information like phone numbers or email addresses, or words and phrases related to profanity.
  2. Dictionary-based filtering This method is used to filter out words that match with profanity or personal information already listed in a pre-built dictionary. While this method is quick to apply, it has the disadvantage of needing to be updated with new profanity or personal information.
  3. Machine learning-based filtering This method uses a machine learning model to identify sentences related to profanity or personal information. While this method is more accurate than others, it requires a lot of time and resources to train the model.
  4. Special character removal This method is used to remove special characters from strings that may contain personal information, such as phone numbers or email addresses. While this method is quick to apply, it may be difficult to handle exceptional cases.

One or a combination of the above methods can be used to implement an appropriate filtering method depending on the requirements of the application, and it should be validated in terms of performance and security.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

docfilter-0.1.0.tar.gz (2.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

docfilter-0.1.0-py3-none-any.whl (3.2 kB view details)

Uploaded Python 3

File details

Details for the file docfilter-0.1.0.tar.gz.

File metadata

  • Download URL: docfilter-0.1.0.tar.gz
  • Upload date:
  • Size: 2.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.12

File hashes

Hashes for docfilter-0.1.0.tar.gz
Algorithm Hash digest
SHA256 543394a2180a1f5154bb4d2a2774141ee79b014211b58b90b6c4ec361a7f3b8e
MD5 db58eb887cd664eba226da69a146f5ed
BLAKE2b-256 ac7a8a67cc0852028fecfcc65f503c07aaf5b5882d3bcee3b9c453f61df499e9

See more details on using hashes here.

File details

Details for the file docfilter-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: docfilter-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 3.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.12

File hashes

Hashes for docfilter-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 2d8062e832233ace6c69c66f2e91e3b2c610bc69531327986187f22d9d384d18
MD5 de702cc58d46b0e469e7f8b02f70fce8
BLAKE2b-256 d0170c3a307f9be21e4069cd8f361cb441ab5614ccb86862d6fc5195084a9472

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page