Skip to main content

Text cleaner and formatter for handling messy copy-pasted content

Project description

CleanPad - Text Cleaner & Formatter

A versatile Python library for cleaning and formatting messy text, particularly useful for handling copy-pasted content with unwanted formatting.

Features

  • Whitespace normalization
  • Line break cleaning
  • Emoji removal
  • Bullet point cleaning
  • Text to list/dict conversion
  • HTML tag removal
  • Quote normalization
  • URL removal
  • Special character handling
  • Number extraction
  • Punctuation spacing normalization
  • Data structure parsing
  • Line ending standardization
  • Indentation cleaning
  • Unicode character normalization

Installation

pip install cleanpad

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cleanpad-0.1.1.tar.gz (7.4 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cleanpad-0.1.1-py3-none-any.whl (8.1 kB view details)

Uploaded Python 3

File details

Details for the file cleanpad-0.1.1.tar.gz.

File metadata

  • Download URL: cleanpad-0.1.1.tar.gz
  • Upload date:
  • Size: 7.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for cleanpad-0.1.1.tar.gz
Algorithm Hash digest
SHA256 d4e32df27575ab096590682b18644de1f11814c28ac57fdffff98173763276df
MD5 c26c4e32558884a3052ec300adf51965
BLAKE2b-256 d0d127ca82389f75ad4fe92bcf67914467b9e8a00a45a6c98249dc333fa041c0

See more details on using hashes here.

File details

Details for the file cleanpad-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: cleanpad-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 8.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for cleanpad-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 1a0e391f94ecf0f96320825b26701e21c21c97f64f6843c60fa8d9dce823b983
MD5 1a97fe0cea7b20df324b0cd6be0894b0
BLAKE2b-256 fa740b863debcf36ca6743f83cad22cea73cb4627bdfcd460aa1543ebeef665f

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page