Skip to main content

Text cleaner and formatter for handling messy copy-pasted content

Project description

CleanPad - Text Cleaner & Formatter

A versatile Python library for cleaning and formatting messy text, particularly useful for handling copy-pasted content with unwanted formatting.

Features

  • Whitespace normalization
  • Line break cleaning
  • Emoji removal
  • Bullet point cleaning
  • Text to list/dict conversion
  • HTML tag removal
  • Quote normalization
  • URL removal
  • Special character handling
  • Number extraction
  • Punctuation spacing normalization
  • Data structure parsing
  • Line ending standardization
  • Indentation cleaning
  • Unicode character normalization

Installation

pip install cleanpad

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cleanpad-0.1.0.tar.gz (6.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

cleanpad-0.1.0-py3-none-any.whl (7.4 kB view details)

Uploaded Python 3

File details

Details for the file cleanpad-0.1.0.tar.gz.

File metadata

  • Download URL: cleanpad-0.1.0.tar.gz
  • Upload date:
  • Size: 6.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for cleanpad-0.1.0.tar.gz
Algorithm Hash digest
SHA256 81a732137c81cd1fbc71d31d91474facb4e53b80f87d616628931b4335c10e1f
MD5 991e88cdaca9aa1a1e67e34fb336681b
BLAKE2b-256 8d98c8fc70cf14b7845cc1c4b7d3ec640be2473ba2d2b99336c511799cefa88d

See more details on using hashes here.

File details

Details for the file cleanpad-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: cleanpad-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 7.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for cleanpad-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 5780fe7278d60e77508b9a832aef2249927141d8b010a1fdb3fb0f44a2045cd7
MD5 57e074b104fee839f49b0567f2c7fb09
BLAKE2b-256 bc03dfb88e1c95e2794490f1abdb5340f7154a465dc2f1bc5b1cd485cddfe566

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page