
Text cleaner and formatter for handling messy copy-pasted content

Project description

CleanPad - Text Cleaner & Formatter

A versatile Python library for cleaning and formatting messy text, particularly useful for handling copy-pasted content with unwanted formatting.

Features

  • Whitespace normalization
  • Line break cleaning
  • Emoji removal
  • Bullet point cleaning
  • Text to list/dict conversion
  • HTML tag removal
  • Quote normalization
  • URL removal
  • Special character handling
  • Number extraction
  • Punctuation spacing normalization
  • Data structure parsing
  • Line ending standardization
  • Indentation cleaning
  • Unicode character normalization
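To make a few of the features above concrete, here is a minimal sketch built only on the Python standard library. It does not use CleanPad's own API, which this page does not document. The names `clean_text` and `extract_numbers` are hypothetical and chosen for illustration. The sketch covers Unicode normalization, line ending standardization, HTML tag removal, URL removal, quote normalization, bullet point cleaning, whitespace normalization, and number extraction.

```python
import html
import re
import unicodedata

# Hypothetical helpers for illustration; these are NOT cleanpad's functions.
_TAG_RE = re.compile(r"<[^>]+>")                  # naive HTML tag matcher
_URL_RE = re.compile(r"https?://\S+")             # http(s) URLs up to whitespace
_BULLET_RE = re.compile(r"^[ \t]*[•\-*\u2013][ \t]+", re.MULTILINE)
_QUOTES = str.maketrans({"\u201c": '"', "\u201d": '"',
                         "\u2018": "'", "\u2019": "'"})


def clean_text(text: str) -> str:
    """Apply a fixed sequence of cleaning steps to pasted text."""
    text = unicodedata.normalize("NFKC", text)              # Unicode normalization
    text = text.replace("\r\n", "\n").replace("\r", "\n")   # line endings
    text = _TAG_RE.sub("", text)                            # HTML tags
    text = html.unescape(text)                              # entities such as &amp;
    text = _URL_RE.sub("", text)                            # URLs
    text = text.translate(_QUOTES)                          # curly to straight quotes
    text = _BULLET_RE.sub("", text)                         # leading bullet markers
    # Collapse runs of spaces/tabs, trim each line, and drop empty lines.
    lines = [re.sub(r"[ \t]+", " ", line).strip() for line in text.split("\n")]
    return "\n".join(line for line in lines if line)


def extract_numbers(text: str) -> list[float]:
    """Return every integer or decimal found in the text as a float."""
    return [float(m) for m in re.findall(r"-?\d+(?:\.\d+)?", text)]


messy = "<p>Hello   \u201cworld\u201d</p>\r\n\r\n  • item one  \r\nsee https://example.com now"
print(clean_text(messy))
print(extract_numbers("Order 3 items at 4.50 each, -2 discount"))
```

The regular expressions here are deliberately simple. A real HTML parser and a dedicated emoji table would be more robust for untrusted input.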

Installation

pip install cleanpad

Download files

Source Distribution

cleanpad-0.1.2.tar.gz (7.4 kB view details)

Uploaded Source

Built Distribution

cleanpad-0.1.2-py3-none-any.whl (8.1 kB view details)

Uploaded Python 3

File details

Details for the file cleanpad-0.1.2.tar.gz.

File metadata

  • Download URL: cleanpad-0.1.2.tar.gz
  • Upload date:
  • Size: 7.4 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for cleanpad-0.1.2.tar.gz
  • SHA256: c6099b281a7dd9637a6f7f4af578655afa72139f633e2f6bbbf57cbcd70fdb26
  • MD5: 94f6a9ba7543c3fa60010619940ac5d4
  • BLAKE2b-256: d0b3b9902099cc1445358fe6c0b65e641806a68e027897d8db5415a34b27cd64

File details

Details for the file cleanpad-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: cleanpad-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 8.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.12.2

File hashes

Hashes for cleanpad-0.1.2-py3-none-any.whl
  • SHA256: cfbe32cc58350b371c9504efcd39c7153b8b802dc9012f75692109f5ddb2ad60
  • MD5: 865861ed1261a410c00702f2b41c6ba3
  • BLAKE2b-256: 7101278aba9cfabdea79cadc8cfaa71b5fc4905a6a977a9289ee2997f502c389
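
The published digests can be used to check a downloaded file before installing it. The following is a minimal sketch using the standard `hashlib` module. The helper names `sha256_of` and `matches_published` are hypothetical, and the example assumes the file was saved locally under its original name.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed local file name; the digest is the SHA256 value listed for the wheel.
    wheel = Path("cleanpad-0.1.2-py3-none-any.whl")
    published = "cfbe32cc58350b371c9504efcd39c7153b8b802dc9012f75692109f5ddb2ad60"
    if wheel.exists():
        print("digest matches" if matches_published(wheel, published) else "digest MISMATCH")
    else:
        print(f"{wheel} not found; download it first")
```

SHA256 is the digest to prefer for this check. MD5 is listed on the page but is not collision resistant.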
