# 🛀 Cleanliness
Cleanliness is a package for doing (very) basic cleaning of text. It can be used for preprocessing input to various natural language processing tasks.
## Install
Install from PyPI as follows:
```shell
pip install cleanliness
```
Alternatively, you can download the source from GitHub, then install using pip (from the repo directory):
```shell
pip install -e .
```
## Usage
```python
from cleanliness import normalize_whitespace

# Input containing a mix of newlines, tabs, and spaces
clean_text = normalize_whitespace("This\ntext\tcontains odd\n\t whitespace.")
```
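For readers who want a feel for what whitespace normalization usually involves, here is a minimal standalone sketch. It does not use the package, and it is not taken from the package's source: the function name `normalize_whitespace_sketch` is hypothetical, and the behavior shown (collapsing every run of whitespace into a single space and trimming the ends) is an assumption about what such a function typically does, not a description of what `cleanliness.normalize_whitespace` returns.

```python
import re

# Hypothetical helper for illustration only; not part of the cleanliness package.
# Assumption: "normalizing" means collapsing whitespace runs into one space.
_WHITESPACE_RUN = re.compile(r"\s+")


def normalize_whitespace_sketch(text: str) -> str:
    """Collapse runs of spaces, tabs, and newlines into single spaces."""
    return _WHITESPACE_RUN.sub(" ", text).strip()


if __name__ == "__main__":
    sample = "This\ntext\tcontains odd\n\t whitespace."
    print(normalize_whitespace_sketch(sample))
```

Under these assumptions the sample string becomes a single line with words separated by one space each. Check the package's own output before relying on any specific behavior.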