Sanitise text while keeping your sanity
Project description
Saniti
Sanitise lists of text documents quickly, easily and whilst maintaining your sanity
###Usage:
As a function-ish
''' original_text = ["I like to moves it, move its", "I likeing to move it!", "the of"]
text = saniti(original_text, ["token", "destop", "depunct", "unempty", "stem", "out_corp_dict"])
print(text.text)
'''
As a class
''' sani1 = saniti() text = sani1.process(original_text, ["token", "destop", "depunct", "unempty", "lemma", "out_tag_doc"]) print(text) '''
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
saniti-0.0.11.tar.gz
(2.3 kB
view details)
File details
Details for the file saniti-0.0.11.tar.gz
.
File metadata
- Download URL: saniti-0.0.11.tar.gz
- Upload date:
- Size: 2.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
1ac577e3478f5a29eb61d3b6d43cdaf4816a5f436c69c667f64815e4082a0bba
|
|
MD5 |
bca25c759fafabf8214ddc97046c10ee
|
|
BLAKE2b-256 |
3deb99207d7e7c98c49501ab9d0a6c3d42ca2bd0fcfce0fe948c325ec7cf0833
|