Skip to main content

Sanitise text while keeping your sanity

Project description

Saniti

Sanitise lists of text documents quickly, easily and whilst maintaining your sanity

###Usage:

As a function-ish

''' original_text = ["I like to moves it, move its", "I likeing to move it!", "the of"]

text = saniti(original_text, ["token", "destop", "depunct", "unempty", "stem", "out_corp_dict"])

print(text.text)

'''

As a class

''' sani1 = saniti() text = sani1.process(original_text, ["token", "destop", "depunct", "unempty", "lemma", "out_tag_doc"]) print(text) '''

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

saniti-0.0.11.tar.gz (2.3 kB view details)

Uploaded Source

File details

Details for the file saniti-0.0.11.tar.gz.

File metadata

  • Download URL: saniti-0.0.11.tar.gz
  • Upload date:
  • Size: 2.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for saniti-0.0.11.tar.gz
Algorithm Hash digest
SHA256 1ac577e3478f5a29eb61d3b6d43cdaf4816a5f436c69c667f64815e4082a0bba
MD5 bca25c759fafabf8214ddc97046c10ee
BLAKE2b-256 3deb99207d7e7c98c49501ab9d0a6c3d42ca2bd0fcfce0fe948c325ec7cf0833

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page