Skip to main content

Python package for cleaning up xml and tokenizing text

Project description

XML cleaner

Utilities for tokenizing and cleaning up xml text.

Usage

Parse and tokenize sentence and words in sentences:

> [sentence for sentence in xml_cleaner.to_raw_text(“Joey was a great sailor.”)] #=> [[“Joey”, “was”, “a”, “great”, “sailor”, “.”]]

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

xml-cleaner-1.0.16.tar.gz (6.1 kB view details)

Uploaded Source

File details

Details for the file xml-cleaner-1.0.16.tar.gz.

File metadata

File hashes

Hashes for xml-cleaner-1.0.16.tar.gz
Algorithm Hash digest
SHA256 f7ec1a554d117bc4db54845b2fb3bca41b32e6cd79fce0d3906f3c6adc4d10df
MD5 2691f004ec429fbc583c2a0498c12c33
BLAKE2b-256 8936ba1471cb325fa1384cc76b476f2777719e5e1d2755745f105a44fab8be4f

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page