Python package for cleaning up XML and tokenizing text
Project description
XML cleaner
Utilities for tokenizing text and cleaning up XML.
Usage
Split text into sentences and tokenize the words in each sentence:
> import xml_cleaner
> [sentence for sentence in xml_cleaner.to_raw_text("Joey was a great sailor.")] #=> [["Joey", "was", "a", "great", "sailor", "."]]
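To make the output shape concrete (one list of tokens per sentence), here is a minimal, self-contained sketch. It is not the package's implementation: `sketch_to_raw_text` is a hypothetical stand-in, and the regular expression and the sentence-ending rule are assumptions chosen only for illustration. The real `to_raw_text` may handle abbreviations, markup, and other cases that this sketch does not.

```python
import re
from typing import Iterator, List

# Assumption for illustration only: a token is either a run of word
# characters/apostrophes, or a single non-space punctuation character.
_TOKEN = re.compile(r"[\w']+|[^\w\s]")

# Assumption: these tokens close a sentence. Abbreviations such as "Dr."
# would be split incorrectly by this naive rule.
_SENTENCE_END = {".", "!", "?"}


def sketch_to_raw_text(text: str) -> Iterator[List[str]]:
    """Yield one list of tokens per sentence (hypothetical helper)."""
    sentence: List[str] = []
    for token in _TOKEN.findall(text):
        sentence.append(token)
        if token in _SENTENCE_END:
            yield sentence
            sentence = []
    if sentence:  # trailing text without closing punctuation
        yield sentence


sentences = list(sketch_to_raw_text("Joey was a great sailor. He sailed far!"))
print(sentences)
```

Like the call shown above, the sketch is a generator, so wrapping it in `list(...)` or a list comprehension collects all sentences at once.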
Download files
Source Distribution
xml-cleaner-1.0.9.tar.gz (5.9 kB)