5 projects
date-guesser
Extract publication dates from web pages
ultimate-sitemap-parser
Ultimate Sitemap Parser
sentence-splitter
Text to sentence splitter using heuristic algorithm by Philipp Koehn and Josh Schroeder
feed-seeker
Extract rss, atom, and other feeds from webpages
hausastemmer
Hausa language stemmer (Bimba et al., 2015)