Library for advanced search
Sitemap.xml generator for a local copy of a website
Enables Beetle to generate a sitemap.xml file
BIBCAT RDF Framework Application
Text retrieval and analytics engine.
Cheshire3 Search and Retrieval Engine and Information Framework
Chinese Province, City and Area Recognition Utilities
Chinese Word Segmentation Utilities
A distributed network crawler framework
Content classification/clustering through language processing
Convert HTML, PDF, and DOC files to plain text
Lightweight, fast and scalable text corpus library.
Tools for loading and analyzing large text corpora.
Browser-based search tool for quickly `grep`ing source code.