Project Gutenberg corpus interface
This package contains a variety of scripts to make working with the tremendous NLP resource Project Gutenberg easier.
The functionality provided by this package includes: * Downloading etexts from Project Gutenberg * Removing headers and footers from etexts * Organizing meta-data about the etexts in a database
Release history Release notifications | RSS feed
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.