Project Gutenberg corpus interface
Project description
This package contains a variety of scripts to make working with the tremendous NLP resource Project Gutenberg easier.
The functionality provided by this package includes: * Downloading etexts from Project Gutenberg * Removing headers and footers from etexts * Organizing meta-data about the etexts in a database
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Gutenberg-0.1.0.tar.gz
(20.8 kB
view hashes)