Extract data from HTML pages that have some kind of a repetitive pattern
Project description
This package tries to find a repetitive pattern in an HTML page that contains some kind of a list (like digest pages). It extracts the sub-html text that creates the pattern, and try to extract useful information from it.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
HtmlList-1.3.0-py2.5.egg
(186.9 kB
view hashes)