HTML DOM Tree Leaf Structure Identification Package
Project description
WebLeaf Package
HTML DOM Tree Leaf Structure Identification Package
"You become who you surround yourself with."
src: Someone Important
Description
Websites are generally built as a composition of components. If you understand the structure of a given website then you can better understand the data within it. This package helps you classify elements within the DOM tree by creating a set representation of an element's neighbors. This set can then be used to develop robust data scraping logic.
-> show image of HTML and an image of the result. make it clear how it works
Concepts
Leaf
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
webleaf-0.1.0.tar.gz
(4.3 kB
view hashes)