Displays HTML page as a Graph
Project description
WebTree
Project Description
All HTML documents are structured as trees. This project provides code to parse and visualise HTML documents as graphs.
Features of the project are:
- View any html page (using page source) as a graph (collection of connected nodes).
- Search graph by html tags, tag attributes, strings etc.
- Find shortest path between HTML node to any other node in graph, for easier web-scraping.
Usage
pip install webtree
webtree scrape --site=https://www.google.com
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
webtree-0.0.17-py3-none-any.whl
(12.1 kB
view details)
File details
Details for the file webtree-0.0.17-py3-none-any.whl
.
File metadata
- Download URL: webtree-0.0.17-py3-none-any.whl
- Upload date:
- Size: 12.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/49.6.0 requests-toolbelt/0.8.0 tqdm/4.48.2 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | c1a4e063faf130b6516b8c94e0385ac50e8154d9db6c251ddf2eaaff79558480 |
|
MD5 | 56907b4e5140c39b3892f45cdb38dfeb |
|
BLAKE2b-256 | f9774c4cfe22827a03fe3d1ae20f395a45d923b670c2733102e8466712b3333d |