Skip to main content

Displays HTML page as a Graph

Project description

WebTree

Project Description

All HTML documents are structured as trees. This project provides code to parse and visualise HTML documents as graphs.
Features of the project are:

  • View any html page (using page source) as a graph (collection of connected nodes).
  • Search graph by html tags, tag attributes, strings etc.
  • Find shortest path between HTML node to any other node in graph, for easier web-scraping.

Usage

pip install webtree
webtree scrape --site=https://www.google.com

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distributions

No source distribution files available for this release.See tutorial on generating distribution archives.

Built Distribution

webtree-0.0.17-py3-none-any.whl (12.1 kB view details)

Uploaded Python 3

File details

Details for the file webtree-0.0.17-py3-none-any.whl.

File metadata

  • Download URL: webtree-0.0.17-py3-none-any.whl
  • Upload date:
  • Size: 12.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/49.6.0 requests-toolbelt/0.8.0 tqdm/4.48.2 CPython/3.8.5

File hashes

Hashes for webtree-0.0.17-py3-none-any.whl
Algorithm Hash digest
SHA256 c1a4e063faf130b6516b8c94e0385ac50e8154d9db6c251ddf2eaaff79558480
MD5 56907b4e5140c39b3892f45cdb38dfeb
BLAKE2b-256 f9774c4cfe22827a03fe3d1ae20f395a45d923b670c2733102e8466712b3333d

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page