A package for analysing webpages
Project description
WEBPAGE ANALYSIS TOOL
This tool extracts insight from a webpage. The insights that can be extracted are:
- all unique tags used in the document.
- the most commonly used tag.
- the longest path starting from root node to the descendent.
- the longest path starting from root node where the most popular tag is used the most times.
Installation
$ pip install webanalytictool
from analytic import WebPageAnalyticTool
Usage
To create an object of WebPageAnalyticTool,
$ url = 'https://www.bbc.com/sport/football'
$ wat = WebPageAnalyticTool(url)
To get longest path starting from root node to the descendent
$ wat.get_longest_path
To get the longest path starting from root node where the most popular tag is used the most times
$ wat.get_longest_path_with_most_common_tag
To get the most commonly used tag
$ wat.get_most_common_tags
To get all unique tags used in the document
$ wat.get_unique_tags
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
webanalytictool-0.0.2.tar.gz
(16.3 kB
view details)
Built Distribution
File details
Details for the file webanalytictool-0.0.2.tar.gz
.
File metadata
- Download URL: webanalytictool-0.0.2.tar.gz
- Upload date:
- Size: 16.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5e9ea47d370b8907fb487ef70b745915acea24d65b826ac3d3adfdbef247843d |
|
MD5 | 60230876937acff353f291391c175c46 |
|
BLAKE2b-256 | 82a7aae8e6e94dc309ef66db7206fa06f06300f8363f4241bd95d48466f932ca |
File details
Details for the file webanalytictool-0.0.2-py3-none-any.whl
.
File metadata
- Download URL: webanalytictool-0.0.2-py3-none-any.whl
- Upload date:
- Size: 16.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.4.2 importlib_metadata/4.8.1 pkginfo/1.7.1 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.62.2 CPython/3.8.8
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 10cc55d8b7f63e7765383860a0e2bebc38dee5a0f17f19884e9c7f5bed1dbb00 |
|
MD5 | 70604b1487490df63eb9b3d41e04f0e1 |
|
BLAKE2b-256 | 1a8e54e4215a7ba797295ef7ec80db542e254c5d287f1319cafcacf4a9cd88c0 |