A package for analysing webpages

WEBPAGE ANALYSIS TOOL

This tool extracts the following insights from a webpage:

  • all unique tags used in the document.
  • the most commonly used tag.
  • the longest path from the root node to a descendant.
  • the longest path from the root node in which the most commonly used tag appears the most times.

Installation

    $ pip install webanalytictool

Then import the class in Python:

    >>> from analytic import WebPageAnalyticTool

Usage

To create a WebPageAnalyticTool object, pass it the URL of the page to analyse:

    >>> url = 'https://www.bbc.com/sport/football'
    >>> wat = WebPageAnalyticTool(url)
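
How the tool downloads the page is not documented here. As a rough illustration only, fetching the HTML for a URL with the standard library could look like the sketch below. The function name fetch_html, the User-Agent string, and the timeout are assumptions made for this example and are not part of webanalytictool.

```python
from urllib.request import Request, urlopen


def fetch_html(url, timeout=10.0):
    """Download a page and return its body as text (hypothetical helper)."""
    # Some servers reject requests without a User-Agent header; this value is arbitrary.
    request = Request(url, headers={"User-Agent": "webpage-analysis-sketch/0.1"})
    with urlopen(request, timeout=timeout) as response:
        # Fall back to UTF-8 when the response does not declare a charset.
        charset = response.headers.get_content_charset() or "utf-8"
        return response.read().decode(charset, errors="replace")
```

The returned string can then be handed to any HTML parser.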

To get the longest path from the root node to a descendant:

    >>> wat.get_longest_path
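
To make "longest path" concrete, here is a minimal sketch that computes it with the standard library's html.parser. It is not the package's implementation; the class and function names are hypothetical, and it assumes reasonably well-formed HTML.

```python
from html.parser import HTMLParser

# Void elements never have a closing tag, so they must not stay on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class LongestPathParser(HTMLParser):
    """Track the stack of open tags and remember the deepest path seen."""

    def __init__(self):
        super().__init__()
        self._stack = []
        self.longest = []

    def handle_starttag(self, tag, attrs):
        path = self._stack + [tag]
        if len(path) > len(self.longest):
            self.longest = path
        if tag not in VOID_TAGS:
            self._stack.append(tag)

    def handle_endtag(self, tag):
        # Ignore stray end tags; otherwise pop back to the matching open tag.
        if tag in self._stack:
            while self._stack.pop() != tag:
                pass


def longest_path(html):
    """Return the tag names from the root down to the deepest element."""
    parser = LongestPathParser()
    parser.feed(html)
    parser.close()
    return parser.longest
```

When several paths share the maximum depth, this sketch keeps the first one encountered in document order.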

To get the longest path from the root node in which the most commonly used tag appears the most times:

    >>> wat.get_longest_path_with_most_common_tag
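
The description above can be read in more than one way. The sketch below assumes one interpretation: find the most common tag, then pick the root-to-element path containing the most occurrences of it, preferring the longer path on a tie. It uses only the standard library, its names are hypothetical, and it may not match what webanalytictool actually returns.

```python
from collections import Counter
from html.parser import HTMLParser

# Void elements never have a closing tag, so they must not stay on the stack.
VOID_TAGS = {"area", "base", "br", "col", "embed", "hr", "img", "input",
             "link", "meta", "source", "track", "wbr"}


class PathCollector(HTMLParser):
    """Record the root-to-element path of every element in the document."""

    def __init__(self):
        super().__init__()
        self._stack = []
        self.paths = []

    def handle_starttag(self, tag, attrs):
        self.paths.append(self._stack + [tag])
        if tag not in VOID_TAGS:
            self._stack.append(tag)

    def handle_endtag(self, tag):
        # Ignore stray end tags; otherwise pop back to the matching open tag.
        if tag in self._stack:
            while self._stack.pop() != tag:
                pass


def path_with_most_common_tag(html):
    """Return (most common tag, path that contains it the most times)."""
    collector = PathCollector()
    collector.feed(html)
    collector.close()
    if not collector.paths:
        return None, []
    # Each path ends at exactly one element, so the last items count every tag.
    counts = Counter(path[-1] for path in collector.paths)
    top_tag = counts.most_common(1)[0][0]
    best = max(collector.paths, key=lambda p: (p.count(top_tag), len(p)))
    return top_tag, best
```

Note that the result can differ from the plain longest path: a shorter branch wins if it contains more occurrences of the most common tag.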

To get the most commonly used tag:

    >>> wat.get_most_common_tags
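
Counting tags is the simplest of these analyses. As an illustration, assuming nothing about the package's own code, a tag frequency count can be sketched with html.parser and collections.Counter. The names TagCounter and most_common_tags are hypothetical.

```python
from collections import Counter
from html.parser import HTMLParser


class TagCounter(HTMLParser):
    """Count how many times each tag is opened in the document."""

    def __init__(self):
        super().__init__()
        self.counts = Counter()

    def handle_starttag(self, tag, attrs):
        # html.parser lowercases tag names, so <P> and <p> are counted together.
        self.counts[tag] += 1


def most_common_tags(html, n=1):
    """Return the n most frequent tags as (tag, count) pairs."""
    parser = TagCounter()
    parser.feed(html)
    parser.close()
    return parser.counts.most_common(n)
```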

To get all unique tags used in the document:

    >>> wat.get_unique_tags
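
For comparison, a standalone sketch of collecting unique tags with the standard library might look like this. The names are hypothetical, and the sorted return order is a choice made for this example, not something the package guarantees.

```python
from html.parser import HTMLParser


class UniqueTagCollector(HTMLParser):
    """Collect the set of distinct tag names that appear in the document."""

    def __init__(self):
        super().__init__()
        self.tags = set()

    def handle_starttag(self, tag, attrs):
        # Tag names arrive lowercased, so <HTML> and <html> are the same tag.
        self.tags.add(tag)


def unique_tags(html):
    """Return the distinct tag names in alphabetical order."""
    parser = UniqueTagCollector()
    parser.feed(html)
    parser.close()
    return sorted(parser.tags)
```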

Download files

Source Distribution

webanalytictool-0.0.2.tar.gz (16.3 kB)

Built Distribution

webanalytictool-0.0.2-py3-none-any.whl (16.2 kB, Python 3)