

===============
Page Size Check
===============

Page Size Check is a utility that checks the size of the pages listed in a sitemap, together with their resources, by
parsing the HAR file of each request using Selenium and haralyzer. Each run produces several files that let the user
analyse the number of requests and their sizes. The run uses ThreadPoolExecutor to launch the browsers in parallel.
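
The parallel execution can be pictured with the following minimal sketch. It is not the project's code: :code:`measure_page` and :code:`check_pages` are hypothetical names, and the placeholder body stands in for the real work of launching a browser and recording a HAR file.

```python
from concurrent.futures import ThreadPoolExecutor


def measure_page(url):
    """Hypothetical stand-in for the real work (launch a browser, record a HAR)."""
    # Placeholder result so the sketch runs without Selenium or a proxy.
    return {"url": url, "entries": 0, "size_bytes": 0}


def check_pages(urls, threads=4):
    """Process every URL in a thread pool and return the results in input order."""
    with ThreadPoolExecutor(max_workers=threads) as executor:
        # executor.map preserves the order of the input iterable.
        return list(executor.map(measure_page, urls))


results = check_pages(
    ["https://example.com/", "https://example.com/about"], threads=2
)
```

Because :code:`executor.map` keeps the input order, the results can be written out in the same order as the URLs appear in the sitemap.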

Dependencies
------------

* Git.
* Python 3.5 or higher, `pip`_ and virtualenvwrapper.
* System requirements listed in :code:`requirements-sys.txt`.

Installation
------------

Currently the project is only available on GitHub.

#. Clone project from repo ::

    git clone git@github.com:APSL/page-size-check.git

#. Set up the Python virtualenv ::

    cd page-size-check
    mkvirtualenv "page_size_check" -p python3 -a .
    pip install -r requirements.txt

#. Branches ::

    master --> production environment. Bug-fix branches start from here.

#. Download ::

    - Firefox webdriver for Selenium: geckodriver
    - Browsermob-proxy

#. Execution ::

    python page_size_check.py --sitemap_url="sitemap.url" [--help]

#. Output ::

    - URL summary file: a summary of the URLs with the number of entries, the page size and the page load times
    - Resources list file: a list of the resources on every page with their MIME type, size and load time
    - MIME type resources: a summary of the resources grouped by MIME type for each URL in the sitemap
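
To illustrate how the three outputs relate to a HAR file, here is a rough sketch that works on a HAR document loaded as a plain dictionary. It does not use haralyzer and is not the project's implementation: the function name :code:`summarise_har` is hypothetical, and counting :code:`response.content.size` as the resource size is an assumption. The field names follow the HAR 1.2 format.

```python
from collections import defaultdict


def summarise_har(har):
    """Return (page summary, resource list, per-MIME-type summary) for one HAR dict."""
    entries = har["log"]["entries"]
    resources = []
    by_mimetype = defaultdict(lambda: {"count": 0, "size": 0})

    for entry in entries:
        content = entry["response"]["content"]
        # mimeType may carry parameters such as "; charset=utf-8"; keep the bare type.
        mimetype = content.get("mimeType", "").split(";")[0].strip() or "unknown"
        # Assumption: use content.size; negative values mean "not available".
        size = max(content.get("size", 0), 0)

        resources.append({
            "url": entry["request"]["url"],
            "mimetype": mimetype,
            "size": size,
            "time_ms": entry["time"],
        })
        by_mimetype[mimetype]["count"] += 1
        by_mimetype[mimetype]["size"] += size

    page = {"entries": len(entries), "size": sum(r["size"] for r in resources)}
    return page, resources, dict(by_mimetype)


# Tiny hand-written HAR fragment, used for illustration only.
sample_har = {"log": {"entries": [
    {"request": {"url": "https://example.com/"}, "time": 85.0,
     "response": {"content": {"mimeType": "text/html; charset=utf-8", "size": 1200}}},
    {"request": {"url": "https://example.com/logo.png"}, "time": 40.5,
     "response": {"content": {"mimeType": "image/png", "size": 5000}}},
    {"request": {"url": "https://example.com/hero.png"}, "time": 22.0,
     "response": {"content": {"mimeType": "image/png", "size": 3000}}},
]}}

page, resources, by_mimetype = summarise_har(sample_har)
```

Here :code:`page` corresponds to one row of the URL summary, :code:`resources` to the resources list, and :code:`by_mimetype` to the per-MIME-type summary for that URL.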



Usage
-----
As shown above, you can run :code:`python page_size_check.py --help` to see all the parameters that you can set:

--browsermob_server_path TEXT  Browsermob Server Path.
--firefox_driver_path TEXT     Firefox Driver Path.
--sitemap_url TEXT             Sitemap to get urls.
--threads INTEGER              Number of threads.
--help                         Show this message and exit.
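
The :code:`--sitemap_url` option points to the sitemap from which the page URLs are read. As a minimal sketch of that step, assuming a standard sitemaps.org :code:`urlset` document, the URLs can be extracted with the standard library. The function name :code:`urls_from_sitemap` is hypothetical and this is not necessarily how the script does it.

```python
import xml.etree.ElementTree as ET

# Namespace defined by the sitemaps.org protocol.
SITEMAP_NS = {"sm": "http://www.sitemaps.org/schemas/sitemap/0.9"}


def urls_from_sitemap(xml_text):
    """Return the <loc> value of every <url> element in a sitemap document."""
    root = ET.fromstring(xml_text)
    return [
        loc.text.strip()
        for loc in root.findall("sm:url/sm:loc", SITEMAP_NS)
        if loc.text
    ]


# Hand-written sample sitemap, used for illustration only.
SAMPLE_SITEMAP = """<urlset xmlns="http://www.sitemaps.org/schemas/sitemap/0.9">
  <url><loc>https://example.com/</loc></url>
  <url><loc>https://example.com/about</loc></url>
</urlset>"""

urls = urls_from_sitemap(SAMPLE_SITEMAP)
```

The resulting list is what a worker pool sized by :code:`--threads` would iterate over.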

Contributing
------------

Contributions are very welcome. Please open a `pull request`_ or `file an issue`_.
Tests will be added as soon as possible. Please ensure that coverage at least stays the same
before you submit a pull request.

License
-------

Distributed under the terms of the `MIT`_ license, "page-size-check" is free and open source software.


Issues
------

If you encounter any problems, please `file an issue`_ along with a detailed description.

.. _`pip`: https://pypi.python.org/pypi/pip/
.. _`pull request`: https://github.com/APSL/page-size-check/pulls
.. _`file an issue`: https://github.com/APSL/page-size-check/issues
.. _`MIT`: http://opensource.org/licenses/MIT
