Utility to check the size of the pages listed in a sitemap, and of their resources, by parsing the HAR file of each request.
===============
Page Size Check
===============
Page Size Check is a utility that checks the size of the pages listed in a sitemap, and of their resources, by parsing
the HAR file of each request using Selenium and haralyzer. Running it produces several files that let the user analyse
the number of requests and their sizes. It uses ThreadPoolExecutor to launch the browsers in parallel.
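The parallel execution mentioned above can be sketched as follows. This is a minimal illustration and not the project's
actual code: ``check_pages`` and ``fake_measure`` are hypothetical names, and ``fake_measure`` stands in for the real
work of opening a browser, loading the page and reading its HAR.

```python
from concurrent.futures import ThreadPoolExecutor, as_completed


def check_pages(urls, measure, threads=4):
    """Run measure(url) for every URL in a thread pool and return {url: result}."""
    results = {}
    with ThreadPoolExecutor(max_workers=threads) as executor:
        # Map each submitted future back to the URL it belongs to.
        futures = {executor.submit(measure, url): url for url in urls}
        for future in as_completed(futures):
            url = futures[future]
            try:
                results[url] = future.result()
            except Exception as exc:
                # One failing page should not stop the remaining ones.
                results[url] = exc
    return results


def fake_measure(url):
    # Hypothetical stand-in for "launch browser, load page, collect HAR".
    return len(url)


if __name__ == "__main__":
    print(check_pages(["https://example.com/", "https://example.com/about"], fake_measure, threads=2))
```

Collecting results with ``as_completed`` lets slow pages finish late without blocking the faster ones.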
Dependencies
------------
* Git.
* Python 3.5 or higher, `pip`_ and virtualenvwrapper.
* System requirements listed in requirements-sys.txt.
Installation
------------
Currently the project is only available on GitHub.

#. Clone the project from the repository::

      git clone git@github.com:APSL/page-size-check.git

#. Set up the Python virtualenv::

      cd page-size-check
      mkvirtualenv "page_size_check" -p python3 -a .
      pip install -r requirements.txt

#. Branches:

   - ``master``: production environment. Bug-fix branches start from here.

#. Download:

   - Firefox WebDriver for Selenium: geckodriver
   - Browsermob-proxy

#. Run::

      python page_size_check.py --sitemap_url="sitemap.url" [--help]

#. Output:

   - URL summary file: a summary of the URLs with the number of entries, the page size and the page load times.
   - Resources list file: a list of the resources on every page with their MIME type, size and load time.
   - MIME type resources file: a summary of the resources grouped by MIME type for each URL of the sitemap.

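To illustrate the kind of grouping behind the MIME type summary, here is a minimal sketch that works on a HAR document
already loaded as a Python dictionary. It relies only on the standard HAR 1.2 fields (``log.entries``,
``response.content.mimeType``, ``response.bodySize``). The function name ``summarize_by_mimetype`` is hypothetical,
and the project itself may compute its summary differently, for example through haralyzer.

```python
from collections import defaultdict


def summarize_by_mimetype(har):
    """Group HAR entries by MIME type: {mimetype: {"requests": n, "bytes": total}}."""
    summary = defaultdict(lambda: {"requests": 0, "bytes": 0})
    for entry in har["log"]["entries"]:
        response = entry["response"]
        raw_type = response.get("content", {}).get("mimeType", "")
        # Drop parameters such as "; charset=utf-8".
        mimetype = raw_type.split(";")[0].strip() or "unknown"
        size = response.get("bodySize", -1)
        summary[mimetype]["requests"] += 1
        # HAR uses -1 when the size is not available, so count it as 0.
        summary[mimetype]["bytes"] += max(size, 0)
    return dict(summary)


# Tiny hand-written HAR fragment, for illustration only.
sample_har = {
    "log": {
        "entries": [
            {"response": {"bodySize": 1200, "content": {"mimeType": "text/html; charset=utf-8"}}},
            {"response": {"bodySize": 300, "content": {"mimeType": "image/png"}}},
            {"response": {"bodySize": -1, "content": {"mimeType": "image/png"}}},
        ]
    }
}

if __name__ == "__main__":
    print(summarize_by_mimetype(sample_har))
```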
Usage
-----
As shown above, you can run :code:`python page_size_check.py --help` to see all the
parameters that you can set::

    --browsermob_server_path TEXT  Browsermob Server Path.
    --firefox_driver_path TEXT     Firefox Driver Path.
    --sitemap_url TEXT             Sitemap to get urls.
    --threads INTEGER              Number of threads.
    --help                         Show this message and exit.

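For readers who want to see how options of this shape can be declared, the sketch below mirrors the option names and
help texts listed above using the standard library's ``argparse``. It is an illustration only: the project's real
command-line code is not shown here, and marking ``--sitemap_url`` as required and leaving the other options without
defaults are assumptions.

```python
import argparse


def build_parser():
    """Build a parser that mirrors the options listed in the help output."""
    parser = argparse.ArgumentParser(prog="page_size_check.py")
    # Assumption: the sitemap URL is mandatory; the real script may differ.
    parser.add_argument("--sitemap_url", required=True, help="Sitemap to get urls.")
    parser.add_argument("--threads", type=int, default=None, help="Number of threads.")
    parser.add_argument("--browsermob_server_path", default=None, help="Browsermob Server Path.")
    parser.add_argument("--firefox_driver_path", default=None, help="Firefox Driver Path.")
    return parser


# Parse an explicit argument list instead of sys.argv, for demonstration.
args = build_parser().parse_args(["--sitemap_url", "https://example.com/sitemap.xml", "--threads", "4"])

if __name__ == "__main__":
    print(args.sitemap_url, args.threads)
```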
Contributing
------------
Contributions are very welcome. Please open a `pull request`_ or `file an issue`_.
Tests will be ready as soon as possible; please ensure that coverage at least stays the same
before you submit a pull request.
License
-------
Distributed under the terms of the `MIT`_ license, "page-size-check" is free and open source software.
Issues
------
If you encounter any problems, please `file an issue`_ along with a detailed description.
.. _`pip`: https://pypi.python.org/pypi/pip/
.. _`pull request`: https://github.com/APSL/page-size-check/pulls
.. _`file an issue`: https://github.com/APSL/page-size-check/issues
.. _`MIT`: http://opensource.org/licenses/MIT