Grab from a remote page all the resources that a browser would probably download when visiting that page

Project description

Quick info

Let's say you need to run siege, the HTTP load testing and benchmarking utility, against a web page, and you also want to use its --internet option to simulate the behavior of a web browser as closely as possible.

When a web browser loads a page, it also loads all the resources referenced inside that page:

  • Images
  • JavaScript files
  • CSS

So you need a list of the URLs of all the resources referenced by that page.

This utility (its name means “You Will Download”) simply creates this list for you.

Redirect the utility's output to a file, then pass that file to siege with the --file option.

Usage

$ youlldownload http://host.com/section/page

Using with siege:

$ youlldownload http://host.com/section/page > list.txt
$ siege -i -f list.txt [other options]

Taken resources

  • from script tags we’ll take the src URL
  • from link tags with rel equal to stylesheet we’ll take the href URL
  • from img tags we’ll take the src URL
  • from object tags we’ll take the data URL
  • from embed tags we’ll take the src URL
  • from style tags we’ll take the URL inside any “@import url” directive the tag contains
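
The rules above can be illustrated with a minimal sketch built on the Python standard library. This is not the utility's actual implementation. The names ResourceCollector, collect_resources, URL_ATTRIBUTES and IMPORT_RE are hypothetical, and resolving relative URLs against the page URL with urljoin is an assumption made so that the resulting list holds absolute URLs.

```python
import re
from html.parser import HTMLParser
from urllib.parse import urljoin

# Hypothetical mapping: tag name -> attribute that holds the resource URL.
URL_ATTRIBUTES = {
    "script": "src",
    "img": "src",
    "embed": "src",
    "object": "data",
}

# Matches @import url(...) inside a style block, with optional quotes.
IMPORT_RE = re.compile(
    r"""@import\s+url\(\s*['"]?([^'")\s]+)['"]?\s*\)""", re.IGNORECASE
)


class ResourceCollector(HTMLParser):
    """Collects resource URLs following the rules listed above."""

    def __init__(self, page_url):
        super().__init__()
        self.page_url = page_url
        self.urls = []
        self._in_style = False

    def _add(self, url):
        # Assumption: relative URLs are resolved against the page URL.
        if url:
            self.urls.append(urljoin(self.page_url, url))

    def handle_starttag(self, tag, attrs):
        attrs = dict(attrs)
        if tag in URL_ATTRIBUTES:
            self._add(attrs.get(URL_ATTRIBUTES[tag]))
        elif tag == "link":
            # Only link tags whose rel includes "stylesheet" are taken.
            rel = (attrs.get("rel") or "").lower().split()
            if "stylesheet" in rel:
                self._add(attrs.get("href"))
        elif tag == "style":
            self._in_style = True

    def handle_endtag(self, tag):
        if tag == "style":
            self._in_style = False

    def handle_data(self, data):
        # Inside a style tag, look for @import url(...) directives.
        if self._in_style:
            for url in IMPORT_RE.findall(data):
                self._add(url)


def collect_resources(html, page_url):
    """Return the resource URLs found in html, in document order."""
    parser = ResourceCollector(page_url)
    parser.feed(html)
    parser.close()
    return parser.urls
```

Printing each item returned by collect_resources on its own line would give a file in the one-URL-per-line shape that the siege --file option reads.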

Authors

This product was developed by the RedTurtle Technology team.

RedTurtle Technology Site

Changelog

0.2 (2014-04-02)

  • Added support for src attribute of iframe tag [keul]
  • Added support for src attribute of source tag (HTML 5 video element) [keul]
  • Do not break if base tag is not present [keul]

0.1 (2013-01-30)

  • initial release

