Skip to main content

Crawl rendered JavaScript templates from a local server.

Project description

Command line tool that crawls a local webserver with a headless browser and outputs static html files. Works well with html5validator to validate HTML5 from dynamic content with Javascript.

https://travis-ci.org/svenkreiss/localcrawl.svg?branch=master

Run localcrawl --help:

https://raw.githubusercontent.com/svenkreiss/localcrawl/master/docs/help.png

PhantomJS is required. It is pre-installed on TravisCI. On a Mac run brew install PhantomJS.

Example Command

localcrawl --start _build/html/index.html --out _crawled/ --depth 3

Mustache Example

This can be used to convert templated files to HTML files (e.g. for validation with html5validator).

Input:

<html>
<head>
  <title>Mustache Test</title>
</head>
<body>
  <div id="output"></div>

  <script type="text/javascript" src="https://cdnjs.cloudflare.com/ajax/libs/mustache.js/2.2.1/mustache.min.js"></script>
  <script>
    var data = {
      item: 'Fork',
      price: function() { return (1.10 * 1.08).toFixed(2); },
    };
    var html = Mustache.render('{{item}}: <b>${{price}}</b>', data);
    document.getElementById('output').innerHTML = html;
  </script>
</body>
</html>

The crawled output includes the output from processing the template (Fork: <b>$1.19</b>):

<html><head>
  <title>Mustache Test</title>
</head>
<body>
  <div id="output">Fork: <b>$1.19</b></div>

  <script type="text/javascript" src="https://cdnjs.cloudflare.com/ajax/libs/mustache.js/2.2.1/mustache.min.js"></script>
  <script>
    var data = {
      item: 'Fork',
      price: function() { return (1.10 * 1.08).toFixed(2); },
    };
    var html = Mustache.render('{{item}}: <b>${{price}}</b>', data);
    document.getElementById('output').innerHTML = html;
  </script>


</body></html>

Should play nice with:

JavaScript template engines / JS frameworks:

Static site generators:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

localcrawl-0.2.3.tar.gz (5.2 kB view details)

Uploaded Source

Built Distribution

localcrawl-0.2.3-py2.py3-none-any.whl (6.0 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file localcrawl-0.2.3.tar.gz.

File metadata

  • Download URL: localcrawl-0.2.3.tar.gz
  • Upload date:
  • Size: 5.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for localcrawl-0.2.3.tar.gz
Algorithm Hash digest
SHA256 9b222f2ee69860a7b4c96ecb7f98aaf71d24cee55d1441b737c59a5811dc17a5
MD5 65787f364806f7ff4693a0e479e6276b
BLAKE2b-256 1a9653ac58886509486ea2181677d33f809e0a2b5644b5a59bb5f80cf3e58631

See more details on using hashes here.

File details

Details for the file localcrawl-0.2.3-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for localcrawl-0.2.3-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 2f7c629bc110bf3f3b0c8773ab87f8c11c5adc85bcb2d2a1000a1aad382f4c7f
MD5 27f9aeeb5af2eecf990dc5a243698797
BLAKE2b-256 5a94dab8a257e468a5f91b5bc3ade9cd92d6c5c33c97989a3643f2213ff3b16e

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page