# weakscraper

HTML scraper with templates

Most HTML pages are generated using templates. Why not use templates for scraping HTML pages too? For the template language, let's use HTML plus a few keywords. The workflow with weakscraper is then the following:

* Get the source of an HTML page you want to scrape.
* Using a few keywords, edit the HTML to select which information is of interest and which parts to discard.
* If complicated processing is required, write additional callbacks.
* Run weakscraper on the template and on the HTML.
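
The workflow above can be illustrated with a toy sketch built only on Python's standard library. It is not weakscraper's API and does not use weakscraper's keywords: the `{{name}}` placeholder syntax, the `tokenize` and `scrape` functions, and the `callbacks` argument are all hypothetical, chosen only to show the idea of matching a page against an edited copy of its own HTML.

```python
from html.parser import HTMLParser


class _Tokenizer(HTMLParser):
    """Flatten an HTML document into a list of (kind, value) tokens."""

    def __init__(self):
        super().__init__()
        self.tokens = []

    def handle_starttag(self, tag, attrs):
        self.tokens.append(("start", tag))

    def handle_endtag(self, tag):
        self.tokens.append(("end", tag))

    def handle_data(self, data):
        text = data.strip()
        if text:  # skip whitespace-only text between tags
            self.tokens.append(("text", text))


def tokenize(html):
    """Return the tag and text tokens of an HTML string, in document order."""
    parser = _Tokenizer()
    parser.feed(html)
    parser.close()
    return parser.tokens


def scrape(template, page, callbacks=None):
    """Walk template and page in lockstep and capture placeholder values.

    Hypothetical convention: a template text node of the form {{name}}
    keeps the matching page text under `name`; any other text is discarded.
    """
    callbacks = callbacks or {}
    t_tokens, p_tokens = tokenize(template), tokenize(page)
    if len(t_tokens) != len(p_tokens):
        raise ValueError("page does not have the same structure as the template")

    result = {}
    for (t_kind, t_val), (p_kind, p_val) in zip(t_tokens, p_tokens):
        if t_kind != p_kind:
            raise ValueError(f"expected {t_kind}, found {p_kind}")
        if t_kind == "text":
            if t_val.startswith("{{") and t_val.endswith("}}"):
                name = t_val[2:-2].strip()
                # Optional callback for processing the template cannot express.
                convert = callbacks.get(name, lambda value: value)
                result[name] = convert(p_val)
        elif t_val != p_val:
            raise ValueError(f"expected <{t_val}>, found <{p_val}>")
    return result


# The template is the page source, edited to mark what to keep.
template = (
    "<html><body><h1>{{title}}</h1>"
    "<p>Price:</p><span>{{price}}</span></body></html>"
)
page = (
    "<html><body><h1>Blue Widget</h1>"
    "<p>Price:</p><span>12.50</span></body></html>"
)

result = scrape(template, page, callbacks={"price": float})
print(result)  # {'title': 'Blue Widget', 'price': 12.5}
```

The template stays declarative: it says which text to keep, while the matching loop and the optional callback carry the procedural work. A real template language needs more than this sketch offers, for example ways to skip optional or repeated elements.
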
## Pros

* Observes the [rule of least power](https://en.wikipedia.org/wiki/Rule_of_least_power). A declarative language helps you focus on what to keep; how the information is scraped is the library's job.

## How it works

| Filename, size | File type | Python version | Upload date |
|---|---|---|---|
| weakscraper-0.0.1-py3-none-any.whl (14.0 kB) | Wheel | py3 | Mar 28, 2016 |
| weakscraper-0.0.1.tar.gz (7.4 kB) | Source | None | Mar 28, 2016 |