HTML scraper with templates
# weakscraper HTML scraper with templates
Most HTML pages are generated using templates. Why not use templates too for scraping HTML pages? As for a template language, let’s use HTML plus a few keywords. That way, the workflow with weakscraper is the following : * Get the source of a HTML page you want to scrap. * Using a few keywords, edit the HTML to select which information is of interest and which parts to discard. * If complicated processing is required, write additional callbacks. * Run weakscraper on the template and on the HTML.
## Pros * Observes the [rule of least power](https://en.wikipedia.org/wiki/Rule_of_least_power). A declarative language helps to focus on what to keep. How the information is scrapped is the job of the library.
## How it works ?
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
|Filename, Size & Hash SHA256 Hash Help||File Type||Python Version||Upload Date|
(14.0 kB) Copy SHA256 Hash SHA256
|Wheel||py3||Mar 28, 2016|
(7.4 kB) Copy SHA256 Hash SHA256
|Source||None||Mar 28, 2016|