HTML scraper with templates

Project Description

# weakscraper

HTML scraper with templates

## Description

Most HTML pages are generated from templates. Why not use templates for scraping HTML pages too? The template language is HTML plus a few keywords. The workflow with weakscraper is the following:

* Get the source of an HTML page you want to scrape.
* Using a few keywords, edit the HTML to select which information is of interest and which parts to discard.
* If complicated processing is required, write additional callbacks.
* Run weakscraper on the template and on the HTML.
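The workflow above can be illustrated with a minimal, self-contained sketch. It does not use weakscraper's actual API or keywords: the `x-capture` and `x-ignore` attributes, the `scrape` function, and the `callbacks` argument are all hypothetical names chosen for this example, and the standard-library XML parser stands in for a real HTML parser, so it assumes well-formed markup.

```python
import xml.etree.ElementTree as ET

# Hypothetical attribute names, chosen for this sketch only;
# they are not weakscraper's actual keywords.
CAPTURE = "x-capture"  # store the element's text under this name
IGNORE = "x-ignore"    # skip this element and everything below it


def scrape(template, page, out, callbacks=None):
    """Walk template and page in parallel, filling `out` with captured text."""
    if template.tag != page.tag:
        raise ValueError(f"expected <{template.tag}>, found <{page.tag}>")
    if IGNORE in template.attrib:
        return out
    if CAPTURE in template.attrib:
        name = template.attrib[CAPTURE]
        text = (page.text or "").strip()
        # Optional per-field callback for extra processing.
        convert = (callbacks or {}).get(name, lambda s: s)
        out[name] = convert(text)
        return out
    t_children, p_children = list(template), list(page)
    if len(t_children) != len(p_children):
        raise ValueError(f"child count mismatch under <{template.tag}>")
    for t_child, p_child in zip(t_children, p_children):
        scrape(t_child, p_child, out, callbacks)
    return out


# The template is the page source, edited to mark what to keep or discard.
TEMPLATE = """
<html>
  <head x-ignore=""/>
  <body>
    <h1 x-capture="title"/>
    <p x-capture="temperature"/>
  </body>
</html>
"""

PAGE = """
<html>
  <head><title>Ignored</title></head>
  <body>
    <h1>Weather report</h1>
    <p>21</p>
  </body>
</html>
"""

result = scrape(
    ET.fromstring(TEMPLATE.strip()),
    ET.fromstring(PAGE.strip()),
    {},
    callbacks={"temperature": int},
)
print(result)  # {'title': 'Weather report', 'temperature': 21}
```

The template says only what to keep (`x-capture`) and what to discard (`x-ignore`); how the page is traversed is left to the matching function, and the callback handles the one field that needs conversion.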

## Pros

* Observes the rule of least power. A declarative language helps to focus on what to keep. How the information is scraped is the job of the library.

## Cons

## Examples

## How it works

## License



Download Files

| File Name | Size | Version | File Type | Upload Date |
|---|---|---|---|---|
| weakscraper-0.0.1-py3-none-any.whl | 14.0 kB | py3 | Wheel | Mar 28, 2016 |
| weakscraper-0.0.1.tar.gz | 7.4 kB | | Source | Mar 28, 2016 |

