A framework for creating web content extractors
Project description
Scrapple is a framework for creating web scrapers and web crawlers according to a key-value based configuration file. It provides a command line interface to run the script on a given JSON-based configuration input, as well as a web interface to provide the necessary input.
You can install Scrapple by using
$ sudo apt-get install libxml2-dev libxslt-dev python-dev lib32z1-dev $ pip install scrapple
You can read the complete documentation.
Maintained by Alex Mathew and Harish Balakrishnan.
History
0.2.4 - 2015-04-13
Update documentation
Minor fixes
0.2.3 - 2015-03-11
Include implementation to use csv as the output format
0.2.2 - 2015-02-22
Fix bug in generate script template
0.2.1 - 2015-02-21
Update tests
0.2.0 - 2015-02-20
Include implementation for scrapple run and scrapple generate for crawlers
Modify web interface for editing scraper config files
Revise skeleton configuration files
0.1.1 - 2015-02-10
Release on PyPI with revisions
Include web interface for editing scraper config files
Modified implementations of certain functions
0.1.0 - 2015-02-04
First release on PyPI
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Hashes for scrapple-0.2.4.linux-i686.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8dd61a2da06d3cfb964d694c35c5db836cbbc958830a32f7157781248a9143cb |
|
MD5 | 08392c7fa7e8c08160b330726b725b79 |
|
BLAKE2b-256 | c0f9efce9bc3abcefaaef8971b89472a1e6589c4f3bc25db659ff5ed275a2fc2 |
Hashes for scrapple-0.2.4-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5d1b32cd9a1ebb5af71917728fd6d4da3703124369c10a89a99c623e08b4f689 |
|
MD5 | 6dca155bc513271ff990f32d3ede706a |
|
BLAKE2b-256 | 8835431afabd28a3de19e6c3f8e2c28644badcff726a8628e4d6980bac430102 |