A framework for creating web content extractors
Project description
Scrapple is a framework for creating web scrapers and web crawlers according to a key-value based configuration file. It provides a command line interface to run the script on a given JSON-based configuration input, as well as a web interface to provide the necessary input.
You can install Scrapple by using
$ sudo apt-get install libxml2-dev libxslt-dev python-dev lib32z1-dev $ pip install scrapple
You can read the complete documentation.
History
0.1.0 - 2015-02-04
First release on PyPI
0.1.1 - 2015-02-10
Release on PyPI with revisions
Include web interface for editing scraper config files
Modified implementations of certain functions
0.2.0 - 2015-02-20
Include implementation for scrapple run and scrapple generate for crawlers
Modify web interface for editing scraper config files
Revise skeleton configuration files
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Hashes for scrapple-0.2.0.linux-i686.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8cd2ac78ab62b5db7625930dc75d89ae4cb721ac8422eae9e9c9262b336f0cc1 |
|
MD5 | 84fc05be52a46aecb609dd01cc9d931f |
|
BLAKE2b-256 | 03587f6a6649d3e0572fc27e2f193d8bf2599d0b2c110c440f769db98e66ecf0 |
Hashes for scrapple-0.2.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e2efa6db753a58c1bbb436e6a61f84c6cb71dc0c9605bc2144d9aeb7322ba5ef |
|
MD5 | 66514424cd313c990a4afd2da8b8fef0 |
|
BLAKE2b-256 | 77db08686128f43a1364d753fa2dff800600f20c07f7f3c41c3e972ade3792be |