A framework for creating web content extractors
Project description
Scrapple is a framework for creating web scrapers and web crawlers according to a key-value based configuration file. It provides a command line interface to run the script on a given JSON-based configuration input, as well as a web interface to provide the necessary input.
You can install Scrapple by using
$ sudo apt-get install libxml2-dev libxslt-dev python-dev lib32z1-dev $ pip install scrapple
You can read the complete documentation.
History
0.2.1 - 2015-02-21
Update tests
0.2.0 - 2015-02-20
Include implementation for scrapple run and scrapple generate for crawlers
Modify web interface for editing scraper config files
Revise skeleton configuration files
0.1.1 - 2015-02-10
Release on PyPI with revisions
Include web interface for editing scraper config files
Modified implementations of certain functions
0.1.0 - 2015-02-04
First release on PyPI
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Hashes for scrapple-0.2.1.linux-i686.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | 47168afd5753c99960d7819b6d94421ccf583c75033c633bcc3d10416a5f094b |
|
MD5 | 23db31515ccb8e50485a9c92bc493b03 |
|
BLAKE2b-256 | d40c9515548773d5c4bafdfa000c0529318d6a1a2de8f3a9e69ea0d4fa8e93f6 |
Hashes for scrapple-0.2.1-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 2baec252c88facf6e9dc5e3f4c466172e238d212e69f5b3d03fa50ba973c554a |
|
MD5 | 030c8bc24a75c38d4f714f5d29df4c10 |
|
BLAKE2b-256 | 8c3f7997529cdbe4f9108aeb50bd206d64f5cdd0f8bfeaf0f498ad3cec8eaa76 |