A framework for creating web content extractors
Project description
Scrapple is a framework for creating web scrapers and web crawlers according to a key-value based configuration file. It provides a command line interface to run the script on a given JSON-based configuration input, as well as a web interface to provide the necessary input.
You can install Scrapple by using
$ sudo apt-get install libxml2-dev libxslt-dev python-dev lib32z1-dev $ pip install scrapple
You can read the complete documentation.
History
0.2.2 - 2015-02-22
Fix bug in generate script template
0.2.1 - 2015-02-21
Update tests
0.2.0 - 2015-02-20
Include implementation for scrapple run and scrapple generate for crawlers
Modify web interface for editing scraper config files
Revise skeleton configuration files
0.1.1 - 2015-02-10
Release on PyPI with revisions
Include web interface for editing scraper config files
Modified implementations of certain functions
0.1.0 - 2015-02-04
First release on PyPI
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
Built Distributions
Hashes for scrapple-0.2.2.linux-i686.tar.gz
Algorithm | Hash digest | |
---|---|---|
SHA256 | b0b6ab4ab31553fda1c1333d8b07f1946f5c0a40c940023cd9219c9930e15d23 |
|
MD5 | 8d7fb029d633f1a72b9964a620b7ca0e |
|
BLAKE2b-256 | a41c8c75185c04f5ae4e12e5e55da8a4d1d5b83858ba7ac2b6c9820d9de39baa |
Hashes for scrapple-0.2.2-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4d4c79545529e934f3e1cfdc7f583255f30232d2df9e893b49c8f9ebcb5debc8 |
|
MD5 | a1a4ee994db42475c20a47ddbef51271 |
|
BLAKE2b-256 | 9dfc55237bdd9c3992f6562754e8b7277220a9eafaae7739a187f88dead984cb |