A framework for creating web content extractors
Project description
Scrapple is a framework for creating web scrapers and web crawlers from a key-value based configuration file. It provides a command line interface that runs a scraper or crawler from a given JSON configuration file, as well as a web interface for entering that configuration.
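To illustrate the general idea of driving extraction from a key-value configuration, here is a minimal sketch that uses only the Python standard library. It is not Scrapple's implementation, and the key names (`fields`, `name`, `path`) are hypothetical; they are not taken from Scrapple's configuration schema, which is described in the project documentation. The inline page stands in for a fetched document.

```python
import json
import xml.etree.ElementTree as ET

# Hypothetical configuration: these key names are illustrative only and are
# not Scrapple's documented schema.
CONFIG_JSON = """
{
  "fields": [
    {"name": "title", "path": ".//h1"},
    {"name": "author", "path": ".//span[@class='author']"}
  ]
}
"""

# A small, well-formed page stands in for a downloaded document.
PAGE = """
<html><body>
  <h1>Example post</h1>
  <span class="author">Jane Doe</span>
</body></html>
"""


def extract(page_source, config):
    """Return a dict mapping each configured field name to the matched text."""
    root = ET.fromstring(page_source)
    result = {}
    for field in config["fields"]:
        node = root.find(field["path"])
        # Fields whose path matches nothing are recorded as None.
        if node is not None and node.text:
            result[field["name"]] = node.text.strip()
        else:
            result[field["name"]] = None
    return result


config = json.loads(CONFIG_JSON)
record = extract(PAGE, config)
print(record)
```

The point of the design is that changing what gets extracted means editing the configuration, not the code. Real pages are rarely well-formed XML, so a production tool would rely on a tolerant HTML parser instead of `xml.etree.ElementTree`.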
On Debian-based systems, you can install Scrapple by first installing the system dependencies and then the package itself:
$ sudo apt-get install libxml2-dev libxslt-dev python-dev lib32z1-dev
$ pip install scrapple
You can read the complete documentation.
History
0.2.3 - 2015-03-11
Include implementation to use csv as the output format
0.2.2 - 2015-02-22
Fix bug in generate script template
0.2.1 - 2015-02-21
Update tests
0.2.0 - 2015-02-20
Include implementation for scrapple run and scrapple generate for crawlers
Modify web interface for editing scraper config files
Revise skeleton configuration files
0.1.1 - 2015-02-10
Release on PyPI with revisions
Include web interface for editing scraper config files
Modify implementations of certain functions
0.1.0 - 2015-02-04
First release on PyPI
Hashes for scrapple-0.2.3.linux-i686.tar.gz
Algorithm | Hash digest
---|---
SHA256 | b92a74c1c97b87bf879a98fcb48ad9df1c5247929a69469f228e7196c8de6630
MD5 | afd0704f5bea41b8bcee989a1405021c
BLAKE2b-256 | aaaf3726392d86853ccf6a073ff93edb56d0826915503cac7e63491c6caefbca
Hashes for scrapple-0.2.3-py2.py3-none-any.whl
Algorithm | Hash digest
---|---
SHA256 | d213d1d860027e068b51fd248a512f3333a15004f98893464961d0a06aba1dbd
MD5 | ca031cc1f2f0f140a9d906d2a054eadb
BLAKE2b-256 | 87f8c1496221f49ed3a93d4db15c2c7e3924ff7cdeaf408765a0f8a128a27c8c
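The SHA256 digests listed above can be used to check a downloaded file before installing it. The following is a minimal sketch using Python's standard `hashlib` module; the helper names `sha256_of_file` and `matches_published_hash` are introduced here for illustration and are not part of Scrapple.

```python
import hashlib


def sha256_of_file(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large archives are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of_file(path) == expected_hex.strip().lower()
```

For example, assuming the wheel was saved in the current directory under its published name, `matches_published_hash("scrapple-0.2.3-py2.py3-none-any.whl", "d213d1d860027e068b51fd248a512f3333a15004f98893464961d0a06aba1dbd")` should return `True` for an intact download.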