Scrapy Item Record Extension
os-scrapy-record
This project provides extensions that process a Response or Failure and generate a standard Item.
Install
pip install os-scrapy-record
You can run the example spider directly from the project root path:
scrapy crawl example
APIs
- os_scrapy_record.ResponseCallback
  - the callback method of this extension replaces the default Request.callback, processes the Response and generates a FetchRecord
  - the callback method does not take effect when the request has already set a callback function
  - the callback method overrides the parse method of the spider
  - enable the extension in the project settings.py file:
    EXTENSIONS = { "os_scrapy_record.ResponseCallback": 1, }
- os_scrapy_record.ResponseErrback
  - the errback method of this extension replaces the default Request.errback, processes the Failure and generates a FetchRecord
  - the errback method does not take effect when the request has already set an errback function
  - enable the extension in the project settings.py file:
    EXTENSIONS = { "os_scrapy_record.ResponseErrback": 1, }
- os_scrapy_record.FetchRecord
  This class is a subclass of Item. Its members are:
  - request: os_scrapy_record.items.RequestItem, members: url, method, headers, body
  - meta: dict, request.meta; it is better to use lowercase keys with '_' as the separator
  - response: os_scrapy_record.items.ResponseItem, members: headers, body, status, ip_address (Scrapy 2.1.0+), failure
- os_scrapy_record.fetch_status.FetchStatus
  A member of ResponseItem that covers HTTP, DNS, network and user-defined statuses. It is a two-tuple object: group and code, e.g. HTTP:200, DNS:-2, SERVER:111, RULE:16
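The behaviour described in the list above can be modelled with plain Python stand-ins. The sketch below is only an illustration and does not use the package or Scrapy: Status, FakeRequest, build_record, fill_defaults, default_callback and default_errback are hypothetical names, and the real FetchRecord and FetchStatus classes may be implemented differently. It shows two ideas: a default callback/errback is applied only when the request has none of its own, and a status is a (group, code) pair that prints as group:code.

```python
from dataclasses import dataclass, field
from typing import Callable, NamedTuple, Optional


class Status(NamedTuple):
    """Hypothetical stand-in for a (group, code) fetch status."""
    group: str
    code: int

    def __str__(self) -> str:
        # Render the pair in the "GROUP:code" form, e.g. HTTP:200
        return f"{self.group}:{self.code}"


@dataclass
class FakeRequest:
    """Hypothetical stand-in for a request; this is not scrapy.Request."""
    url: str
    method: str = "GET"
    meta: dict = field(default_factory=dict)
    callback: Optional[Callable] = None
    errback: Optional[Callable] = None


def build_record(request: FakeRequest, status: Status, body: bytes = b"") -> dict:
    """Assemble a plain dict with the three top-level members listed above."""
    return {
        "request": {"url": request.url, "method": request.method},
        "meta": dict(request.meta),
        "response": {"status": status, "body": body},
    }


def default_callback(request: FakeRequest, http_code: int) -> dict:
    """Assumed success path: wrap an HTTP status code in a record."""
    return build_record(request, Status("HTTP", http_code))


def default_errback(request: FakeRequest, group: str, code: int) -> dict:
    """Assumed failure path: wrap a non-HTTP status (e.g. DNS) in a record."""
    return build_record(request, Status(group, code))


def fill_defaults(request: FakeRequest) -> FakeRequest:
    """Set callback/errback only where the request has none of its own."""
    if request.callback is None:
        request.callback = default_callback
    if request.errback is None:
        request.errback = default_errback
    return request


# A request without handlers receives both defaults.
plain = fill_defaults(FakeRequest("http://example.com/", meta={"task_id": 1}))
ok_record = plain.callback(plain, 200)
err_record = plain.errback(plain, "DNS", -2)
print(ok_record["response"]["status"], err_record["response"]["status"])


# A request that already set a callback keeps it; only the errback is filled.
def custom_callback(request, http_code):
    return "handled by the spider"


custom = fill_defaults(FakeRequest("http://example.com/", callback=custom_callback))
print(custom.callback is custom_callback, custom.errback is default_errback)
```

Keeping the status as a small tuple rather than a bare integer lets HTTP codes, DNS errors and user-defined rules share one field without colliding, which matches the group and code description above.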
Unit Tests
sh scripts/test.sh
License
MIT licensed.