在scrapy的基础上进行修改,使用了异步请求,加快采集速度,并与rabbitmq结合,实现了断点续爬,与防止漏采
Project description
Scrapy_Rabbit
Overview
将Scrapy与RabbitMQ进行结合,解决断点续爬,采集进度可视化并且提高代码复用性
Requirements
- Python 3.6+
- Works on Linux, Windows, macOS, BSD
Install
The quick way::
pip install scrapy-rabbit
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
scrapy_rabbit-0.2.4.tar.gz
(33.7 kB
view hashes)
Built Distribution
Close
Hashes for scrapy_rabbit-0.2.4-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | f34aa1f8ebba0bdf0d8445f9867b89ea3e36872623d9e2b62badb3d0413f0030 |
|
MD5 | 68a296d1dde530d24d7da901443035eb |
|
BLAKE2b-256 | f0a6a1d1e1888f6e4bd3c157b7d3042de64269ebb511286cf57f7dab30c55eb1 |