Skip to main content

easy spider

Project description

Easy Sipder


Easy Spider 主要有四大模块:

  • Spider 负责推送请求到请求线程池
  • Downloader 负责启动请求与数据,请求在启动前会经过请求处理程序,响应在下载后会经过响应处理程序
  • Pipeline 负责清理数据,数据的持久化等工作

流程图如下

epsider流程图


TODO

  • 2020-04-06

    • 修复 start_requests 错误提示
    • 自动设置请求优先级
    • 请求和响应扩展合并为下载中间件
    • settings 像 scrapy 看齐
  • 2020-04-07

    • 优化 setting
    • 下载器开始停止问题

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

espider-1.7.3.tar.gz (46.8 kB view hashes)

Uploaded Source

Built Distribution

espider-1.7.3-py2.py3-none-any.whl (51.2 kB view hashes)

Uploaded Python 2 Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page