Business-layer wrappers for the crawler system

Project description

Common class wrappers for the crawler system

Goals

  • Redis db allocation
  • URL record pool wrapper
  • Downloaded page pool wrapper
  • Parse result pool wrapper
  • Website information class wrapper
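
As an illustration of what a pool wrapper of this kind could look like, here is a minimal sketch of a URL record pool backed by a Redis hash. The class names `UrlRecordPool` and `InMemoryHashClient`, the method names, and the JSON record format are all assumptions made for this example; they are not taken from the package's actual API. The in-memory client only mimics the `hset`/`hget`/`hexists`/`hlen` calls so the sketch runs without a Redis server.

```python
import json
from typing import Dict, Optional


class InMemoryHashClient:
    """Hypothetical stand-in for the hset/hget/hexists/hlen subset of a Redis client."""

    def __init__(self) -> None:
        self._hashes: Dict[str, Dict[str, str]] = {}

    def hset(self, name: str, key: str, value: str) -> int:
        # Like Redis HSET, return 1 when the field is new and 0 when it is overwritten.
        bucket = self._hashes.setdefault(name, {})
        created = 0 if key in bucket else 1
        bucket[key] = value
        return created

    def hget(self, name: str, key: str) -> Optional[str]:
        return self._hashes.get(name, {}).get(key)

    def hexists(self, name: str, key: str) -> bool:
        return key in self._hashes.get(name, {})

    def hlen(self, name: str) -> int:
        return len(self._hashes.get(name, {}))


class UrlRecordPool:
    """Hypothetical URL record pool: one hash field per URL, value stored as JSON."""

    def __init__(self, client, hash_name: str = "url_record") -> None:
        self._client = client
        self._hash_name = hash_name

    def add(self, url: str, record: dict) -> bool:
        """Store the record for a URL; return True if the URL was not seen before."""
        return self._client.hset(self._hash_name, url, json.dumps(record)) == 1

    def get(self, url: str) -> Optional[dict]:
        """Return the stored record, or None if the URL is unknown."""
        raw = self._client.hget(self._hash_name, url)
        return None if raw is None else json.loads(raw)

    def __contains__(self, url: str) -> bool:
        return bool(self._client.hexists(self._hash_name, url))

    def __len__(self) -> int:
        return self._client.hlen(self._hash_name)


pool = UrlRecordPool(InMemoryHashClient())
pool.add("https://example.com/a", {"status": "pending"})
print("https://example.com/a" in pool, len(pool))
```

A real Redis client object exposing the same four methods could presumably be passed in place of the in-memory stand-in; the downloaded page pool and parse result pool could follow the same shape with a different hash name.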

Redis db allocation (0-15)

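- 0 -> commonly used queues, sorted sets, and sets
Hash tables defined in this db:
    - website -> website information
    - url_record -> URL record pool
    - url_page -> downloaded page pool
    - parse_result -> parse result pool
- 10-15 -> reserved for monitoring, which manages these dbs itself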
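
The allocation above can be written down as constants, which keeps db numbers and hash names out of scattered string literals. This is only a sketch: the names `COMMON_DB`, `MONITOR_DBS`, `HASH_NAMES`, and `describe_db` are hypothetical, and treating dbs 1-9 as "unassigned" is an assumption, since the list above does not mention them.

```python
from typing import Dict

# db 0 holds the common queues, sorted sets, sets, and the hashes listed below.
COMMON_DB = 0

# dbs 10-15 are left to monitoring.
MONITOR_DBS = range(10, 16)

# Hash names defined in db 0, mapped to a short description.
HASH_NAMES: Dict[str, str] = {
    "website": "website information",
    "url_record": "URL record pool",
    "url_page": "downloaded page pool",
    "parse_result": "parse result pool",
}


def describe_db(index: int) -> str:
    """Return the role of a Redis db index under the allocation above."""
    if not 0 <= index <= 15:
        raise ValueError(f"db index must be in 0-15, got {index}")
    if index == COMMON_DB:
        return "common"
    if index in MONITOR_DBS:
        return "monitoring"
    # Assumption: dbs 1-9 are not described in the allocation, so report them as unassigned.
    return "unassigned"


for db in (0, 5, 12):
    print(db, describe_db(db))
```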

Download files

Source Distribution

qg_spider_sdk-5.2.9.tar.gz (21.4 kB)

Uploaded Source

Built Distribution

qg_spider_sdk-5.2.9-py3-none-any.whl (30.3 kB)

Uploaded Python 3
