Skip to main content

A Simple Web Crawling and Web Scraping framework

Project description

Crwy

PyPI Version Download Status Build Status License Status

简介

Crwy是一个轻量级的爬虫抓取框架,参考Scrapy框架结构开发而来。该框架提供了实用的爬虫模板,旨在帮助大家快速实现爬虫任务,高效开发。并为scrapy使用者提供通用轮子.。新增了gevent,使爬虫异步执行,速度更快。

运行环境

  • Python2 & Python3

  • Works on Linux, Mac OSX

依赖包

  • beautifulsoup4>=4.5.1

  • requests>=2.20.0

  • configparser>=3.5.0

  • SQLAlchemy>=1.0.14

  • pyssdb>=0.1.2

  • redis>=2.10.5,<3.0.0

  • gevent>=1.2.1

  • retrying>=1.3.3

  • imapclient>=2.0.0

安装

快速安装

pip install crwy

or 前往下载: https://pypi.python.org/pypi/Crwy/

使用手册

在这里: http://wuyue92tree.antio.top/opensource/crwy.html

友情链接

更新日志

http://wuyue92tree.antio.top/opensource/crwy.html#更新日志

TODO

  • 完善scrapy_plugs

  • 完善selenium_api

  • 兼容python3

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

Crwy-1.5.7.tar.gz (30.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

Crwy-1.5.7-py2.7.egg (110.7 kB view details)

Uploaded Egg

File details

Details for the file Crwy-1.5.7.tar.gz.

File metadata

  • Download URL: Crwy-1.5.7.tar.gz
  • Upload date:
  • Size: 30.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.6.1 requests/2.24.0 setuptools/44.1.1 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/2.7.15

File hashes

Hashes for Crwy-1.5.7.tar.gz
Algorithm Hash digest
SHA256 df01e75696ab4f988d985b11e9f23bd2976dd41e094e9030c5d2aa7eb306f4da
MD5 e1382f1b0108679f4ac8905cacb43228
BLAKE2b-256 606931e01f84deb238d43f03e5136d102178a18f5bef92ffc18a79962c67984d

See more details on using hashes here.

File details

Details for the file Crwy-1.5.7-py2.7.egg.

File metadata

  • Download URL: Crwy-1.5.7-py2.7.egg
  • Upload date:
  • Size: 110.7 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.15.0 pkginfo/1.6.1 requests/2.24.0 setuptools/44.1.1 requests-toolbelt/0.9.1 tqdm/4.51.0 CPython/2.7.15

File hashes

Hashes for Crwy-1.5.7-py2.7.egg
Algorithm Hash digest
SHA256 476330ad7caed35ec84eab5944d38baab6b3aba601266323dd892aa6f8abde99
MD5 8f2fc5ac605ff805fa8712c4cd40e069
BLAKE2b-256 90902c2fb19ef4b0e32c13d296ab4a31ece2427d16911f8838b268ae663d66f2

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page