A Simple Web Crawling and Web Scraping framework

# Crwy

[![PyPI Version](https://img.shields.io/pypi/v/Crwy.svg)](https://pypi.python.org/pypi/Crwy) [![Build Status](https://travis-ci.org/wuyue92tree/crwy.svg?branch=1.5.0)](https://travis-ci.org/wuyue92tree/crwy)

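# Introduction

Crwy is a lightweight web crawling and scraping framework modeled on the structure of Scrapy. It provides practical crawler templates intended to help you implement crawling tasks quickly and develop efficiently. It also offers general-purpose reusable components for Scrapy users ^.^. gevent support has been added so that crawlers run asynchronously and faster.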

# Runtime environment

  • Python2 & Python3

  • Works on Linux, Mac OSX

# Dependencies

  • beautifulsoup4>=4.5.1

  • requests>=2.20.0

  • configparser>=3.5.0

  • SQLAlchemy>=1.0.14

  • pyssdb>=0.1.2

  • redis>=2.10.5,<3.0.0

  • gevent>=1.2.1

  • retrying>=1.3.3

  • imapclient>=2.0.0

# Installation

Quick install: `pip install crwy`

Or download it from: https://pypi.python.org/pypi/Crwy/1.5.0/

# User guide

Available here: http://wuyue92tree.antio.top/opensource/crwy.html

# Related links

# Changelog

http://wuyue92tree.antio.top/opensource/crwy.html#更新日志

# TODO

  • Improve scrapy_plugs

  • Improve selenium_api

  • Python 3 compatibility

Download files

Source Distribution

Crwy-1.5.0.tar.gz (30.2 kB)

Uploaded Source

File details

Details for the file Crwy-1.5.0.tar.gz.

File metadata

  • Download URL: Crwy-1.5.0.tar.gz
  • Size: 30.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.12.1 pkginfo/1.4.2 requests/2.20.1 setuptools/39.0.1 requests-toolbelt/0.8.0 tqdm/4.28.1 CPython/3.6.6

File hashes

Hashes for Crwy-1.5.0.tar.gz

| Algorithm | Hash digest |
| --- | --- |
| SHA256 | 286cf578c6f46224c395b0de6b4aa04f415d595306a43dac113e49c7373b1c0a |
| MD5 | d3abb407dc9792c8f1abb0c38be62264 |
| BLAKE2b-256 | 9e2e3525a507a8529e688acbdeae424b68e3fa6c7b44bd2dcc85d7ae95260dae |
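
If you download the source archive manually, you may want to check it against the SHA256 digest listed above. The following is a minimal sketch using only the Python standard library. The helper names `sha256_of` and `matches_expected` and the local path `Crwy-1.5.0.tar.gz` are assumptions made for illustration and are not part of Crwy.

```python
import hashlib
from pathlib import Path

# SHA256 digest listed above for Crwy-1.5.0.tar.gz.
EXPECTED_SHA256 = "286cf578c6f46224c395b0de6b4aa04f415d595306a43dac113e49c7373b1c0a"


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of the file at `path`, read in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read fixed-size chunks so large files are not loaded into memory at once.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals `expected` (case-insensitive)."""
    return sha256_of(path) == expected.lower()


if __name__ == "__main__":
    # Hypothetical location of the downloaded archive; adjust as needed.
    archive = Path("Crwy-1.5.0.tar.gz")
    if archive.exists():
        print("OK" if matches_expected(archive) else "MISMATCH")
    else:
        print("archive not found:", archive)
```

pip can perform a similar check itself when installing from a requirements file that pins hashes (`--require-hashes`), which may be preferable to a manual check.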
