A Web Crawling and Web Scraping framework (crawlframe) for Python.
Project description
CrawlFrame: A Web Crawling framework
Behold, the power of CrawlFrame:
crawlf createapp <appname> -url http://<website>/
crawlf createpro <projectname>
crawlf start app:<name>
crawlf start project
crawlf stop app:<name>
crawlf stop project
crawlf stop <pid>
CrawlFrame officially supports python3.5+ .
Installation
To install CrawlFrame, simply use pip:
$ pip install crawlframe
Documentation
In the construction...
Changed
crawlframe-1.0.3
下载器并发方式变更
crawlframe-1.0.2
CRAWLFRAME_SURVIVE_SWITCH = True
CRAWLFRAME_SURVIVE_MAX = int
reload spider
CRAWLFRAME_SURVIVE_SWITCH = True CRAWLFRAME_SURVIVE_MAX = int 设置两个参数 发出一定请求数之后, 自动重启爬虫. 避免内存溢出. 修复middle不能加载以及logger不能创建的bug. BaseLogger改为单例模式.
crawlframe-1.0.1
crawlf createapp <appname> -url http://<website>/
crawlf createpro <projectname>
crawlf start app:<name>
crawlf start project
crawlf stop app:<name>
crawlf stop project
crawlf stop <pid>
实际上以上命令只实现了start 与 stop. 但是已经能够在项目中使用了. 为了方便自己安装使用, 就先上传到 pypi 与 github 了.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
crawlframe-1.0.3.tar.gz
(27.4 kB
view details)
Built Distribution
File details
Details for the file crawlframe-1.0.3.tar.gz
.
File metadata
- Download URL: crawlframe-1.0.3.tar.gz
- Upload date:
- Size: 27.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.18.4 setuptools/40.9.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 619b1431411c9ea76d1b3787ee03fdfd606802b8918f8e374b02378646baa453 |
|
MD5 | 8fa65e08bf86c6689373534ad4fb2184 |
|
BLAKE2b-256 | 2431886f70a99116dcd9d3376583286574222d97c4e431791c2f826d8fed7e3a |
File details
Details for the file crawlframe-1.0.3-py2.py3-none-any.whl
.
File metadata
- Download URL: crawlframe-1.0.3-py2.py3-none-any.whl
- Upload date:
- Size: 46.1 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.18.4 setuptools/40.9.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.2
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3f3c5feb3b20cb97ed81d4b36eba48a4300fb0b2dc30c4523e370e89e57c2c18 |
|
MD5 | d1a2806498636c3864729ed681365b84 |
|
BLAKE2b-256 | a3e5c15adce5b73fb9c4b38bd09c07e30e4b1e200946d619acc65bbbe760f2b4 |