A Web Crawling and Web Scraping framework (crawlframe) for Python.
Project description
CrawlFrame: A Web Crawling framework
Behold, the power of CrawlFrame:
crawlf createapp <appname> -url http://<website>/
crawlf createpro <projectname>
crawlf start app:<name>
crawlf start project
crawlf stop app:<name>
crawlf stop project
crawlf stop <pid>
CrawlFrame officially supports Python 3.5+.
Installation
To install CrawlFrame, simply use pip:
$ pip install crawlframe
Documentation
Under construction...
Changelog
crawlframe-1.0.3
Changed the downloader's concurrency mode.
crawlframe-1.0.2
CRAWLFRAME_SURVIVE_SWITCH = True
CRAWLFRAME_SURVIVE_MAX = int
reload spider
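Set the two parameters CRAWLFRAME_SURVIVE_SWITCH and CRAWLFRAME_SURVIVE_MAX to restart the spider automatically after a given number of requests has been sent, which avoids running out of memory. Fixed the bugs where middle could not be loaded and the logger could not be created. BaseLogger is now a singleton.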
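The restart-after-N-requests behaviour controlled by CRAWLFRAME_SURVIVE_SWITCH and CRAWLFRAME_SURVIVE_MAX can be pictured as a simple request counter. The following is only a sketch of the idea, not CrawlFrame's actual implementation: the two setting names come from this changelog, while `SurviveCounter`, `record_request`, and the limit of 3 are hypothetical and chosen purely for illustration.

```python
# Setting names are taken from the changelog; the values are illustrative.
CRAWLFRAME_SURVIVE_SWITCH = True
CRAWLFRAME_SURVIVE_MAX = 3  # hypothetical limit: reload after 3 requests


class SurviveCounter:
    """Hypothetical helper: counts requests and signals when to reload."""

    def __init__(self, enabled, max_requests):
        self.enabled = enabled
        self.max_requests = max_requests
        self.count = 0

    def record_request(self):
        """Count one request; return True when the spider should be reloaded."""
        if not self.enabled:
            return False
        self.count += 1
        if self.count >= self.max_requests:
            self.count = 0  # start a fresh cycle after signalling a reload
            return True
        return False


counter = SurviveCounter(CRAWLFRAME_SURVIVE_SWITCH, CRAWLFRAME_SURVIVE_MAX)
# Simulate seven requests and record on which of them a reload is signalled.
reload_signals = [counter.record_request() for _ in range(7)]
```

In this sketch a reload is signalled on every third request, and the counter never fires when the switch is off. How the real framework tears down and restarts the spider is not described in this changelog.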
crawlframe-1.0.1
crawlf createapp <appname> -url http://<website>/
crawlf createpro <projectname>
crawlf start app:<name>
crawlf start project
crawlf stop app:<name>
crawlf stop project
crawlf stop <pid>
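In practice, only start and stop are implemented among the commands above, but the framework can already be used in projects. I uploaded it to PyPI and GitHub early to make it easier to install and use myself.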
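Because start and stop are the only commands reported as implemented, a script that drives the CLI may want to reject anything else up front. The sketch below assumes the `crawlf` executable is on the PATH after installation; `crawlf_command` and `run_crawlf` are hypothetical helper names, not part of CrawlFrame, and `app:demo` is an illustrative target.

```python
import subprocess

# Actions reported as implemented in the 1.0.1 notes.
IMPLEMENTED_ACTIONS = ("start", "stop")


def crawlf_command(action, target):
    """Build the argument list for `crawlf <action> <target>`.

    `target` follows the forms listed above: "project", "app:<name>",
    or, for stop, a process id.
    """
    if action not in IMPLEMENTED_ACTIONS:
        raise ValueError("only 'start' and 'stop' are implemented")
    return ["crawlf", action, str(target)]


def run_crawlf(action, target):
    """Run the command; this requires crawlframe to be installed."""
    return subprocess.run(crawlf_command(action, target), check=True)
```

For example, `crawlf_command("start", "app:demo")` yields `["crawlf", "start", "app:demo"]`, and `run_crawlf("stop", "project")` would invoke `crawlf stop project`.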
Download files
Source Distribution
Built Distribution
File details
Details for the file crawlframe-1.0.3.tar.gz.
File metadata
- Download URL: crawlframe-1.0.3.tar.gz
- Upload date:
- Size: 27.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.18.4 setuptools/40.9.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.2
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 619b1431411c9ea76d1b3787ee03fdfd606802b8918f8e374b02378646baa453 |
| MD5 | 8fa65e08bf86c6689373534ad4fb2184 |
| BLAKE2b-256 | 2431886f70a99116dcd9d3376583286574222d97c4e431791c2f826d8fed7e3a |
File details
Details for the file crawlframe-1.0.3-py2.py3-none-any.whl.
File metadata
- Download URL: crawlframe-1.0.3-py2.py3-none-any.whl
- Upload date:
- Size: 46.1 kB
- Tags: Python 2, Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.13.0 pkginfo/1.5.0.1 requests/2.18.4 setuptools/40.9.0 requests-toolbelt/0.9.1 tqdm/4.31.1 CPython/3.6.2
File hashes
| Algorithm | Hash digest |
|---|---|
| SHA256 | 3f3c5feb3b20cb97ed81d4b36eba48a4300fb0b2dc30c4523e370e89e57c2c18 |
| MD5 | d1a2806498636c3864729ed681365b84 |
| BLAKE2b-256 | a3e5c15adce5b73fb9c4b38bd09c07e30e4b1e200946d619acc65bbbe760f2b4 |