Python Common Spider for Humans.
Project description
CrawlSpider
CrawlSpider is a simple, yet light, spider library.
import asyncio
from CrawlSpider.Utils.SpiderRequest import spiderRequest
async def get(url):
res = await spiderRequest.get(
url,
form='json'
)
print(res)
if __name__ == '__main__':
asyncio.run(get("https://httpbin.org/get"))
响应结果
{
"status_code": 200,
"content": {
"args": {},
"headers": {
"Accept": "*/*",
"Accept-Encoding": "gzip, deflate",
"Cookie": "_hjAbsoluteSessionInProgress=0; _sp_id.eeee=d332c9c-a67e-4564-80ed-114737664d84",
"Host": "httpbin.org",
"User-Agent": "Mozilla/5.0 (Windows NT 10.0; Win64; x64) AppleWebKit/537.36 (KHTML, like Gecko) Chrome/99.0.4844.51 Safari/537.36",
"X-Amzn-Trace-Id": "Root=1-62c6edbe-3de25d31339352e"
},
"origin": "xxx.xxx.xxx.xxx",
"url": "https://httpbin.org/get"
}
}
CrawlSpider allows you to crawl data from website extremely easily. There’s no need to manually change proxy and request's headers in crawling data
Installing CrawlSpider and Supported Versions
CrawlSpider is available on PyPI:
$ python -m pip install CrawlSpider
CrawlSpider officially supports Python 3.7+.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
CrawlSpider-2.0.8.tar.gz
(52.0 kB
view details)
Built Distribution
File details
Details for the file CrawlSpider-2.0.8.tar.gz
.
File metadata
- Download URL: CrawlSpider-2.0.8.tar.gz
- Upload date:
- Size: 52.0 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | f5f0c4df25a02f9f24db1b310e745dd829c6109ca8eb70c8785a5ee25946d56a |
|
MD5 | 1b49b47235eeb949fd9d722fe44411e8 |
|
BLAKE2b-256 | 99d76bd954a8a5d46d53cc91b576946467288175c137f10d1628080c78681473 |
File details
Details for the file CrawlSpider-2.0.8-py3-none-any.whl
.
File metadata
- Download URL: CrawlSpider-2.0.8-py3-none-any.whl
- Upload date:
- Size: 70.8 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.8.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 758b378a789bd55552b2a036b78610203062a964d445066a9fcd2a8aa97d1462 |
|
MD5 | febf5d8c12757ace377753bd22256b13 |
|
BLAKE2b-256 | 51f591b079727477d6492f514eb77cc9024bdda8716d18fb4f229691abcb408a |