A mini spider framework, Integrate aiohttp into scrapy
Project description
Python async library for web scraping PyPI version License: MIT
Build Status codecov codebeat badge Codacy Badge
Installing pip install aioscrapy_redis Usage Plain text scraping
from aioscrapy_redis.core.spider import Spider
from aioscrapy_redis.https.request import Request
import re
from urllib.parse import unquote
""" The start url can be placed in start_urls or written to the redis queue """
class Async_Spider(Spider):
name = 'aioscrapy_spider'
redis_key = 'aioscrapy_spider:url'
start_urls = []
def parse(self, response):
pass
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
aioscrapy_redis-0.1.tar.gz
(32.5 kB
view details)
Built Distribution
File details
Details for the file aioscrapy_redis-0.1.tar.gz
.
File metadata
- Download URL: aioscrapy_redis-0.1.tar.gz
- Upload date:
- Size: 32.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.23.0 setuptools/53.0.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d33548d87109d3782e123929d4dd2077247a5aadc4a910bf462fd64990d1247f |
|
MD5 | 1760b2b420efb9ff0af1cdc09e68d430 |
|
BLAKE2b-256 | dbe4a27e209a7c022b600922ca623a15047eb19b20e575f9ebbcb9d8b0b7491d |
File details
Details for the file aioscrapy_redis-0.1-py3-none-any.whl
.
File metadata
- Download URL: aioscrapy_redis-0.1-py3-none-any.whl
- Upload date:
- Size: 42.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.3.0 pkginfo/1.7.0 requests/2.23.0 setuptools/53.0.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.3
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a6ab1fdcbbe79fced5672b673efa5817429dfafab7ddb082eb366dd7553650e5 |
|
MD5 | 7196f5e4982b0f264bbd2c510c8cae5e |
|
BLAKE2b-256 | 0f69ad17f6b29311cdaeede09380f7faeae58450c07d341bdfc71bb9d24766ee |