Skip to main content

A mini spider framework, Integrate aiohttp into scrapy

Project description

Python async library for web scraping PyPI version License: MIT

Build Status codecov codebeat badge Codacy Badge

Installing pip install aioscrapy_redis Usage Plain text scraping

from aioscrapy_redis.core.spider import Spider

from aioscrapy_redis.https.request import Request

import re

from urllib.parse import unquote

""" The start url can be placed in start_urls or written to the redis queue """

class Async_Spider(Spider):

name = 'aioscrapy_spider'

redis_key = 'aioscrapy_spider:url'

start_urls = []

def parse(self, response):
    pass

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aioscrapy_redis-0.6.tar.gz (34.1 kB view hashes)

Uploaded Source

Built Distribution

aioscrapy_redis-0.6-py3-none-any.whl (43.0 kB view hashes)

Uploaded Python 3

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page