
Scrapy Inline Requests


A decorator for writing coroutine-like spider callbacks.

Quickstart

The spider below shows a simple use case of scraping a page and following a few links:

from inline_requests import inline_requests
from scrapy import Spider, Request

class MySpider(Spider):
    name = 'myspider'
    start_urls = ['http://httpbin.org/html']

    @inline_requests
    def parse(self, response):
        urls = [response.url]
        for i in range(10):
            next_url = response.urljoin('?page=%d' % i)
            try:
                next_resp = yield Request(next_url)
                urls.append(next_resp.url)
            except Exception:
                self.logger.info("Failed request %s", i, exc_info=True)

        yield {'urls': urls}

See the examples/ directory for a more complex spider.

Warning

The generator resumes its execution only when a request's response is processed. This means the generator will not be resumed after yielding an item or a request with its own callback.
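The resumption rule above can be illustrated with a minimal, hypothetical sketch in plain Python (no Scrapy involved): the `Request`, `Response`, and `drive` names below are illustrative stand-ins, not the library's actual internals. The driver sends a response back into the generator only when it yields a plain request without its own callback; anything else is emitted and the generator is not resumed.

```python
# Hypothetical, simplified driver illustrating the resumption rule:
# only a yielded Request without its own callback resumes the generator.

class Request:
    def __init__(self, url, callback=None):
        self.url = url
        self.callback = callback

class Response:
    def __init__(self, url):
        self.url = url

def drive(gen):
    """Collect emitted objects, resuming gen only for plain Requests."""
    items = []
    try:
        obj = next(gen)
        while True:
            if isinstance(obj, Request) and obj.callback is None:
                # Simulate a download: send the response into the generator,
                # resuming it at the `yield Request(...)` expression.
                obj = gen.send(Response(obj.url))
            else:
                # An item, or a request with its own callback:
                # emit it, but do not resume the generator.
                items.append(obj)
                break
    except StopIteration:
        pass
    return items

def callback():
    resp = yield Request("http://example.com/?page=1")
    yield {"url": resp.url}
    yield {"never": "reached"}  # the generator is not resumed after the item

print(drive(callback()))
```

Running this prints a single item for `?page=1`; the second `yield` is never reached, which is exactly the behavior the warning describes.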

Known Issues

  • Middlewares can drop or ignore non-200 status responses, causing the callback to stop executing. This can be overcome by using the flag handle_httpstatus_all. See the HttpError middleware documentation.
  • High concurrency and large responses can cause higher memory usage.
  • This decorator assumes your method has the signature (self, response).
  • Wrapped requests may not be serializable by persistent backends.
  • Unless you know what you are doing, the decorated method must be a spider method and return a generator instance.
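For the first issue above, the flag is passed through the request's meta dict. A small sketch, assuming Scrapy's standard `handle_httpstatus_all` meta key (the `make_request_meta` helper is hypothetical, for illustration only):

```python
# Sketch: build request meta that asks the HttpError middleware to pass
# non-200 responses through to the callback instead of dropping them,
# so the decorated generator can inspect them and keep running.

def make_request_meta(handle_all=True):
    """Return a meta dict enabling delivery of all HTTP status codes."""
    return {"handle_httpstatus_all": handle_all}

# Inside a decorated callback this would look roughly like:
#   resp = yield Request(next_url, meta=make_request_meta())
#   if resp.status != 200:
#       self.logger.info("Got %d for %s", resp.status, resp.url)

print(make_request_meta())
```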

History

0.3.1 (2016-07-04)

  • Added deprecation about decorating non-spider functions.
  • Warn if the callback returns requests with callback or errback set. This reverts compatibility with requests that have their own callbacks.

0.3.0 (2016-06-24)

  • ~~Backward incompatible change: Added more restrictions to the request object (no callback/errback).~~
  • Cleanup callback/errback attributes before sending back the request to the generator. This fixes an edge case when using request.replace().
  • Simplified example spider.

0.2.0 (2016-06-23)

  • Python 3 support.

0.1.2 (2016-05-22)

  • Scrapy API and documentation updates.

0.1.1 (2013-02-03)

  • Minor tweaks and fixes.

0.1.0 (2012-02-03)

  • First release on PyPI.

