Skip to main content

Distribution Support for Scrapy & Gerapy using RabbitMQ

Project description

Gerapy RabbitMQ

This is a package for supporting distribution in Scrapy using RabbitMQ, also this package is a module in Gerapy.

Installation

You can install with this command:

pip3 install gerapy-rabbitmq

Usage

Required configuration:

# Use RabbitMQ for queue
SCHEDULER = "gerapy_rabbitmq.scheduler.Scheduler"
SCHEDULER_QUEUE_KEY = '%(spider)s_requests'

# RabbitMQ Connection Parameters, see https://pika.readthedocs.io/en/stable/modules/parameters.html
RABBITMQ_CONNECTION_PARAMETERS = {
    'host': 'localhost'
}

# Use Redis for dupefilter
DUPEFILTER_CLASS = "gerapy_redis.dupefilter.RFPDupeFilter"
SCHEDULER_DUPEFILTER_KEY = '%(spider)s:dupefilter'

Optional configuration:

# RabbitMQ Queue Configuration
SCHEDULER_QUEUE_DURABLE = True
SCHEDULER_QUEUE_MAX_PRIORITY = 100
SCHEDULER_QUEUE_PRIORITY_OFFSET = 30
SCHEDULER_QUEUE_FORCE_FLUSH = True
SCHEDULER_PERSIST = False
SCHEDULER_IDLE_BEFORE_CLOSE = 0
SCHEDULER_FLUSH_ON_START = False
SCHEDULER_PRE_ENQUEUE_ALL_START_REQUESTS = True

More

For more detail, you can refer to example.

RabbitMQ Preview

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

gerapy-rabbitmq-0.1.1.tar.gz (6.3 kB view details)

Uploaded Source

Built Distribution

gerapy_rabbitmq-0.1.1-py2.py3-none-any.whl (6.7 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file gerapy-rabbitmq-0.1.1.tar.gz.

File metadata

  • Download URL: gerapy-rabbitmq-0.1.1.tar.gz
  • Upload date:
  • Size: 6.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.13

File hashes

Hashes for gerapy-rabbitmq-0.1.1.tar.gz
Algorithm Hash digest
SHA256 1e944ee404c1ffe18c2f324a7438e2678311c1b496ac1f0da739711c1adea3f9
MD5 16b05da3b759b741407f839d6507bf28
BLAKE2b-256 7f05f2796ea580d466aab79f9836ad59a61d61e1ea152d9650b920f7ba12f789

See more details on using hashes here.

File details

Details for the file gerapy_rabbitmq-0.1.1-py2.py3-none-any.whl.

File metadata

File hashes

Hashes for gerapy_rabbitmq-0.1.1-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 b9a8b1131e5e48f2167c7282162e2f70c1e4576474eab7af81165075bb5a3663
MD5 84db7fd053acc9e55db094c3b529f81f
BLAKE2b-256 61ee7bc368c43e1eceff6a43fa11c89357c2dca12bf3d55ce216e7834daa6d04

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page