Skip to main content

Scrapyd launcher module that schedules scrapy spiders by time

Project description

Scrapyd launcher module that schedules scrapy spiders by time.

Install

$ pip install scrapyduler

Config

To start using this library you just need to override the launcher option in your scrapyd.conf file:

[scrapyd]
launcher = scrapyduler.launcher.Launcher

and then add the schedulers configuration, e.g.:

[scheduler.1]
cron        = * * * * *
project     = quotesbot
spider      = toscrape-xpath
settings    = HTTPPROXY_ENABLED=True
args        = key1=value1 key2=value2 start_url=http://quotes.toscrape.com/

[scheduler.2]
interval    = weeks=0 days=0 hours=0 minutes=0 seconds=30
project     = quotesbot
spider      = toscrape-css
settings    = HTTPPROXY_ENABLED=True
args        = key1=value1 key2=value2 start_url=http://quotes.toscrape.com/

In the examples above, we set up two schedulers. The first scheduler uses cron syntax to run spiders. The second scheduler triggers on specified intervals, starting on start_date if specified, datetime.now() + interval otherwise. See https://github.com/agronholm/apscheduler for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scrapyduler-1.0.2.tar.gz (3.0 kB view details)

Uploaded Source

Built Distribution

scrapyduler-1.0.2-py3-none-any.whl (3.1 kB view details)

Uploaded Python 3

File details

Details for the file scrapyduler-1.0.2.tar.gz.

File metadata

  • Download URL: scrapyduler-1.0.2.tar.gz
  • Upload date:
  • Size: 3.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for scrapyduler-1.0.2.tar.gz
Algorithm Hash digest
SHA256 f9a434e95566330f446480671d78f0429e44da62097a6df98e59ce9a34bf9d7f
MD5 cd81b6e27d6b2ef5f84fd7e6110c40fd
BLAKE2b-256 d620ce92ce27c29f09588be699a7a5bbafba5ff0eaf838c9ff97ad7adaa968b9

See more details on using hashes here.

File details

Details for the file scrapyduler-1.0.2-py3-none-any.whl.

File metadata

  • Download URL: scrapyduler-1.0.2-py3-none-any.whl
  • Upload date:
  • Size: 3.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.18

File hashes

Hashes for scrapyduler-1.0.2-py3-none-any.whl
Algorithm Hash digest
SHA256 4b00611fb6a1b142f5cfa356b6702857603d2032127023b23f3356d016d9ea35
MD5 f14ea0c83e1a878ba7c9a1b4ad7b7394
BLAKE2b-256 3804e2e7a108c8e191349aa3bb7f0b8d54373ba36cc7f1285cfe0c5790a292b8

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page