Scrapyd launcher module that schedules Scrapy spiders by time

Install

$ pip install scrapyduler

Config

To start using this library you just need to override the launcher option in your scrapyd.conf file:

[scrapyd]
launcher = scrapyduler.launcher.Launcher

Then add one section per scheduler, for example:

[scheduler.1]
cron        = * * * * *
project     = quotesbot
spider      = toscrape-xpath
settings    = HTTPPROXY_ENABLED=True
args        = key1=value1 key2=value2 start_url=http://quotes.toscrape.com/

[scheduler.2]
interval    = weeks=0 days=0 hours=0 minutes=0 seconds=30
project     = quotesbot
spider      = toscrape-css
settings    = HTTPPROXY_ENABLED=True
args        = key1=value1 key2=value2 start_url=http://quotes.toscrape.com/
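To make the layout of these sections concrete, here is a minimal sketch that reads one of them with Python's standard configparser module. It is not scrapyduler's own parsing code: the helper names parse_pairs and read_schedulers are hypothetical, and the rule of splitting settings and args on whitespace and then on the first "=" is an assumption inferred from the sample values.

```python
import configparser

# Sample copied from the configuration above.
SAMPLE = """
[scrapyd]
launcher = scrapyduler.launcher.Launcher

[scheduler.1]
cron        = * * * * *
project     = quotesbot
spider      = toscrape-xpath
settings    = HTTPPROXY_ENABLED=True
args        = key1=value1 key2=value2 start_url=http://quotes.toscrape.com/
"""


def parse_pairs(text):
    """Split 'k1=v1 k2=v2' on whitespace, then on the first '=' only (assumed rule)."""
    pairs = {}
    for token in text.split():
        key, _, value = token.partition("=")
        pairs[key] = value
    return pairs


def read_schedulers(conf_text):
    """Collect every [scheduler.N] section into a plain dict (hypothetical helper)."""
    parser = configparser.ConfigParser(interpolation=None)
    parser.read_string(conf_text)
    jobs = {}
    for name in parser.sections():
        if not name.startswith("scheduler."):
            continue  # skip [scrapyd] and any unrelated sections
        section = parser[name]
        jobs[name] = {
            "trigger": "cron" if "cron" in section else "interval",
            "schedule": section.get("cron") or section.get("interval"),
            "project": section["project"],
            "spider": section["spider"],
            "settings": parse_pairs(section.get("settings", "")),
            "args": parse_pairs(section.get("args", "")),
        }
    return jobs


jobs = read_schedulers(SAMPLE)
print(jobs["scheduler.1"]["args"]["start_url"])
```

Splitting on the first "=" only keeps values such as URLs intact even if they contain "=" themselves.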

The configuration above defines two schedulers. The first uses cron syntax; the expression * * * * * fires every minute. The second fires at a fixed interval, here every 30 seconds. It starts at start_date if one is specified, and at datetime.now() + interval otherwise. See https://github.com/agronholm/apscheduler for more information.
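The interval rule can be illustrated with the standard library alone. The sketch below is a simplified model, not APScheduler's or scrapyduler's implementation: parse_interval and first_run are hypothetical helpers, and the handling of a start_date that lies in the past is deliberately left out.

```python
from datetime import datetime, timedelta


def parse_interval(text):
    """Turn 'weeks=0 days=0 hours=0 minutes=0 seconds=30' into a timedelta."""
    fields = {}
    for token in text.split():
        unit, _, amount = token.partition("=")
        fields[unit] = int(amount)
    # timedelta accepts weeks, days, hours, minutes and seconds as keywords.
    return timedelta(**fields)


def first_run(interval, now, start_date=None):
    """First fire time: start_date if given, otherwise now + interval."""
    return start_date if start_date is not None else now + interval


interval = parse_interval("weeks=0 days=0 hours=0 minutes=0 seconds=30")
now = datetime(2024, 1, 1, 12, 0, 0)
print(first_run(interval, now))  # 2024-01-01 12:00:30
```

Without a start_date, the first run is therefore one full interval after the scheduler starts, not immediately.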
