Skip to main content

Scrapyd launcher module that schedules scrapy spiders by time

Project description

scrapyduler

version pyversions actions codecov

Scrapyd launcher module that schedules scrapy spiders by time.

Install

pip install scrapyduler

Config

To start using this library you just need to override the launcher option in your scrapyd.conf file:

[scrapyd]
launcher = scrapyduler.launcher.Launcher

and then add the schedulers configuration, e.g.:

[scheduler.1]
cron        = * * * * *
project     = quotesbot
spider      = toscrape-xpath
settings    = HTTPPROXY_ENABLED=True
args        = key1=value1 key2=value2 start_url=http://quotes.toscrape.com/

[scheduler.2]
interval    = weeks=0 days=0 hours=0 minutes=0 seconds=30
project     = quotesbot
spider      = toscrape-css
settings    = HTTPPROXY_ENABLED=True
args        = key1=value1 key2=value2 start_url=http://quotes.toscrape.com/

In the examples above, we set up two schedulers. The first scheduler uses cron syntax to run spiders. The second scheduler triggers on specified intervals, starting on start_date if specified, datetime.now() + interval otherwise. See https://github.com/agronholm/apscheduler for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scrapyduler-2.0.0.tar.gz (4.8 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scrapyduler-2.0.0-py3-none-any.whl (3.4 kB view details)

Uploaded Python 3

File details

Details for the file scrapyduler-2.0.0.tar.gz.

File metadata

  • Download URL: scrapyduler-2.0.0.tar.gz
  • Upload date:
  • Size: 4.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for scrapyduler-2.0.0.tar.gz
Algorithm Hash digest
SHA256 908f50d58af6afa351b8f0fbf50dd58ee222edd01a9c76ad1a61aa79a53a1dc1
MD5 c24c65b3f3306b209ef29489e996ca2c
BLAKE2b-256 f9046bc5b72d846fa0b064a4f3a4ce91264de5b2f0d9a02d07e695c77de94a26

See more details on using hashes here.

File details

Details for the file scrapyduler-2.0.0-py3-none-any.whl.

File metadata

  • Download URL: scrapyduler-2.0.0-py3-none-any.whl
  • Upload date:
  • Size: 3.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for scrapyduler-2.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6073ad2a9ad656c224da7a5dea0ff8de0f16a1ba34e95f5e78047823ba2d13a2
MD5 35366730e9b62491f08ad8246ad50b07
BLAKE2b-256 22e6b470472be2075ac28a2352071b7ef1b9743140baf0a675f1c06ed79e2c1a

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page