Scrapyd launcher module that schedules scrapy spiders by time
Project description
Scrapyd launcher module that schedules scrapy spiders by time.
Install
$ pip install scrapyduler
Config
To start using this library you just need to override the launcher option in your scrapyd.conf file:
[scrapyd]
launcher = scrapyduler.launcher.Launcher
and then add the schedulers configuration, e.g.:
[scheduler.1]
cron = * * * * *
project = quotesbot
spider = toscrape-xpath
settings = HTTPPROXY_ENABLED=True
args = key1=value1 key2=value2 start_url=http://quotes.toscrape.com/
[scheduler.2]
interval = weeks=0 days=0 hours=0 minutes=0 seconds=30
project = quotesbot
spider = toscrape-css
settings = HTTPPROXY_ENABLED=True
args = key1=value1 key2=value2 start_url=http://quotes.toscrape.com/
In the examples above, we set up two schedulers. The first scheduler uses cron syntax to run spiders. The second scheduler triggers on specified intervals, starting on start_date if specified, datetime.now() + interval otherwise. See https://github.com/agronholm/apscheduler for more information.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
scrapyduler-1.0.2.tar.gz
(3.0 kB
view hashes)
Built Distribution
Close
Hashes for scrapyduler-1.0.2-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4b00611fb6a1b142f5cfa356b6702857603d2032127023b23f3356d016d9ea35 |
|
MD5 | f14ea0c83e1a878ba7c9a1b4ad7b7394 |
|
BLAKE2b-256 | 3804e2e7a108c8e191349aa3bb7f0b8d54373ba36cc7f1285cfe0c5790a292b8 |