10 projects
shub-workflow
Workflow manager for scrapinghub ScrapyCloud tasks.
crawlera-session
Class that provides decorators and functions for easy handling of crawlera sessions in a scrapy spider.
hcf-backend
ScrapyCloud HubStorage frontier backend for Frontera
locode
Country and city codes from around the world.
scrapy-frontera
Featured Frontera scheduler for Scrapy
collection-scanner
Scrapinghub Hubstorage Collection scanner.
japanese-address
Japanese address parser
json-pipeline
Json processing pipeline tools
sequential-parser
Utility to extract structured text using text patterns and a state machine.
kafka-scanner
High Level Kafka Scanner, supporting inverse consuming and deduplication. Based on kafka-python library.