11 projects
shub-workflow
Workflow manager for Zyte ScrapyCloud tasks.
e-models
Tools for helping build of extraction models with scrapy spiders.
hcf-backend
ScrapyCloud HubStorage frontier backend for Frontera
japanese-address
Japanese address parser
crawlera-session
Class that provides decorators and functions for easy handling of crawlera sessions in a scrapy spider.
scrapy-frontera
Featured Frontera scheduler for Scrapy
collection-scanner
Scrapinghub Hubstorage Collection scanner.
locode
Country and city codes from around the world.
json-pipeline
Json processing pipeline tools
sequential-parser
Utility to extract structured text using text patterns and a state machine.
kafka-scanner
High Level Kafka Scanner, supporting inverse consuming and deduplication. Based on kafka-python library.