Skip to main content
Avatar for scrapy from gravatar.com
Username    scrapy

36 projects

itemloaders

Last released

Base library for scrapy's ItemLoader

queuelib

Last released

Collection of persistent (disk-based) and non-persistent (memory-based) queues

Protego

Last released

Pure-Python robots.txt parser with support for modern conventions

parsel

Last released

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

w3lib

Last released

Library of web-related functions

cssselect

Last released

cssselect parses CSS3 Selectors and translates them to XPath 1.0

scrapy-lint

Last released

A linter for Scrapy projects

web-poet

Last released

Zyte's Page Object pattern for web scraping

scrapy-poet

Last released

Page Object pattern for Scrapy

Scrapy

Last released

A high-level Web Crawling and Web Scraping framework

itemadapter

Last released

Common interface for data container classes

andi

Last released

Library for annotation-based dependency injection

sphinx-scrapy

Last released

Sphinx extension for documentation in the Scrapy ecosystem

scrapyd

Last released

A service for running Scrapy spiders, with an HTTP API

scrapyd-client

Last released

A client for Scrapyd

scrapy-zyte-smartproxy

Last released

Scrapy middleware for Zyte Smart Proxy Manager

scrapy-deltafetch

Last released

Scrapy middleware to ignore previously crawled pages

scrapy-feedexporter-sftp

Last released

Scrapy extension Feed Exporter Storage Backend to export items to an SFTP server

scrapy-splash

Last released

JavaScript support for Scrapy using Splash

form2request

Last released

Build HTTP requests out of HTML forms

xtractmime

Last released

Implementation of the MIME Sniffing standard (https://mimesniff.spec.whatwg.org/)

splash

Last released

A javascript rendered with a HTTP API

scrapely

Last released

A pure-python HTML screen-scraping library

scrapy-po

Last released

Page Object pattern for Scrapy

flake8-scrapy

Last released

webstruct

Last released

A library for creating statistical NER systems that work on HTML data

PyPyDispatcher

Last released

Multi-producer-multi-consumer signal dispatching mechanism

adblockparser

Last released

Parser for Adblock Plus rules

loginform

Last released

Fill HTML login forms automatically

scrapy-splitvariants

Last released

Scrapy spider middleware to split an item into multiple items on a multi-valued key

scrapy-hcf

Last released

Scrapy spider middleware to use Scrapinghub's Hub Crawl Frontier as a backend for URLs

scrapy-querycleaner

Last released

Scrapy spider middleware to clean up query parameters in request URLs

scrapy-magicfields

Last released

Scrapy middleware to add extra "magic" fields to items

scrapy-djangoitem

Last released

Scrapy extension to write scraped items using Django models

scrapyjs

Last released

JavaScript support for Scrapy using Splash

scrapy-jsonrpc

Last released

Scrapy extenstion to control spiders using JSON-RPC

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page