Skip to main content
Avatar for scrapy from gravatar.com

  scrapy

21 projects

parsel

Last released on Oct 25, 2018

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

Scrapy

Last released on Jul 11, 2018

A high-level Web Crawling and Web Scraping framework

queuelib

Last released on Mar 12, 2018

Collection of persistent (disk-based) queues

splash

Last released on Feb 15, 2018

A javascript rendered with a HTTP API

w3lib

Last released on Jan 25, 2018

Library of web-related functions

webstruct

Last released on Dec 29, 2017

A library for creating statistical NER systems that work on HTML data

cssselect

Last released on Dec 27, 2017

cssselect parses CSS3 Selectors and translates them to XPath 1.0

scrapyd-client

Last released on Aug 24, 2017

A client for scrapyd

PyPyDispatcher

Last released on Jul 3, 2017

Multi-producer-multi-consumer signal dispatching mechanism

scrapely

Last released on May 26, 2017

A pure-python HTML screen-scraping library

scrapyd

Last released on Apr 12, 2017

A service for running Scrapy spiders, with an HTTP API

scrapy-deltafetch

Last released on Feb 9, 2017

Scrapy middleware to ignore previously crawled pages

adblockparser

Last released on Oct 17, 2016

Parser for Adblock Plus rules

loginform

Last released on Aug 16, 2016

Fill HTML login forms automatically

scrapy-splitvariants

Last released on Jul 18, 2016

Scrapy spider middleware to split an item into multiple items on a multi-valued key

scrapy-hcf

Last released on Jul 18, 2016

Scrapy spider middleware to use Scrapinghub's Hub Crawl Frontier as a backend for URLs

scrapy-querycleaner

Last released on Jun 30, 2016

Scrapy spider middleware to clean up query parameters in request URLs

scrapy-magicfields

Last released on Jun 30, 2016

Scrapy middleware to add extra "magic" fields to items

scrapy-djangoitem

Last released on May 4, 2016

Scrapy extension to write scraped items using Django models

scrapyjs

Last released on Mar 25, 2016

JavaScript support for Scrapy using Splash

scrapy-jsonrpc

Last released on Apr 13, 2015

Scrapy extenstion to control spiders using JSON-RPC

Supported by

Elastic Elastic Search Pingdom Pingdom Monitoring Google Google BigQuery Sentry Sentry Error logging AWS AWS Cloud computing DataDog DataDog Monitoring Fastly Fastly CDN SignalFx SignalFx Supporter DigiCert DigiCert EV certificate StatusPage StatusPage Status page