pablohoffman

Projects

adblockparser

Last released on Oct 17, 2016

Parser for Adblock Plus rules

dateparser

Last released on Sep 26, 2016

Date parsing library designed to parse dates from HTML pages

frontera

Last released on Nov 29, 2016

A scalable frontier for web crawlers

hubstorage

Last released on Dec 5, 2016

Client interface for Scrapinghub HubStorage

parsel

Last released on Nov 22, 2016

Parsel is a library to extract data from HTML and XML using XPath and CSS selectors

queuelib

Last released on Sep 9, 2015

Collection of persistent (disk-based) queues

scrapely

Last released on Jan 26, 2015

A pure-Python HTML screen-scraping library

Scrapy

Last released on Dec 6, 2016

A high-level Web Crawling and Web Scraping framework

scrapy-crawlera

Last released on Oct 17, 2016

Crawlera middleware for Scrapy

scrapyd

Last released on Nov 3, 2016

A service for running Scrapy spiders, with an HTTP API

scrapy-dotpersistence

Last released on Aug 4, 2016

Scrapy extension to sync `.scrapy` folder to an S3 bucket

scrapyjs

Last released on Mar 25, 2016

JavaScript support for Scrapy using Splash

scrapylib

Last released on Nov 14, 2016

Scrapy helper functions and processors

shub

Last released on Sep 20, 2016

Scrapinghub Command Line Client

slybot

Last released on Nov 11, 2016

Slybot crawler

splash

Last released on Nov 30, 2016

A JavaScript renderer with an HTTP API

w3lib

Last released on Nov 10, 2016

Library of web-related functions

webstruct

Last released on Nov 28, 2016

A library for creating statistical NER systems that work on HTML data
