Skip to main content

BotProxy (IP Rotating HTTP proxy) downloader middlewarefor Scrapy

Project description

BotProxy downloader middleware for Scrapy.

Overview

BotProxy is an IP Rotating HTTP Proxy. Plug BotProxy into your web scraping application and it will automatically route your requests through one of our outgoing proxy servers. Multiple locations, fresh IP addresses every day. Typical integrations take less than 5 minutes into any script or application.

Please note, in order to use this service you need to have active subscription. You can get one at BotProxy website.

Installation

From PyPI

pip install scrapy-botproxy

From GitHub

pip install -e git+https://github.com/botproxy/scrapy-botproxy@master#egg=scrapy_botproxy

Usage

settings.py:

DOWNLOADER_MIDDLEWARES = {
    'scrapy_botproxy.BotProxyMiddleware': 100,
}

BOTPROXY_USER = 'proxy_username'
BOTPROXY_PASSWORD = 'proxy_password'
BOTPROXY_LOCATION = 'us-ny'  # optional
BOTPROXY_COUNTRY = 'US'  # optional

You can also control botproxy behavior using scrapy request.meta property. Set botproxy_disable to True to temorary bypass proxy for current request. Set location or country to route request through selected location or any location in the specific country.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scrapy-botproxy-1.0.1.tar.gz (2.8 kB view details)

Uploaded Source

File details

Details for the file scrapy-botproxy-1.0.1.tar.gz.

File metadata

File hashes

Hashes for scrapy-botproxy-1.0.1.tar.gz
Algorithm Hash digest
SHA256 472203c4aa3c7d16b4e50a2b556f37546a4276532904266cf26e035870c95fc6
MD5 c189fd45c1626f5e36ed326cfd2b30a4
BLAKE2b-256 cc13ffd6acdd910463462e3e210bb710f6efa3c274d7c019262f983b1b614a04

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page