Skip to main content

scrapy的一个下载中间件,绕过cloudflare检测

Project description

AroayCloudScraper

scrapy一个插件,绕过cloudflare检测,主要是封装 cloudscraper模块,并且将cloudscraper模块在scrapy中异步执行

使用,在setting添加

DOWNLOADER_MIDDLEWARES = {
    'aroay_cloudscraper.downloadermiddlewares.CloudScraperMiddleware': 543,
}

需setting设置

# 默认日志级别
AROAY_CLOUDSCRAPER_LOGGING_LEVEL = logging.DEBUG

默认超时
AROAY_CLOUDSCRAPER_DOWNLOAD_TIMEOUT = 30

# 默认延迟
AROAY_CLOUDSCRAPER_DELAY = 1

#必须设置,否则报错
COMPRESSION_ENABLED = False

RETRY_ENABLED: True
RETRY_TIMES: 3

代理使用

    def start_requests(self):
        for page in range(1, 2):
            yield CloudScraperRequest(self.base_url, callback=self.parse_index, dont_filter=True, proxy={
                "http": "http://username:password@ip:port",
                "https": "http://username:password@ip:port",
            },cookies={"over18":"1"},timeout=5)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

aroay_cloudscraper-1.4.tar.gz (4.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

aroay_cloudscraper-1.4-py3-none-any.whl (6.0 kB view details)

Uploaded Python 3

File details

Details for the file aroay_cloudscraper-1.4.tar.gz.

File metadata

  • Download URL: aroay_cloudscraper-1.4.tar.gz
  • Upload date:
  • Size: 4.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.4

File hashes

Hashes for aroay_cloudscraper-1.4.tar.gz
Algorithm Hash digest
SHA256 b7e79d782b65794389db5199ef5df63ecac2c91531d4d143f01c22b226d0b153
MD5 2b05e81fc3e8cb0fe3f8ee8fddf93c99
BLAKE2b-256 07ebb35ea2126e6fa03519be8b3413894001f0fda0a76368ada262e5ae897cd0

See more details on using hashes here.

File details

Details for the file aroay_cloudscraper-1.4-py3-none-any.whl.

File metadata

  • Download URL: aroay_cloudscraper-1.4-py3-none-any.whl
  • Upload date:
  • Size: 6.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.4

File hashes

Hashes for aroay_cloudscraper-1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 408491cea338cbd934ce915fec69d48dff36c31868284530f6ab1af6ae7bbe1f
MD5 2c0b7771e9a41d227df1666c44578676
BLAKE2b-256 9e292973c723b57867bfe2faf210b1bd1390858b10f1f9328398e418b9682c75

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page