Scrapfly SDK for Scrapfly

Scrapfly SDK

Installation

pip install scrapfly-sdk

You can also install extra dependencies:

  • pip install "scrapfly-sdk[seepdup]" for performance improvements
  • pip install "scrapfly-sdk[concurrency]" for concurrency out of the box (asyncio / thread)
  • pip install "scrapfly-sdk[scrapy]" for Scrapy integration
  • pip install "scrapfly-sdk[all]" for everything

To use the built-in HTML parser (via the ScrapeApiResponse.selector property), either parsel or scrapy must also be installed.
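Since the selector property depends on an optional package, it can help to check up front which one is available. The following is a minimal sketch using only the standard library; the helper names `selector_backends` and `selector_available` are hypothetical and not part of the SDK.

```python
from importlib.util import find_spec


def selector_backends():
    """Return the optional parser packages that are importable here."""
    # find_spec returns None for a top-level package that is not installed.
    return [name for name in ("parsel", "scrapy") if find_spec(name) is not None]


def selector_available():
    """True when at least one package needed by `.selector` is installed."""
    return bool(selector_backends())
```

Calling `selector_available()` before touching `ScrapeApiResponse.selector` lets a script fail early with a clear message instead of at parse time.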

Get Your API Key

You can create a free account on Scrapfly to get your API Key.

Migration

Migrate from 0.7.x to 0.8

The asyncio-pool dependency has been dropped.

scrapfly.concurrent_scrape is now an async generator. If concurrency is None or not set, the maximum concurrency allowed by your current subscription is used.

    # Must run inside a coroutine, since `async for` is only valid there.
    async for result in scrapfly.concurrent_scrape(concurrency=10, scrape_configs=[ScrapeConfig(...), ...]):
        print(result)
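To illustrate how an async generator with a concurrency limit is consumed, here is a self-contained sketch using only asyncio. It does not call the SDK: `fake_scrape`, `bounded_results`, and `collect` are hypothetical stand-ins, and the SDK's own implementation may differ.

```python
import asyncio


async def fake_scrape(url, delay):
    # Hypothetical stand-in for one scrape call: it only sleeps and echoes.
    await asyncio.sleep(delay)
    return f"scraped {url}"


async def bounded_results(jobs, concurrency):
    """Async generator: keep at most `concurrency` jobs in flight and
    yield each result as soon as it is ready."""
    semaphore = asyncio.Semaphore(concurrency)

    async def run(url, delay):
        async with semaphore:
            return await fake_scrape(url, delay)

    tasks = [asyncio.create_task(run(url, delay)) for url, delay in jobs]
    for finished in asyncio.as_completed(tasks):
        yield await finished


async def collect(jobs, concurrency=2):
    # `async for` is only valid inside a coroutine, so the loop lives here.
    return [result async for result in bounded_results(jobs, concurrency)]
```

From synchronous code, the coroutine can be driven with `asyncio.run(collect([("a", 0.02), ("b", 0.01)]))`. Results arrive in completion order, not submission order, so callers should not rely on positions matching the input list.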

The brotli argument is deprecated and will be removed in the next minor release. In most cases it offers no size benefit over gzip and uses more CPU.

What's new

0.8.x

  • Better error logging
  • Improved async support for concurrent scraping with asyncio
  • The Scrapy media pipeline is now supported out of the box


Download files

Source Distribution

scrapfly-sdk-0.8.10.tar.gz (24.2 kB view details)

Uploaded Source

Built Distribution

scrapfly_sdk-0.8.10-py3-none-any.whl (28.8 kB view details)

Uploaded Python 3

File details

Details for the file scrapfly-sdk-0.8.10.tar.gz.

File metadata

  • Download URL: scrapfly-sdk-0.8.10.tar.gz
  • Upload date:
  • Size: 24.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.9.6

File hashes

Hashes for scrapfly-sdk-0.8.10.tar.gz
Algorithm Hash digest
SHA256 76d35fb846bfc40e444985d71f52097c7c3131c77071e16225ec2ff67caea411
MD5 3f6b534ed808ee41187e3148691f17de
BLAKE2b-256 8c1e82b8441cf21f56b6f335e1206663000c1ec433a3e7c94498824129078bc5

See more details on using hashes here.
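A downloaded file can be checked against the published SHA256 digest with the standard library. This is a minimal sketch; the function names `sha256_of` and `matches_published` are hypothetical, and the file path is whatever location you saved the download to.

```python
import hashlib


def sha256_of(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published(path, published_hex):
    """Compare the file's digest with the digest listed for it."""
    return sha256_of(path) == published_hex.strip().lower()
```

For example, `matches_published("scrapfly-sdk-0.8.10.tar.gz", "76d35fb846bfc40e444985d71f52097c7c3131c77071e16225ec2ff67caea411")` should return True for an unmodified copy of the source distribution listed above.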

File details

Details for the file scrapfly_sdk-0.8.10-py3-none-any.whl.

File hashes

Hashes for scrapfly_sdk-0.8.10-py3-none-any.whl
Algorithm Hash digest
SHA256 ee662fef4b9b3e2560b9ed1d60c979e8f97b8041778fabec793ef7acd86ed4be
MD5 73297e41b42b2466d8eb7b7b6f5cc59d
BLAKE2b-256 ba97aa8085ee683b91a15b1b077d64cb25383ce9200329ddac516528702f0ff1

See more details on using hashes here.
