Skip to main content

A high-level Web Crawling and Web Scraping framework

Project description

https://scrapy.org/img/scrapylogo.png

Scrapy

PyPI Version Supported Python Versions Ubuntu Windows Wheel Status Coverage report Conda Version

Overview

Scrapy is a BSD-licensed fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.

Scrapy is maintained by Zyte (formerly Scrapinghub) and many other contributors.

Check the Scrapy homepage at https://scrapy.org for more information, including a list of features.

Requirements

  • Python 3.9+

  • Works on Linux, Windows, macOS, BSD

Install

The quick way:

pip install scrapy

See the install section in the documentation at https://docs.scrapy.org/en/latest/intro/install.html for more details.

Documentation

Documentation is available online at https://docs.scrapy.org/ and in the docs directory.

Releases

You can check https://docs.scrapy.org/en/latest/news.html for the release notes.

Community (blog, twitter, mail list, IRC)

See https://scrapy.org/community/ for details.

Contributing

See https://docs.scrapy.org/en/master/contributing.html for details.

Code of Conduct

Please note that this project is released with a Contributor Code of Conduct.

By participating in this project you agree to abide by its terms. Please report unacceptable behavior to opensource@zyte.com.

Companies using Scrapy

See https://scrapy.org/companies/ for a list.

Commercial Support

See https://scrapy.org/support/ for details.

Release history Release notifications | RSS feed

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scrapy-2.12.0.tar.gz (1.2 MB view details)

Uploaded Source

Built Distribution

Scrapy-2.12.0-py2.py3-none-any.whl (311.2 kB view details)

Uploaded Python 2 Python 3

File details

Details for the file scrapy-2.12.0.tar.gz.

File metadata

  • Download URL: scrapy-2.12.0.tar.gz
  • Upload date:
  • Size: 1.2 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for scrapy-2.12.0.tar.gz
Algorithm Hash digest
SHA256 d66d6e76009b12447604196875a463b61d10721140032a8084a0a52df7f4788f
MD5 17c13179ca0fd249a9e4a4b91ee3d9ee
BLAKE2b-256 f852b0f4ded03c5966e7e90c607bb9aa7e3c5b228cb1d7051325fde017c46987

See more details on using hashes here.

Provenance

The following attestation bundles were made for scrapy-2.12.0.tar.gz:

Publisher: publish.yml on scrapy/scrapy

Attestations:

File details

Details for the file Scrapy-2.12.0-py2.py3-none-any.whl.

File metadata

  • Download URL: Scrapy-2.12.0-py2.py3-none-any.whl
  • Upload date:
  • Size: 311.2 kB
  • Tags: Python 2, Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for Scrapy-2.12.0-py2.py3-none-any.whl
Algorithm Hash digest
SHA256 c33e2dc7da42e727390bacb32dd9938a54ac210fa71972b5c392754f478669cd
MD5 20507c747386f76b14f15a23a3c8ff09
BLAKE2b-256 e8432cc828e9b7a453d791afbe3ef36c951f4641fc1d886b6d39e9455c5468e0

See more details on using hashes here.

Provenance

The following attestation bundles were made for Scrapy-2.12.0-py2.py3-none-any.whl:

Publisher: publish.yml on scrapy/scrapy

Attestations:

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page