A high-level Web Crawling and Web Scraping framework
Project description
Overview
Scrapy is a fast high-level web crawling and web scraping framework, used to crawl websites and extract structured data from their pages. It can be used for a wide range of purposes, from data mining to monitoring and automated testing.
For more information including a list of features check the Scrapy homepage at: https://scrapy.org
Requirements
Python 2.7 or Python 3.4+
Works on Linux, Windows, Mac OSX, BSD
Install
The quick way:
pip install scrapy
For more details see the install section in the documentation: https://docs.scrapy.org/en/latest/intro/install.html
Documentation
Documentation is available online at https://docs.scrapy.org/ and in the docs directory.
Releases
You can find release notes at https://docs.scrapy.org/en/latest/news.html
Community (blog, twitter, mail list, IRC)
Contributing
See https://docs.scrapy.org/en/master/contributing.html
Code of Conduct
Please note that this project is released with a Contributor Code of Conduct (see https://github.com/scrapy/scrapy/blob/master/CODE_OF_CONDUCT.md).
By participating in this project you agree to abide by its terms. Please report unacceptable behavior to opensource@scrapinghub.com.
Companies using Scrapy
Commercial Support
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for Scrapy-1.7.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 0f3114c6e67c0512cb3c21f5a520afe6b6efa80ddc84d19e39ccc345198fe783 |
|
MD5 | d0e9d4cb67b22b2c2d315e0c02eb4eae |
|
BLAKE2b-256 | 16c7a39786c21fdf3a4ef1694b61d273d7b2e700d7d7b64afaa26be2e237fd71 |