ScrapSpyder: Simple websites scraping package
Project description
ScrapSpyder
ScrapSpyder is a Python package that allows you to scrape and download files from a website.
Installation
To install ScrapSpyder, simply run:
pip install scrapspyder
Usage
To use ScrapSpyder, create an instance of the Scraper class and pass in the URL of the website you want to scrape. You can also specify the destination folder where the downloaded files will be saved, the depth of the crawl, whether to perform a deep scan for assets, and more. Here's an example:
from scraper import Scraper
sc = Scraper(source="https://www.example.com/",
dest="example",
depth=1,
deep_scan=True,
verbose=True,
patterns_to_trim=["some-pattern"]
)
sc.download()
This will crawl the website and download all the files to a folder called example.
Requirements
This package requires the following Python libraries:
Contributing
If you find a bug or would like to contribute to the development of ScrapSpyder, please open an issue or pull request on GitHub repository.
License
ScrapSpyder is licensed under the MIT License. See LICENSE for more information.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file scrapspyder-0.1.0.tar.gz.
File metadata
- Download URL: scrapspyder-0.1.0.tar.gz
- Upload date:
- Size: 4.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
65e333019c62b9bbad449048360f3fc8797c9cf697145554991291c15a51fdda
|
|
| MD5 |
1c1ca99cb55517dc46156b81984263c3
|
|
| BLAKE2b-256 |
c437c0ea3cc13180cb06ac40a84a870fe156434c6ca463f9fd73b38edc9b52b0
|
File details
Details for the file scrapspyder-0.1.0-py3-none-any.whl.
File metadata
- Download URL: scrapspyder-0.1.0-py3-none-any.whl
- Upload date:
- Size: 4.7 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.1
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
ae2961c9a42f365c18c612f5b53a1bff2d1946c44e5aeb4ada675f99f6bb021f
|
|
| MD5 |
c930e872f25741d4e93976a9b52cc80e
|
|
| BLAKE2b-256 |
ba5a60a37f2e6d56002af5dbfa0349692f5983df0c88387bd7cbce1b39bf3c7c
|