Skip to main content

ScrapSpyder: Simple websites scraping package

Project description

ScrapSpyder

ScrapSpyder is a Python package that allows you to scrape and download files from a website.

Installation

To install ScrapSpyder, simply run:

pip install scrapspyder

Usage

To use ScrapSpyder, create an instance of the Scraper class and pass in the URL of the website you want to scrape. You can also specify the destination folder where the downloaded files will be saved, the depth of the crawl, whether to perform a deep scan for assets, and more. Here's an example:

from scraper import Scraper

sc = Scraper(source="https://www.example.com/",
             dest="example",
             depth=1,
             deep_scan=True,
             verbose=True,
             patterns_to_trim=["some-pattern"]
             )
sc.download()

This will crawl the website and download all the files to a folder called example.

Requirements

This package requires the following Python libraries:

Contributing

If you find a bug or would like to contribute to the development of ScrapSpyder, please open an issue or pull request on GitHub repository.

License

ScrapSpyder is licensed under the MIT License. See LICENSE for more information.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

scrapspyder-0.1.0.tar.gz (4.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

scrapspyder-0.1.0-py3-none-any.whl (4.7 kB view details)

Uploaded Python 3

File details

Details for the file scrapspyder-0.1.0.tar.gz.

File metadata

  • Download URL: scrapspyder-0.1.0.tar.gz
  • Upload date:
  • Size: 4.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for scrapspyder-0.1.0.tar.gz
Algorithm Hash digest
SHA256 65e333019c62b9bbad449048360f3fc8797c9cf697145554991291c15a51fdda
MD5 1c1ca99cb55517dc46156b81984263c3
BLAKE2b-256 c437c0ea3cc13180cb06ac40a84a870fe156434c6ca463f9fd73b38edc9b52b0

See more details on using hashes here.

File details

Details for the file scrapspyder-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: scrapspyder-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 4.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.1

File hashes

Hashes for scrapspyder-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 ae2961c9a42f365c18c612f5b53a1bff2d1946c44e5aeb4ada675f99f6bb021f
MD5 c930e872f25741d4e93976a9b52cc80e
BLAKE2b-256 ba5a60a37f2e6d56002af5dbfa0349692f5983df0c88387bd7cbce1b39bf3c7c

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page