SiteSweeper is a Python package that helps you automate web scraping by writing crawled pages to a file.
Project description
sitesweeper
SiteSweeper is a Python command-line interface (CLI) tool for crawling websites and generating output files. It crawls a website to a given depth and saves the output as either a single PDF file or a folder of individual PDF files.
Installation
Install sitesweeper with pip:
pip install sitesweeper
Usage
python3.9 -m sitesweeper https://example.com --output-path example
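The same invocation can be driven from a Python script. The following is a minimal sketch, not part of the package: the helper names `build_sitesweeper_command` and `run_sitesweeper` are hypothetical, and the only arguments used are the URL and the `--output-path` option shown in the command above.

```python
import subprocess
import sys


def build_sitesweeper_command(url, output_path, python=sys.executable):
    """Return the argument list for the documented CLI invocation.

    Hypothetical helper: it only assembles the command shown in the
    Usage section (a URL plus --output-path) and assumes no other flags.
    """
    return [python, "-m", "sitesweeper", url, "--output-path", output_path]


def run_sitesweeper(url, output_path):
    """Run the crawl in a child process.

    check=True raises subprocess.CalledProcessError on a non-zero exit,
    so a failed crawl does not pass silently.
    """
    command = build_sitesweeper_command(url, output_path)
    return subprocess.run(command, check=True)
```

Calling `run_sitesweeper("https://example.com", "example")` should behave like the command line above, provided sitesweeper is installed for the interpreter that runs the script.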
Download files
Source Distribution
sitesweeper-1.0.2.tar.gz (4.7 kB)
Built Distribution
sitesweeper-1.0.2-py3-none-any.whl (5.3 kB)
File details
Details for the file sitesweeper-1.0.2.tar.gz.
File metadata
- Download URL: sitesweeper-1.0.2.tar.gz
- Upload date:
- Size: 4.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest
---|---
SHA256 | 9626de4f57cf169672981c88361699478f37a189c46f9219c1f8fc8c87408b57
MD5 | 12af62ccd1b7d3ba74f9122885dfe9fa
BLAKE2b-256 | 7b8984520fa3066f97f6d45747b2d2b820902ad293e848d98f60cfa6c4820efc
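A downloaded file can be checked against the published SHA256 digest before it is installed. The sketch below uses only the standard library; the function names and the local file path are assumptions for illustration, and the expected value is the SHA256 digest listed for sitesweeper-1.0.2.tar.gz.

```python
import hashlib
import hmac

# SHA256 digest published for sitesweeper-1.0.2.tar.gz (copied from the table).
EXPECTED_SDIST_SHA256 = (
    "9626de4f57cf169672981c88361699478f37a189c46f9219c1f8fc8c87408b57"
)


def sha256_of_file(path, chunk_size=65536):
    """Hash the file in chunks so large downloads need not fit in memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_digest(path, expected_hex):
    """Return True when the file's SHA256 equals the expected hex digest."""
    return hmac.compare_digest(sha256_of_file(path), expected_hex.lower())
```

For example, `matches_published_digest("sitesweeper-1.0.2.tar.gz", EXPECTED_SDIST_SHA256)` is expected to return `True` for an unmodified download saved under that (assumed) local name.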
File details
Details for the file sitesweeper-1.0.2-py3-none-any.whl.
File metadata
- Download URL: sitesweeper-1.0.2-py3-none-any.whl
- Upload date:
- Size: 5.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.16
File hashes
Algorithm | Hash digest
---|---
SHA256 | dc55a886fd1794e0f10465d47b5c07004bcecfec9301ac14f336455ee0953fe8
MD5 | f4caa1a84ef41a8e6b048f39f935c156
BLAKE2b-256 | e18a8a1006f74a562983901427b2074672feff7eced26c79aa037a803ed39cb2