A package that allows users to capture full-page screenshots of websites using Selenium and Chrome webdriver.
Project description
Pywebcapture
A package that allows users to capture full-page screenshots of websites using Selenium and Chrome webdriver.
Tested with Python version 3.8.3
Installation
- Download the latest version of Chrome webdriver
- Add chrome webdriver path to your system PATH (its also possible to pass the absolute path of your driver to the Driver instance)
- Run
pip install pywebcapture
Basic Usage
Import the modules:
from pywebcapture import loader, driver
Use the CSVLoader to load your csv file containing the urls and optional file names:
Options:
- input_filepath - The absolute path to your csv file (str)
- has_header - Whether your csv has a header row or now (bool)
- uri_column - The column that contains the uri's, can use either column name (str) or the index position (int)
- filename_column - The column that contains the desired file names (str), can be set to None, where the driver will use the uri netloc as the filename
csv_file = loader.CSVLoader("example.csv", True, 3, None)
Call the get_uri_dict() method from the CSVLoader instance, this parses the CSV into a Python dictionary:
uri_dict = csv_file.get_uri_dict()
Create instance of the web driver:
Options:
- driver_path - This is the absolute path to the chrome webdriver, if None or "chromedriver" it will attempt to search %PATH
- output_path - This is the output path that you want to save screen shots at (str)
- delay - This is the delay in seconds between each page request, minimum is 2 seconds, please crawl pages respectfully :)
- uri_dict - The Python dictionary containing your file names and uri's
d = driver.Driver("path/to/chrome/webdriver", None, 3, uri_dict)
Run the driver, this will loop through all uri's, get the maximum scrollheight and then take a screenshot
d.run()
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file pywebcapture-0.0.3.tar.gz.
File metadata
- Download URL: pywebcapture-0.0.3.tar.gz
- Upload date:
- Size: 5.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.8.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
470cc51e1c6069cb4fb0ea5d8819f2528a6a26c5f17348ab89c7e305f8475c49
|
|
| MD5 |
a139ec9e985fa8a15be8942594670087
|
|
| BLAKE2b-256 |
b20b88dfdaf7c686c0bee7a1fc7bbb1a21a24637e48ba990b25d934890cdf492
|
File details
Details for the file pywebcapture-0.0.3-py3-none-any.whl.
File metadata
- Download URL: pywebcapture-0.0.3-py3-none-any.whl
- Upload date:
- Size: 10.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.8.3
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
240571b52dc496f33420240f824f6a15c6950ecc81172d0a2202cffe502f5833
|
|
| MD5 |
6eaa28135e606860ee92763117861836
|
|
| BLAKE2b-256 |
89a42935b5dabaa4c6c411b40591587552e286d7d307948ec40747e56fdc9749
|