Skip to main content

A package that allows users to capture full-page screenshots of websites using Selenium and Chrome webdriver.

Project description

Pywebcapture

A package that allows users to capture full-page screenshots of websites using Selenium and Chrome webdriver.

Tested with Python version 3.8.3

Installation

  1. Download the latest version of Chrome webdriver
  2. Add chrome webdriver path to your system PATH (its also possible to pass the absolute path of your driver to the Driver instance)
  3. Run pip install pywebcapture

Basic Usage

Import the modules:

from pywebcapture import loader, driver

Use the CSVLoader to load your csv file containing the urls and optional file names:

Options:

  • input_filepath - The absolute path to your csv file (str)
  • has_header - Whether your csv has a header row or now (bool)
  • uri_column - The column that contains the uri's, can use either column name (str) or the index position (int)
  • filename_column - The column that contains the desired file names (str), can be set to None, where the driver will use the uri netloc as the filename
csv_file = loader.CSVLoader("example.csv", True, 3, None)

Call the get_uri_dict() method from the CSVLoader instance, this parses the CSV into a Python dictionary:

uri_dict = csv_file.get_uri_dict()

Create instance of the web driver:

Options:

  • driver_path - This is the absolute path to the chrome webdriver, if None or "chromedriver" it will attempt to search %PATH
  • output_path - This is the output path that you want to save screen shots at (str)
  • delay - This is the delay in seconds between each page request, minimum is 2 seconds, please crawl pages respectfully :)
  • uri_dict - The Python dictionary containing your file names and uri's
d = driver.Driver("path/to/chrome/webdriver", None, 3, uri_dict)

Run the driver, this will loop through all uri's, get the maximum scrollheight and then take a screenshot

d.run()

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

pywebcapture-0.0.3.tar.gz (5.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

pywebcapture-0.0.3-py3-none-any.whl (10.1 kB view details)

Uploaded Python 3

File details

Details for the file pywebcapture-0.0.3.tar.gz.

File metadata

  • Download URL: pywebcapture-0.0.3.tar.gz
  • Upload date:
  • Size: 5.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.8.3

File hashes

Hashes for pywebcapture-0.0.3.tar.gz
Algorithm Hash digest
SHA256 470cc51e1c6069cb4fb0ea5d8819f2528a6a26c5f17348ab89c7e305f8475c49
MD5 a139ec9e985fa8a15be8942594670087
BLAKE2b-256 b20b88dfdaf7c686c0bee7a1fc7bbb1a21a24637e48ba990b25d934890cdf492

See more details on using hashes here.

File details

Details for the file pywebcapture-0.0.3-py3-none-any.whl.

File metadata

  • Download URL: pywebcapture-0.0.3-py3-none-any.whl
  • Upload date:
  • Size: 10.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.2.0 requests-toolbelt/0.9.1 tqdm/4.48.0 CPython/3.8.3

File hashes

Hashes for pywebcapture-0.0.3-py3-none-any.whl
Algorithm Hash digest
SHA256 240571b52dc496f33420240f824f6a15c6950ecc81172d0a2202cffe502f5833
MD5 6eaa28135e606860ee92763117861836
BLAKE2b-256 89a42935b5dabaa4c6c411b40591587552e286d7d307948ec40747e56fdc9749

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page