Skip to main content

Generic screenshot capture utilities (models, services, CLI helpers).

Project description

infra-screenshot

PyPI Python Versions Build Status License: AGPL v3 License: Commercial

infra-screenshot provides reusable models, services, and CLI helpers for capturing website screenshots. It exposes the core abstractions.

For a task-oriented index, see CAPABILITIES.md or inspect screenshot.capabilities from an installed environment. The installed wheel exposes screenshot.capabilities; the source tree and source distribution also include AGENTS.md, examples/, agent-snippets/, and skills/.

Table of Contents

Features

  • 🎭 Multiple backends: Support for both Playwright and Selenium
  • 📸 Flexible capture: Single screenshots or batch processing
  • 🔧 Configurable viewports: Desktop, mobile, and custom viewport sizes
  • 💾 Storage abstractions: Local filesystem or cloud storage backends
  • 🚀 Async/await support: Modern async architecture for better performance
  • 🛠️ CLI tools: Ready-to-use command-line interface
  • 🔄 Retry logic: Built-in retry with exponential backoff for reliability
  • 🎨 Visual cleanup: Auto-hide overlays, disable animations for cleaner screenshots

Installation

Using uv (Recommended)

# Install with Playwright backend (includes bundled Chromium)
uv pip install "infra-screenshot[playwright]"
uv run playwright install chromium

# OR install with Selenium backend (requires system Chrome)
uv pip install "infra-screenshot[selenium]"

Using pip

# Install with Playwright backend
pip install "infra-screenshot[playwright]"
playwright install chromium

# OR install with Selenium backend
pip install "infra-screenshot[selenium]"

Quick Verification

# Check installation
screenshot local -h

# Test capture
screenshot local --urls https://example.com --output-dir ./test-screenshots

Usage

CLI: Local Screenshot Capture

The CLI provides a local subcommand for capturing screenshots locally.

Basic Examples

# Capture a single URL
screenshot local --urls https://example.com --output-dir ./screenshots

# Capture multiple URLs (repeat the --urls flag for each URL)
screenshot local \
  --urls https://example.com \
  --urls https://github.com \
  --output-dir ./screenshots

# For many URLs, use a JSONL input file (recommended)
screenshot local --input urls.jsonl --output-dir ./screenshots

# Capture with custom settings
screenshot local \
  --urls http://localhost:3000 \
  --output-dir ./screenshots \
  --viewports desktop mobile \
  --depth 0 \
  --scroll false \
  --allow-autoplay true

Input File Format (JSONL)

For batch processing, create a file with one JSON object per line:

{"url": "https://example.com", "job_id": "example"}
{"url": "https://github.com", "job_id": "github"}
{"url": "https://docs.python.org", "job_id": "python-docs"}

Then run:

screenshot local --input urls.jsonl --output-dir ./screenshots

Each non-empty JSONL line must be an object with a non-empty url. Invalid records fail fast with a file-and-line validation error.

Common Options

Option Description Default
--viewports Viewport presets (desktop, mobile, tablet) desktop
--depth Link depth to follow (0 = single page only) 0
--scroll Enable scrolling before capture false
--scroll-step-delay-ms Delay between scroll steps (ms) 250
--max-scroll-steps Max scroll iterations 15
--full-page Capture entire page height (not just viewport) false
--timeout-s Page load timeout in seconds 60
--post-nav-wait-s Wait after navigation (settling time) 2.5
--pre-capture-wait-s Wait before screenshot 1.5
--hide-overlays Auto-hide popups/cookie banners true
--disable-animations Disable CSS animations for cleaner shots true
--allow-autoplay Allow media autoplay true
--mute-media Mute audio/video true
--block-media Block video/audio requests false
--site-concurrency Number of sites to capture in parallel 1
--max-viewport-concurrency Number of viewports captured in parallel per page 1
--max-pages Max pages per site (when following links) 1

See all options:

screenshot local -h

The CLI exits non-zero when any job is partial or failed. Partial jobs keep any artifacts already written and are reported in the JSON summary with outcome: "partial". Viewport preset typos now fail fast instead of falling back silently, and Playwright honors --post-nav-wait-s exactly without adding hidden settle waits. If you are upgrading existing consumers, see docs/contract-hardening-migration.md.

Real-World Examples

Capture homepage only (no scrolling, viewport-only):

screenshot local \
  --urls http://localhost:3000 \
  --output-dir ./tmp \
  --depth 0 \
  --scroll false \
  --full-page false

Full-page screenshot with scrolling:

screenshot local \
  --urls http://localhost:3000 \
  --output-dir ./tmp \
  --depth 0 \
  --scroll true \
  --full-page true

Capture multiple viewports:

screenshot local \
  --urls https://example.com \
  --output-dir ./screenshots \
  --viewports desktop mobile tablet

Python API: Programmatic Usage

For integration into your own tooling, call the async runner directly with a configured ScreenshotOptions payload:

from pathlib import Path
import asyncio

from screenshot import ScreenshotCaptureOutcome, ScreenshotOptions, capture_screenshots_async
from screenshot.models import CaptureOptions

async def capture_example() -> None:
    options = ScreenshotOptions(
        capture=CaptureOptions(
            enabled=True,
            viewports=("desktop",),
            depth=0,
            scroll=False,
        )
    )

    result = await capture_screenshots_async(
        "demo-job",
        "https://example.com",
        store_dir=Path("screenshots"),
        partition_date=None,
        options=options,
    )

    if result.outcome is ScreenshotCaptureOutcome.SUCCESS:
        print(f"Captured {result.captured} screenshot(s)")
    elif result.outcome is ScreenshotCaptureOutcome.PARTIAL:
        print(f"Partially captured {result.captured} screenshot(s)")
        for error in result.errors:
            print(f"Partial failure: {error.message}")
    else:
        for error in result.errors:
            print(f"Capture failed: {error.message}")

asyncio.run(capture_example())

Browser Setup

Playwright: Bundled Chromium vs System Chrome

By default, Playwright uses its own bundled Chromium (installed via playwright install chromium). This provides:

  • Reproducibility: Known browser version across environments
  • No system dependencies: Works in containers/CI without system Chrome
  • Headless-first design: Optimized for automation

When to use system Chrome (--playwright-executable-path):

  • 🎯 Testing against real Chrome (not Chromium)
  • 🎯 Using Chrome extensions or enterprise policies
  • 🎯 Matching end-user browser versions exactly
  • 🎯 Debugging with Chrome DevTools locally

Trade-offs:

Aspect Bundled Chromium System Chrome
Setup playwright install chromium Install Chrome + ensure compatibility
Version control Pinned to Playwright release Depends on system updates
Size ~300MB download Already on system
Reproducibility ✅ High (version-locked) ⚠️ Lower (varies by system)
Extensions ❌ Not supported ✅ Supported
DevTools Limited Full local debugging

Usage example with system Chrome:

screenshot local \
  --urls https://example.com \
  --output-dir ./screenshots \
  --playwright-executable-path /usr/bin/google-chrome-stable

Need a deeper comparison? Check the repository's .dev_docs/playwright_vs_selenium_linux.md for codec/DRM support, driver management, and when to switch to system Chrome.

Finding Chrome path:

# Linux/WSL
which google-chrome-stable    # Usually /usr/bin/google-chrome-stable
which chromium-browser         # Usually /usr/bin/chromium-browser

# macOS
/Applications/Google\ Chrome.app/Contents/MacOS/Google\ Chrome

# Windows (WSL path)
/mnt/c/Program\ Files/Google/Chrome/Application/chrome.exe

If the path is invalid, the tool logs a warning and falls back to bundled Chromium automatically.

Installing System Chrome/Chromium

For Playwright (Optional - only if using system Chrome)

# Google Chrome (stable) - Linux/WSL
wget -O /tmp/chrome.deb https://dl.google.com/linux/direct/google-chrome-stable_current_amd64.deb
sudo apt install -y /tmp/chrome.deb

# OR Chromium (from distro packages)
sudo apt-get update && sudo apt-get install -y chromium-browser fonts-liberation

For Selenium (Required)

Selenium always requires a system browser + matching chromedriver:

# Install Chrome (as above)
# Then install chromedriver
pip install webdriver-manager  # Auto-downloads matching chromedriver

Tools like webdriver-manager automatically download the chromedriver matching your installed Chrome version.

Configuration

Environment Variables

Runtime behavior can be customized via environment variables:

Variable Description Default
SCREENSHOT_SCROLL_STEP_DELAY_MS Delay between scroll steps (ms) 250
SCREENSHOT_MAX_SCROLL_STEPS Maximum scroll iterations 15
PLAYWRIGHT_CAPTURE_MAX_ATTEMPTS Retry attempts for failed captures 3
SCREENSHOT_RETRY_BACKOFF_S Initial retry delay (seconds) 0.5
SCREENSHOT_RETRY_MAX_BACKOFF_S Maximum retry delay (seconds) 5.0
SCREENSHOT_ENABLE_TIMING Enable additional timing/performance metrics during capture (see docs/perf-testing.md) See docs/perf-testing.md

Example:

export SCREENSHOT_SCROLL_STEP_DELAY_MS=200
export PLAYWRIGHT_CAPTURE_MAX_ATTEMPTS=5
screenshot local --urls https://example.com --output-dir ./screenshots

Logging

infra-screenshot uses Python's standard logging module. Enable diagnostics in your application or CLI runs with:

import logging

logging.basicConfig(level=logging.INFO)
logging.getLogger("screenshot.playwright_runner").setLevel(logging.DEBUG)

Logger namespaces:

Logger Purpose
screenshot.playwright_runner Playwright capture + upload lifecycle
screenshot.selenium_runner Selenium fallback pipeline
screenshot.cli CLI orchestration and batch processing

Log records include structured extra={...} fields such as job_id, url, and viewport. URLs are sanitized before logging to prevent leaking SAS tokens or credentials; configure your formatter (JSON/text) to emit those keys for easier filtering.

OpenTelemetry correlation

When using OpenTelemetry, attach trace/span IDs to screenshot logs so traces and logs stay aligned:

import logging
from pathlib import Path

from opentelemetry import trace

from screenshot import capture_screenshots_async

tracer = trace.get_tracer(__name__)
url = "https://example.com/products"
job_id = "otel-demo"

with tracer.start_as_current_span("screenshot-job") as span:
    logger = logging.getLogger("screenshot.playwright_runner")
    logger.info(
        "Starting screenshot job",
        extra={
            "job_id": job_id,
            "trace_id": span.get_span_context().trace_id,
            "span_id": span.get_span_context().span_id,
        },
    )
    options = ...  # Build ScreenshotOptions as shown above
    await capture_screenshots_async(
        job_id,
        url,
        store_dir=Path("/tmp/screens"),
        partition_date=None,
        options=options,
    )

Contributing

We welcome contributions! To get started with development:

  1. Read the contributing guide: CONTRIBUTING.md
  2. Set up your development environment (covered in CONTRIBUTING.md)
  3. Run tests and linters before submitting PRs

For bug reports and feature requests, please open an issue.

License

This project is dual-licensed:

AGPL-3.0 (Open Source)

Free for open-source and non-commercial use under the GNU Affero General Public License v3.0.

Key requirement: If you run this software as a service (SaaS, API, web app), you must make your complete source code available under AGPL-3.0.

Commercial License

For commercial use without AGPL obligations (proprietary products, SaaS without open-sourcing, etc.).

See LICENSE for full details.


Need help? Check out:

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

infra_screenshot-0.2.0.tar.gz (107.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

infra_screenshot-0.2.0-py3-none-any.whl (68.9 kB view details)

Uploaded Python 3

File details

Details for the file infra_screenshot-0.2.0.tar.gz.

File metadata

  • Download URL: infra_screenshot-0.2.0.tar.gz
  • Upload date:
  • Size: 107.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for infra_screenshot-0.2.0.tar.gz
Algorithm Hash digest
SHA256 0e2f9b7d54d98b9150741f5eb53cb4d13693a9ece13e0f07506660716583fed0
MD5 5e2125f101e4fb7a72959567985e750e
BLAKE2b-256 96919de3ba83289f61f151c31b83478a7b87833a854521b9989dcb0e45396a47

See more details on using hashes here.

Provenance

The following attestation bundles were made for infra_screenshot-0.2.0.tar.gz:

Publisher: publish.yml on pj-ms/infra-screenshot

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file infra_screenshot-0.2.0-py3-none-any.whl.

File metadata

File hashes

Hashes for infra_screenshot-0.2.0-py3-none-any.whl
Algorithm Hash digest
SHA256 864fc3ff169b3a88d96027954102c8e14d64daa8a312e62be2afc4fc36905114
MD5 8854a44f91847459c844f6a073cc5453
BLAKE2b-256 bb9853ab13964022aaeb9a7fa02a37177c18083575ecde52c5e1558486b0aa30

See more details on using hashes here.

Provenance

The following attestation bundles were made for infra_screenshot-0.2.0-py3-none-any.whl:

Publisher: publish.yml on pj-ms/infra-screenshot

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page