An ultra-lightweight web screenshot tool written in Python

These details have not been verified by PyPI

Project links

Project description

WebCap is an extremely lightweight web screenshot tool. It doesn't require Selenium, Playwright, Puppeteer, or any other browser automation framework; all it needs is a working Chrome installation. Used by BBOT.

Installation

pipx install webcap

Web Interface (`webcap server`)

https://github.com/user-attachments/assets/a5dea3fb-fa01-41e7-90cd-67c6efa3d6e5

Features

WebCap's most unique feature is its ability to capture not only the fully-rendered DOM, but also every snippet of parsed Javascript (regardless of inline or external), and the full content of every HTTP request + response (including Javascript API calls etc.). For convenience, it can output directly to JSON.

Example Commands

Scanning

# Capture screenshots of all URLs in urls.txt
webcap scan urls.txt -o ./my_screenshots

# Output to JSON, and include the fully-rendered DOM
webcap scan urls.txt --json --dom | jq

# Capture requests and responses
webcap scan urls.txt --json --requests --responses | jq

# Capture javascript
webcap scan urls.txt --json --javascript | jq

# Extract text from screenshots
webcap scan urls.txt --json --ocr | jq

Server

# Start the server
webcap server

# Browse to http://localhost:8000

Screenshots

CLI Interface (`webcap scan`)

webcap_gif

Fully-rendered DOM

Javascript Capture

Requests + Responses

OCR

Full feature list

Blazing fast screenshots
Fullscreen capture (entire scrollable page)
JSON output
Full DOM extraction
Javascript extraction (inline + external)
Javascript extraction (environment dump)
Full network logs (incl. request/response bodies)
Title
Status code
Fuzzy (perception) hashing
Technology detection
OCR text extraction
Web interface

Webcap as a Python library

import base64
from webcap import Browser

async def main():
    # create a browser instance
    browser = Browser()
    # start the browser
    await browser.start()
    # take a screenshot
    webscreenshot = await browser.screenshot("http://example.com")
    # save the screenshot to a file
    with open("screenshot.png", "wb") as f:
        f.write(webscreenshot.blob)
    # stop the browser
    await browser.stop()

if __name__ == "__main__":
    import asyncio
    asyncio.run(main())

CLI Usage (--help)

 Usage: webcap scan [OPTIONS] URLS                                                            
                                                                                              
 Screenshot URLs                                                                              
                                                                                              
╭─ Arguments ────────────────────────────────────────────────────────────────────────────────╮
│ *    urls      TEXT  URL(s) to capture, or file(s) containing URLs [default: None]         │
│                      [required]                                                            │
╰────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Options ──────────────────────────────────────────────────────────────────────────────────╮
│ --json    -j                  Output JSON                                                  │
│ --chrome  -c      TEXT        Path to Chrome executable [default: None]                    │
│ --output  -o      OUTPUT_DIR  Output directory                                             │
│                               [default: /home/bls/Downloads/code/webcap/screenshots]       │
│ --help                        Show this message and exit.                                  │
╰────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Screenshots ──────────────────────────────────────────────────────────────────────────────╮
│ --resolution      -r      RESOLUTION  Resolution to capture [default: 1440x900]            │
│ --full-page       -f                  Capture the full page (larger resolution images)     │
│ --no-screenshots                      Only visit the sites; don't capture screenshots      │
│                                       (useful with -j/--json)                              │
╰────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ Performance ──────────────────────────────────────────────────────────────────────────────╮
│ --threads  -t      INTEGER  Number of threads to use [default: 15]                         │
│ --timeout  -T      INTEGER  Timeout before giving up on a web request [default: 10]        │
│ --delay            SECONDS  Delay before capturing [default: 3.0]                          │
╰────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ HTTP ─────────────────────────────────────────────────────────────────────────────────────╮
│ --user-agent  -U      TEXT  User agent to use                                              │
│                             [default: Mozilla/5.0 (Windows NT 10.0; Win64; x64)            │
│                             AppleWebKit/537.36 (KHTML, like Gecko) Chrome/131.0.0.0        │
│                             Safari/537.36]                                                 │
│ --headers     -H      TEXT  Additional headers to send in format: 'Header-Name:            │
│                             Header-Value' (multiple supported)                             │
│ --proxy       -p      TEXT  HTTP proxy to use [default: None]                              │
╰────────────────────────────────────────────────────────────────────────────────────────────╯
╭─ JSON (Only apply when -j/--json is used) ─────────────────────────────────────────────────╮
│ --base64        -b                     Output each screenshot as base64                    │
│ --dom           -d                     Capture the fully-rendered DOM                      │
│ --responses     -rs                    Capture the full body of each HTTP response         │
│                                        (including API calls etc.)                          │
│ --requests      -rq                    Capture the full body of each HTTP request          │
│                                        (including API calls etc.)                          │
│ --javascript    -J                     Capture every snippet of Javascript (inline +       │
│                                        external)                                           │
│ --ignore-types                   TEXT  Ignore these filetypes                              │
│                                        [default: Image, Media, Font, Stylesheet]           │
│ --ocr                --no-ocr          Extract text from screenshots [default: no-ocr]     │
╰────────────────────────────────────────────────────────────────────────────────────────────╯

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

0.1.103

Apr 1, 2026

0.1.102

Mar 31, 2026

0.1.101

Mar 31, 2026

0.1.100

Mar 30, 2026

0.1.98

Mar 30, 2026

0.1.97

May 20, 2025

0.1.94

Apr 3, 2025

0.1.92

Mar 13, 2025

0.1.88

Mar 13, 2025

0.1.85

Mar 5, 2025

This version

0.1.83

Mar 4, 2025

0.1.75

Feb 28, 2025

0.1.74

Feb 27, 2025

0.1.73

Feb 26, 2025

0.1.34

Dec 29, 2024

0.1.31

Dec 29, 2024

0.1.22

Dec 20, 2024

0.1.21

Dec 16, 2024

0.1.19

Dec 16, 2024

0.1.11

Dec 16, 2024

0.1.10

Dec 16, 2024

0.1.9

Dec 12, 2024

0.1.2

Dec 9, 2024

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

webcap-0.1.83.tar.gz (102.0 kB view details)

Uploaded Mar 4, 2025 Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

The dropdown lists show the available interpreters, ABIs, and platforms. Enable javascript to be able to filter the list of wheel files.

webcap-0.1.83-py3-none-any.whl (103.4 kB view details)

Uploaded Mar 4, 2025 Python 3

File details

Details for the file webcap-0.1.83.tar.gz.

File metadata

Download URL: webcap-0.1.83.tar.gz
Upload date: Mar 4, 2025
Size: 102.0 kB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for webcap-0.1.83.tar.gz
Algorithm	Hash digest
SHA256	`d7aac00fc370820d589379ed5c32d4720523d76954891f0830d023b50dedda64`
MD5	`aa4e55f1b58df95ac6220ccd4f24ecc0`
BLAKE2b-256	`b2d995a559b9ca634507d1d3ebcc947b502f8878f8e7f2ec572bcc4d08ff77cc`

See more details on using hashes here.

File details

Details for the file webcap-0.1.83-py3-none-any.whl.

File metadata

Download URL: webcap-0.1.83-py3-none-any.whl
Upload date: Mar 4, 2025
Size: 103.4 kB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/6.1.0 CPython/3.12.8

File hashes

Hashes for webcap-0.1.83-py3-none-any.whl
Algorithm	Hash digest
SHA256	`48e011f062e810b24132a39e6e3d79798de7cd03a87582228bb3579854f38c88`
MD5	`77be5e0ed9fd38d5220556cc21951b07`
BLAKE2b-256	`710dd2fdccfaa6f2c3f2bd25fd84020e15663a3eddf25d3d5ff86b4bc172d0e3`

See more details on using hashes here.

webcap 0.1.83

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Installation

Web Interface (webcap server)

Features

Example Commands

Scanning

Server

Screenshots

CLI Interface (webcap scan)

Fully-rendered DOM

Javascript Capture

Requests + Responses

OCR

Full feature list

Webcap as a Python library

CLI Usage (--help)

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes

Web Interface (`webcap server`)

CLI Interface (`webcap scan`)