A simple library to capture websites using playwright

Reason this release was yanked:

Issue with playwright 1.49.0

Project description

Playwright Capture

Simple replacement for splash using playwright.

Install

pip install playwrightcapture

Usage

A very basic example:

import asyncio
from playwrightcapture import Capture

async def main(url):
    async with Capture() as capture:
        await capture.initialize_context()
        entries = await capture.capture_page(url, max_depth_capture_time=90)

asyncio.run(main("https://example.com"))  # placeholder URL

entries is a dictionary that contains (if all goes well) the HAR, the screenshot, all the cookies of the session, the URL as it is in the browser at the end of the capture, and the full HTML page as rendered.
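Since a capture does not always return every item, it can help to check what actually came back before using it. The sketch below is only an illustration: the helper summarize_capture is hypothetical, and the key names (har, png, cookies, last_redirected_url, html) are assumptions that should be checked against the library itself.

```python
from typing import Any, Dict, Tuple

# Assumed key names for the items listed above; verify them against
# the library before relying on this.
ASSUMED_KEYS: Tuple[str, ...] = ("har", "png", "cookies", "last_redirected_url", "html")


def summarize_capture(entries: Dict[str, Any]) -> Dict[str, bool]:
    """Map each assumed key to True if a non-empty value is present."""
    return {key: bool(entries.get(key)) for key in ASSUMED_KEYS}
```

Calling summarize_capture(entries) after a capture gives a quick view of which parts are missing, without raising a KeyError on a partial result.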

reCAPTCHA bypass

No black magic: it is just a reimplementation of a well-known technique, as implemented there and there.

This module will try to bypass reCAPTCHA-protected websites if you install it this way:

pip install playwrightcapture[recaptcha]

This will install requests, pydub and SpeechRecognition. In order to work, pydub requires ffmpeg or libav; look at the install guide for more details. SpeechRecognition uses the Google Speech Recognition API to turn the audio file into text (I hope you appreciate the irony).
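Because a missing ffmpeg or libav only shows up when pydub tries to process audio, a quick check up front can save some debugging. This is a minimal sketch, not part of the library: the helper find_audio_backend is hypothetical, and it assumes the tools are on the PATH under their usual executable names (ffmpeg, and avconv for libav).

```python
import shutil
from typing import Optional, Sequence


def find_audio_backend(candidates: Sequence[str] = ("ffmpeg", "avconv")) -> Optional[str]:
    """Return the path of the first candidate executable found on PATH, or None."""
    for name in candidates:
        path = shutil.which(name)
        if path is not None:
            return path
    return None
```

If find_audio_backend() returns None, install ffmpeg or libav before relying on the recaptcha extra.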

Download files

Source Distribution

playwrightcapture-1.27.2.tar.gz (22.3 kB)

Uploaded Source

Built Distribution

playwrightcapture-1.27.2-py3-none-any.whl (23.0 kB)

Uploaded Python 3
