Skip to main content

Zyte's Page Object pattern for web scraping

Project description

PyPI Version Supported Python Versions Tox Ubuntu Tox Windows Coverage report Documentation Status

web-poet is a Python 3.10+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web parsing code.

See the documentation.

Developing

Setup your local Python environment via:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now everytime you perform a git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also directly invoke pre-commit run –all-files or tox -e pre-commit to run them without performing a commit.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

web_poet-0.24.0.tar.gz (120.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

web_poet-0.24.0-py3-none-any.whl (51.8 kB view details)

Uploaded Python 3

File details

Details for the file web_poet-0.24.0.tar.gz.

File metadata

  • Download URL: web_poet-0.24.0.tar.gz
  • Upload date:
  • Size: 120.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for web_poet-0.24.0.tar.gz
Algorithm Hash digest
SHA256 4163f0b40af7879819c0a783299fd1ef4d0cdce05b3b1b1a691615786c937868
MD5 aaa82d8d08c064e70db0bd5daf6da6c5
BLAKE2b-256 24935a37e915cc26106e5ea92780a96b1e677be1f5964b5d604f160c69e1a48e

See more details on using hashes here.

Provenance

The following attestation bundles were made for web_poet-0.24.0.tar.gz:

Publisher: publish.yml on scrapinghub/web-poet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file web_poet-0.24.0-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.24.0-py3-none-any.whl
  • Upload date:
  • Size: 51.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for web_poet-0.24.0-py3-none-any.whl
Algorithm Hash digest
SHA256 00c262d96686a9040d6103187423299ef6df50836e142eea725093f4d597e4a1
MD5 816c795faeeae8e2581b5883e3fc422f
BLAKE2b-256 5ad535a81c35bddc5f70190abf127c5b08b2d430a699d27fe749cbfc50406ba7

See more details on using hashes here.

Provenance

The following attestation bundles were made for web_poet-0.24.0-py3-none-any.whl:

Publisher: publish.yml on scrapinghub/web-poet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page