Skip to main content

Zyte's Page Object pattern for web scraping

Project description

PyPI Version Supported Python Versions Tox Ubuntu Tox Windows Coverage report Documentation Status

web-poet is a Python 3.10+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web parsing code.

See the documentation.

Developing

Setup your local Python environment via:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now everytime you perform a git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also directly invoke pre-commit run –all-files or tox -e linters to run them without performing a commit.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

web_poet-0.23.1.tar.gz (72.2 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

web_poet-0.23.1-py3-none-any.whl (47.7 kB view details)

Uploaded Python 3

File details

Details for the file web_poet-0.23.1.tar.gz.

File metadata

  • Download URL: web_poet-0.23.1.tar.gz
  • Upload date:
  • Size: 72.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for web_poet-0.23.1.tar.gz
Algorithm Hash digest
SHA256 be59cfdffc8c9e8af59df18214b58c23798b9ed46528c5913d7e6adbed7310ff
MD5 eee0769ea9251a471d27c700c55a44fb
BLAKE2b-256 d8b82b3ec92a6b7c9199d97ac0b8e0b187bbaac7505481ff8e3d4bba1af84aa5

See more details on using hashes here.

Provenance

The following attestation bundles were made for web_poet-0.23.1.tar.gz:

Publisher: publish.yml on scrapinghub/web-poet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file web_poet-0.23.1-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.23.1-py3-none-any.whl
  • Upload date:
  • Size: 47.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for web_poet-0.23.1-py3-none-any.whl
Algorithm Hash digest
SHA256 a21971f67dc756e552daa8246a7d218603c0d2bbbad0a7e19075530769ba0ffe
MD5 e4d0ec24016103b2a28f2bb761ba4038
BLAKE2b-256 2af5c41fb9676997e809fc7401220332228fa065063fd407c7749c8e57a52022

See more details on using hashes here.

Provenance

The following attestation bundles were made for web_poet-0.23.1-py3-none-any.whl:

Publisher: publish.yml on scrapinghub/web-poet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page