Skip to main content

Zyte's Page Object pattern for web scraping

Project description

PyPI Version Supported Python Versions Tox Ubuntu Tox Windows Coverage report Documentation Status

web-poet is a Python 3.10+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web parsing code.

See the documentation.

Developing

Setup your local Python environment via:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now everytime you perform a git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also directly invoke pre-commit run –all-files or tox -e linters to run them without performing a commit.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

web_poet-0.23.3.tar.gz (72.5 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

web_poet-0.23.3-py3-none-any.whl (47.7 kB view details)

Uploaded Python 3

File details

Details for the file web_poet-0.23.3.tar.gz.

File metadata

  • Download URL: web_poet-0.23.3.tar.gz
  • Upload date:
  • Size: 72.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for web_poet-0.23.3.tar.gz
Algorithm Hash digest
SHA256 335578edb40943659100896c5f3bcdd576139f2eeeed889aef3f674e79659f1a
MD5 a180b19bd24e4a1f553c9a48cf10b792
BLAKE2b-256 64e35652923b1fcdb316114154b9fbd97bc381d650412e2d1a13c7c21bf1b8a8

See more details on using hashes here.

Provenance

The following attestation bundles were made for web_poet-0.23.3.tar.gz:

Publisher: publish.yml on scrapinghub/web-poet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file web_poet-0.23.3-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.23.3-py3-none-any.whl
  • Upload date:
  • Size: 47.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.7

File hashes

Hashes for web_poet-0.23.3-py3-none-any.whl
Algorithm Hash digest
SHA256 6ca98f31578ea4c7d3a37c5a8df190a4d992b99e315d95512fbd0c9082b8ad36
MD5 19a465be26cc6bcc9d281326c397db20
BLAKE2b-256 8d14d5ffb1c055b62c957caeae4a08ae3650ee634317721c396eaf7f2d50c543

See more details on using hashes here.

Provenance

The following attestation bundles were made for web_poet-0.23.3-py3-none-any.whl:

Publisher: publish.yml on scrapinghub/web-poet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page