Skip to main content

Zyte's Page Object pattern for web scraping

Project description

PyPI Version Supported Python Versions Tox Ubuntu Tox Windows Coverage report Documentation Status

web-poet is a Python 3.10+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web parsing code.

See the documentation.

Developing

Setup your local Python environment via:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now everytime you perform a git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also directly invoke pre-commit run –all-files or tox -e pre-commit to run them without performing a commit.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

web_poet-0.24.1.tar.gz (120.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

web_poet-0.24.1-py3-none-any.whl (51.8 kB view details)

Uploaded Python 3

File details

Details for the file web_poet-0.24.1.tar.gz.

File metadata

  • Download URL: web_poet-0.24.1.tar.gz
  • Upload date:
  • Size: 120.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for web_poet-0.24.1.tar.gz
Algorithm Hash digest
SHA256 f810baf0a51f491dcb310fee10d5817eec7b2ca3a63b5130ad11ab97295750d2
MD5 01e0d3a1ab66e0c573ac8d4d661d4d4a
BLAKE2b-256 f08b0dc9afa054736878b51f6264fd9abd32ce200a09acccae022d8bf21f5421

See more details on using hashes here.

Provenance

The following attestation bundles were made for web_poet-0.24.1.tar.gz:

Publisher: publish.yml on scrapinghub/web-poet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

File details

Details for the file web_poet-0.24.1-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.24.1-py3-none-any.whl
  • Upload date:
  • Size: 51.8 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? Yes
  • Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for web_poet-0.24.1-py3-none-any.whl
Algorithm Hash digest
SHA256 1e347a279db1781e76e4d3e9361f32c3bb885f0f5bb7ad6110f2a57d7edc0070
MD5 fee7a22bc370c593625bdabc369cd750
BLAKE2b-256 18c6968f2fb98e601d97efe2fce4e6f5bc86b7716293e03f28a023b303725f62

See more details on using hashes here.

Provenance

The following attestation bundles were made for web_poet-0.24.1-py3-none-any.whl:

Publisher: publish.yml on scrapinghub/web-poet

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page