Skip to main content

Zyte's Page Object pattern for web scraping

Project description

PyPI Version Supported Python Versions Tox Ubuntu Tox Windows Coverage report Documentation Status

web-poet is a Python 3.8+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web parsing code.

See the documentation.

Developing

Setup your local Python environment via:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now everytime you perform a git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also directly invoke pre-commit run –all-files or tox -e linters to run them without performing a commit.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

web_poet-0.17.1.tar.gz (68.7 kB view details)

Uploaded Source

Built Distribution

web_poet-0.17.1-py3-none-any.whl (46.9 kB view details)

Uploaded Python 3

File details

Details for the file web_poet-0.17.1.tar.gz.

File metadata

  • Download URL: web_poet-0.17.1.tar.gz
  • Upload date:
  • Size: 68.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for web_poet-0.17.1.tar.gz
Algorithm Hash digest
SHA256 2db19040b1cf36bbb453bea83a07a9dabae5a3009cb9742c6b79fd2da3c5b1b9
MD5 0ccfd6ea06e03e930218d922e72a2fd6
BLAKE2b-256 9dc7ead714aa1ee8575297e1b7f600344c8e4afe6af52de2d4caa625b6fa89c6

See more details on using hashes here.

File details

Details for the file web_poet-0.17.1-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.17.1-py3-none-any.whl
  • Upload date:
  • Size: 46.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.12.7

File hashes

Hashes for web_poet-0.17.1-py3-none-any.whl
Algorithm Hash digest
SHA256 7859c59d70689441be556b9aea68b6af7c56d160e2d35012c04016f72ed43f3c
MD5 a015eb633c3e70468bc81e660e87c83c
BLAKE2b-256 c2f0f295b22d44011777c1c21ab25b75074061b07d4d8f97b5a9b6e40fa2587a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page