Zyte's Page Object pattern for web scraping

Project description

web-poet is a Python 3.7+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web parsing code.
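As a rough illustration of the page object idea, the sketch below wraps one page's HTML in a class that exposes the extracted data, so the parsing logic stays independent of how the page was downloaded. It deliberately uses only the Python standard library and does not use web-poet's own API; the names `BookPage`, `_TitleAndH1Parser`, and `to_item`, as well as the example URL and HTML, are hypothetical and chosen for illustration only.

```python
from dataclasses import dataclass
from html.parser import HTMLParser


class _TitleAndH1Parser(HTMLParser):
    """Collects the text of the first <title> and the first <h1> element."""

    def __init__(self):
        super().__init__()
        self._current = None  # tag whose text is currently being collected
        self.found = {}       # maps tag name -> collected text

    def handle_starttag(self, tag, attrs):
        # Only start collecting for the first occurrence of each tag.
        if tag in ("title", "h1") and tag not in self.found:
            self._current = tag
            self.found[tag] = ""

    def handle_data(self, data):
        if self._current is not None:
            self.found[self._current] += data

    def handle_endtag(self, tag):
        if tag == self._current:
            self._current = None


@dataclass
class BookPage:
    """Hypothetical page object: holds one page's inputs and exposes its data."""

    url: str
    html: str

    def to_item(self) -> dict:
        # All parsing lives here; the caller only supplies the URL and HTML.
        parser = _TitleAndH1Parser()
        parser.feed(self.html)
        parser.close()
        return {
            "url": self.url,
            "title": parser.found.get("title", "").strip(),
            "name": parser.found.get("h1", "").strip(),
        }


page = BookPage(
    url="https://example.com/book/1",
    html="<html><head><title>Books</title></head><body><h1> Dune </h1></body></html>",
)
item = page.to_item()
```

Because the class receives its inputs rather than fetching them, the same page object could in principle be fed by any downloader or by saved HTML in a test, which is the portability the pattern aims for.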

See the documentation.

Developing

Set up your local Python environment via:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now, every time you perform a git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also invoke pre-commit run --all-files or tox -e linters directly to run them without performing a commit.

Download files

Source Distribution

web-poet-0.13.1.tar.gz (65.8 kB)

Uploaded Source

Built Distribution

web_poet-0.13.1-py3-none-any.whl (44.4 kB)

Uploaded Python 3

File details

Details for the file web-poet-0.13.1.tar.gz.

File metadata

  • Download URL: web-poet-0.13.1.tar.gz
  • Upload date:
  • Size: 65.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for web-poet-0.13.1.tar.gz
Algorithm     Hash digest
SHA256        4c5f31d58cc57324f419481cabcdbbc4207df7007b56700e82e8d6cd2212fa31
MD5           0281a43f52360fb3f08627ed887fce92
BLAKE2b-256   f93082590d9d87eb4368154ba54ef6593e4fd2058ccbbbfbbbe05e96d52ac8e5

File details

Details for the file web_poet-0.13.1-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.13.1-py3-none-any.whl
  • Upload date:
  • Size: 44.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.3

File hashes

Hashes for web_poet-0.13.1-py3-none-any.whl
Algorithm     Hash digest
SHA256        c25d6896eae81dfeb840f87127184161ebb09386778f31659bb6b8b8fca9a0d4
MD5           785a48a9a12459924016db26ef373c7e
BLAKE2b-256   b775f0835a9ef64a4f1d7d85cbfeaade0a80ab1dc6c3a8ad7dc38b01877ab59e
