Zyte's Page Object pattern for web scraping

Project description

web-poet is a Python 3.7+ implementation of the page object pattern for web scraping. It enables writing portable, reusable web data extraction code.
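To make the idea concrete, here is a minimal sketch of the page object pattern using only the Python standard library. It deliberately does not use web-poet's API: the names `HeadingPage`, `_HeadingParser`, and the `to_item` method are assumptions made for illustration. The point it tries to show is that a page object receives an already-downloaded page and only extracts data from it, so the same class can be reused with any downloader.

```python
from dataclasses import dataclass
from html.parser import HTMLParser


class _HeadingParser(HTMLParser):
    """Collects the text found inside <h1> elements (illustrative helper)."""

    def __init__(self):
        super().__init__()
        self._in_h1 = False
        self.heading = ""

    def handle_starttag(self, tag, attrs):
        if tag == "h1":
            self._in_h1 = True

    def handle_endtag(self, tag):
        if tag == "h1":
            self._in_h1 = False

    def handle_data(self, data):
        if self._in_h1:
            self.heading += data


@dataclass
class HeadingPage:
    """Hypothetical page object: holds the page input, performs no I/O."""

    url: str
    html: str

    def to_item(self) -> dict:
        # Extraction logic lives here, separate from how the page was fetched.
        parser = _HeadingParser()
        parser.feed(self.html)
        parser.close()
        return {"url": self.url, "heading": parser.heading.strip()}
```

Because `HeadingPage` never downloads anything itself, it can be exercised with a hard-coded HTML string in a test and with real responses in a crawler.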

See the documentation.

Developing

Set up your local Python environment:

  1. pip install -r requirements-dev.txt

  2. pre-commit install

Now, every time you run git commit, these tools will run against the staged files:

  • black

  • isort

  • flake8

You can also invoke pre-commit run --all-files or tox -e linters directly to run them without making a commit.

Download files

Source Distribution

web-poet-0.8.0.tar.gz (59.8 kB view details)

Uploaded Source

Built Distribution

web_poet-0.8.0-py3-none-any.whl (40.6 kB view details)

Uploaded Python 3

File details

Details for the file web-poet-0.8.0.tar.gz.

File metadata

  • Download URL: web-poet-0.8.0.tar.gz
  • Upload date:
  • Size: 59.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for web-poet-0.8.0.tar.gz:

  • SHA256: 0cf2d01f8c82cdeab84887e3ebf5b391b561517727a059c458b1b0670d1f5c58
  • MD5: daf8e25343a6a2ef0177e983e6751c91
  • BLAKE2b-256: 875a707612fa063f13dd699686cb362cc7e06a5f12a3b88a8303c4219925d8e7
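A published digest is only useful if it is compared against the file you actually downloaded. The following is a minimal sketch of that check using `hashlib` from the standard library; the helper names `sha256_of` and `matches_published_hash` are assumptions introduced here, not part of web-poet or PyPI tooling.

```python
import hashlib
from pathlib import Path


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with Path(path).open("rb") as fh:
        # iter() with a sentinel stops once read() returns empty bytes (EOF).
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's SHA256 digest with a published hex digest."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, with the source distribution saved in the current directory, `matches_published_hash("web-poet-0.8.0.tar.gz", "0cf2d01f8c82cdeab84887e3ebf5b391b561517727a059c458b1b0670d1f5c58")` should return `True` for an intact download. The same helper applies to the wheel and its SHA256 digest.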

File details

Details for the file web_poet-0.8.0-py3-none-any.whl.

File metadata

  • Download URL: web_poet-0.8.0-py3-none-any.whl
  • Upload date:
  • Size: 40.6 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.11.2

File hashes

Hashes for web_poet-0.8.0-py3-none-any.whl:

  • SHA256: d0dea7562db62be4b52a30c81821a8a86f65237d49fe2429e2c8d76726f97e05
  • MD5: 37804b20d43badcc3078a52026fe623c
  • BLAKE2b-256: 74ae784253898362fd07fc97c77586b8a8346e566d14889114b0b6b9a2931b69
