Screen scraping and web crawling framework

Project description

Pomp is a screen scraping and web crawling framework. It is inspired by and similar to Scrapy, but has a simpler implementation and no hard dependency on Twisted.

Features:

  • Pure Python

  • Only one dependency on Python 2.x: concurrent.futures (the backport of the Python 3 package)

  • Supports single-file applications; Pomp doesn’t force a specific project layout or impose other restrictions.

  • Pomp is a meta framework like Paste: you may use it to create your own scraping framework.

  • Extensible networking: you may use any sync or async method.

  • No parsing libraries in the core; use your preferred approach.

  • Pomp instances may be distributed and are designed to work with an external queue.
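The "extensible networking" point above can be illustrated with a minimal sketch. It does not use Pomp's own classes: `ThreadedDownloader`, its `get` method, and the `fetch` callable are hypothetical names chosen for illustration, and the only library involved is concurrent.futures, the dependency named in the feature list. The idea is that the transport is just a callable you pass in, so a synchronous fetcher can be run concurrently without the rest of the code knowing how pages are retrieved.

```python
from concurrent.futures import ThreadPoolExecutor


class ThreadedDownloader:
    """Hypothetical downloader (not Pomp's API): runs any blocking
    fetch callable over a list of URLs in a thread pool."""

    def __init__(self, fetch, max_workers=4):
        # `fetch` is any callable that takes a URL and returns a result;
        # it could wrap urllib, requests, or a test stub.
        self.fetch = fetch
        self.max_workers = max_workers

    def get(self, urls):
        # pool.map returns results in the same order as the input URLs.
        with ThreadPoolExecutor(max_workers=self.max_workers) as pool:
            return list(pool.map(self.fetch, urls))
```

Because the fetch callable is injected, the same class works with a stub in tests and with a real HTTP client in production.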

Pomp makes no attempt to accommodate:

  • redirects

  • proxies

  • caching

  • database integration

  • cookies

  • authentication

  • etc.

If you want proxies, redirects, or similar, you may use the excellent requests library as the Pomp downloader.
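As a rough sketch of what delegating these concerns to requests might look like, the helper below wraps a session object in a plain fetch function. `make_fetch` and its parameters are hypothetical names, not part of Pomp or requests; the only calls assumed on the session are `get(url, proxies=..., timeout=..., allow_redirects=...)` and the response attributes `url`, `status_code`, and `text`, which `requests.Session` and `requests.Response` provide. How the resulting function is wired into a Pomp downloader depends on the Pomp version and is not shown here.

```python
def make_fetch(session, proxies=None, timeout=10.0):
    """Build a fetch(url) function around a requests-style session.

    Hypothetical helper: the session is expected to behave like
    requests.Session, which handles redirects, proxies, and cookies.
    """

    def fetch(url):
        response = session.get(
            url,
            proxies=proxies,        # e.g. {"http": "http://proxy.example:8080"}
            timeout=timeout,        # seconds before the request is abandoned
            allow_redirects=True,   # let the session follow redirects
        )
        # Return the final URL (after redirects), status code, and body text.
        return response.url, response.status_code, response.text

    return fetch
```

With requests installed, `make_fetch(requests.Session())` would yield a function that follows redirects and keeps cookies across calls, since the session object carries that state.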

Pomp examples

Pomp docs

Pomp is written and maintained by Evgeniy Tatarkin and is licensed under the BSD license.
