Extract information from HTML pages that have some kind of repetitive pattern

Project description

This package finds repetitive format patterns in an HTML page that contains one or more lists, and extracts the HTML fragments that make up those patterns. The idea is that a typical HTML data page containing a list of items shows a repetitive pattern to the human eye (the page format). That pattern can be recognized automatically, and the data in the list can then be extracted.
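
The idea described above can be illustrated with a minimal sketch. This is not the HtmlList API: the names `Node`, `TreeBuilder`, `signature`, and `find_repeated_items` are hypothetical, and the heuristic (group sibling elements that share the same tag structure, then keep the largest group) is an assumption chosen for illustration using only the standard library.

```python
from collections import defaultdict
from html.parser import HTMLParser

# Elements that never have a closing tag (subset of the HTML void elements).
VOID_TAGS = {"br", "hr", "img", "input", "link", "meta"}


class Node:
    """Hypothetical minimal DOM node: a tag plus its content in document order."""

    def __init__(self, tag, parent=None):
        self.tag = tag
        self.parent = parent
        self.content = []  # child Node objects and text strings

    @property
    def children(self):
        return [c for c in self.content if isinstance(c, Node)]


class TreeBuilder(HTMLParser):
    """Builds a small element tree from the parser's tag events."""

    def __init__(self):
        super().__init__()
        self.root = Node("root")
        self.current = self.root

    def handle_starttag(self, tag, attrs):
        node = Node(tag, self.current)
        self.current.content.append(node)
        if tag not in VOID_TAGS:
            self.current = node

    def handle_endtag(self, tag):
        # Close the nearest open element with this tag; ignore stray end tags.
        node = self.current
        while node is not self.root and node.tag != tag:
            node = node.parent
        if node is not self.root:
            self.current = node.parent

    def handle_data(self, data):
        if data.strip():
            self.current.content.append(data.strip())


def signature(node):
    """Structural fingerprint: the tag and the fingerprints of its children."""
    return (node.tag, tuple(signature(c) for c in node.children))


def node_text(node):
    parts = [c if isinstance(c, str) else node_text(c) for c in node.content]
    return " ".join(p for p in parts if p)


def find_repeated_items(html, min_repeats=3):
    """Return the text of the largest group of same-shaped sibling elements."""
    builder = TreeBuilder()
    builder.feed(html)
    builder.close()
    best = []
    stack = [builder.root]
    while stack:
        node = stack.pop()
        groups = defaultdict(list)
        for child in node.children:
            if node_text(child):  # skip empty elements such as <br>
                groups[signature(child)].append(child)
        for group in groups.values():
            if len(group) >= min_repeats and len(group) > len(best):
                best = group
        stack.extend(node.children)
    return [node_text(n) for n in best]


SAMPLE_HTML = """
<html><body>
<h1>Books</h1>
<ul>
<li><b>Dune</b> <i>Herbert</i></li>
<li><b>Emma</b> <i>Austen</i></li>
<li><b>Ulysses</b> <i>Joyce</i></li>
</ul>
<p>footer</p>
</body></html>
"""

print(find_repeated_items(SAMPLE_HTML))
# prints ['Dune Herbert', 'Emma Austen', 'Ulysses Joyce']
```

Real pages are rarely this regular (items may differ slightly in structure), so a practical detector would likely need a more tolerant comparison than exact signature equality.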

Download files

Source Distributions

HtmlList-2.2.2.zip (393.0 kB)

Uploaded Source

HtmlList-2.2.2.tar.gz (359.5 kB)

Uploaded Source

Built Distribution

HtmlList-2.2.2-py2.6.egg (456.6 kB)

Uploaded Egg

File details

Details for the file HtmlList-2.2.2.zip.

File metadata

  • Download URL: HtmlList-2.2.2.zip
  • Upload date:
  • Size: 393.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for HtmlList-2.2.2.zip
Algorithm     Hash digest
SHA256        7bad82b4affab215be0dd894224bf1aee018b795df5904ef70e7703fb9ed309e
MD5           9f6b6623378d8a5bb7226770206972b2
BLAKE2b-256   3b6af3fd7af649e28d7b0eb5b82a178eaf850a81b56d9a1bb4385d4ccf04e09e
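
A downloaded archive can be checked against a published digest with Python's standard `hashlib` module. The following is a minimal sketch: the helper names `sha256_of_file` and `matches_expected` are hypothetical, and the local path assumes the file was saved under its original name in the current directory.

```python
import hashlib
import os


def sha256_of_file(path, chunk_size=65536):
    """Compute the SHA256 hex digest of a file, reading it in chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as fh:
        for chunk in iter(lambda: fh.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected_hex):
    """Return True if the file's SHA256 digest equals the expected hex string."""
    return sha256_of_file(path) == expected_hex.strip().lower()


if __name__ == "__main__":
    # Assumed local file name; the digest is the SHA256 value listed for the .zip.
    archive = "HtmlList-2.2.2.zip"
    expected = "7bad82b4affab215be0dd894224bf1aee018b795df5904ef70e7703fb9ed309e"
    if os.path.exists(archive):
        print("OK" if matches_expected(archive, expected) else "MISMATCH")
    else:
        print("File not found:", archive)
```

SHA256 is used here rather than MD5 because MD5 is no longer considered collision resistant.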

File details

Details for the file HtmlList-2.2.2.tar.gz.

File metadata

  • Download URL: HtmlList-2.2.2.tar.gz
  • Upload date:
  • Size: 359.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for HtmlList-2.2.2.tar.gz
Algorithm     Hash digest
SHA256        79d053fd0b1aee64ec7348701a353b1afe059e8a95acecd4f145a31d44e12205
MD5           5e595f8797062667e2cab31fb8ea2898
BLAKE2b-256   ba2030b77561d9735f1000cbb89585083ad74c3d72ce817055cffa5a47c1684c

File details

Details for the file HtmlList-2.2.2-py2.6.egg.

File metadata

  • Download URL: HtmlList-2.2.2-py2.6.egg
  • Upload date:
  • Size: 456.6 kB
  • Tags: Egg
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for HtmlList-2.2.2-py2.6.egg
Algorithm     Hash digest
SHA256        949f4c4bc1044e8112426bbf8851bb6153406aeb4137b172fe9429cdc2e0a469
MD5           9e6622f96ff3850544b36291634378ff
BLAKE2b-256   198fcd4be81bb212204a0956f43653e9fdd392b085eb16d7a0fe45c75dec6a97
