Skip to main content

A small example package

Project description

Format HTML to python object

Support xpath

Sample config

ep_conf = {"url": "{{video_url}}",
            "root": "{{root}}",
            "items": "//div[@class=\"container\"]/div[2]/div/div[@class=\"myui-panel-bg\"]/div/div[2]/div[@id=\"playlist{{st_cde}}\"]/ul/li[*]/a",
            #    "items": "//div[@class=\"myui-panel-bg\"]/div/div[@class=\"tab-content\"]/div[@id=\"playlist{{st_cde}}\"]/ul/li[*]/a",
            "data": {"root": {"fix": "{{root}}"},
                    "menu_desc": {"fix": "{{menu_desc}}"},
                    "smenu_desc": {"fix": "{{smenu_desc}}"},
                    "video_desc": {"fix": "{{video_desc}}"},
                    "video_img": {"fix": "{{video_img}}"},
                    "st_cde": {"fix": "{{st_cde}}"},
                    "st_desc": {"fix": "{{st_desc}}"},
                    "ep_cde": {"_attr": "href", "_act": [{"split": {"-": 3}}, {"re": ".html$"}, {"check": {"type": "int"}}]},
                    "ep_url": {"_attr": "href", "_act": [{"pfx": "{{root}}"}]},
                    "ep_desc": {"get": "_text"}
                    }}

Execution

import html2obj
ext = html2obj.extract()
obj = ext.extract(conf=ep_conf)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

html2obj-genwch-0.0.8.tar.gz (8.5 kB view details)

Uploaded Source

Built Distribution

html2obj_genwch-0.0.8-py3-none-any.whl (9.2 kB view details)

Uploaded Python 3

File details

Details for the file html2obj-genwch-0.0.8.tar.gz.

File metadata

  • Download URL: html2obj-genwch-0.0.8.tar.gz
  • Upload date:
  • Size: 8.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.5

File hashes

Hashes for html2obj-genwch-0.0.8.tar.gz
Algorithm Hash digest
SHA256 30f7a04476fef5b2593a93f1b573eb311c2b0125682265bfe6cade11991222eb
MD5 23d108636e965aefe252f2436d944f35
BLAKE2b-256 61281d54cf85c5ea997a6d2f81dd5568601a78623b2ba74255f0d736a533b45e

See more details on using hashes here.

Provenance

File details

Details for the file html2obj_genwch-0.0.8-py3-none-any.whl.

File metadata

  • Download URL: html2obj_genwch-0.0.8-py3-none-any.whl
  • Upload date:
  • Size: 9.2 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.4.1 importlib_metadata/4.0.1 pkginfo/1.7.0 requests/2.25.1 requests-toolbelt/0.9.1 tqdm/4.60.0 CPython/3.9.5

File hashes

Hashes for html2obj_genwch-0.0.8-py3-none-any.whl
Algorithm Hash digest
SHA256 deff883a99ac6f50e6f827880f2da890679ed5d71b4295d07f828c0c7677a80e
MD5 d8fbc6bfc0515e13b064805cb6d4c052
BLAKE2b-256 365bd44612dc23a24189245c3f4324c060eb2c09f5669f1d2d21e7f9f6fb2168

See more details on using hashes here.

Provenance

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page