A small example package
Project description
Format HTML to python object
Support xpath
Sample config
ep_conf = {"url": "{{video_url}}",
"root": "{{root}}",
"items": "//div[@class=\"container\"]/div[2]/div/div[@class=\"myui-panel-bg\"]/div/div[2]/div[@id=\"playlist{{st_cde}}\"]/ul/li[*]/a",
# "items": "//div[@class=\"myui-panel-bg\"]/div/div[@class=\"tab-content\"]/div[@id=\"playlist{{st_cde}}\"]/ul/li[*]/a",
"data": {"root": {"fix": "{{root}}"},
"menu_desc": {"fix": "{{menu_desc}}"},
"smenu_desc": {"fix": "{{smenu_desc}}"},
"video_desc": {"fix": "{{video_desc}}"},
"video_img": {"fix": "{{video_img}}"},
"st_cde": {"fix": "{{st_cde}}"},
"st_desc": {"fix": "{{st_desc}}"},
"ep_cde": {"_attr": "href", "_act": [{"split": {"-": 3}}, {"re": ".html$"}, {"check": {"type": "int"}}]},
"ep_url": {"_attr": "href", "_act": [{"pfx": "{{root}}"}]},
"ep_desc": {"get": "_text"}
}}
Execution
import html2obj
ext = html2obj.extract()
obj = ext.extract(conf=ep_conf)
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
html2obj-genwch-0.0.8.tar.gz
(8.5 kB
view hashes)
Built Distribution
Close
Hashes for html2obj_genwch-0.0.8-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | deff883a99ac6f50e6f827880f2da890679ed5d71b4295d07f828c0c7677a80e |
|
MD5 | d8fbc6bfc0515e13b064805cb6d4c052 |
|
BLAKE2b-256 | 365bd44612dc23a24189245c3f4324c060eb2c09f5669f1d2d21e7f9f6fb2168 |