A simplest HTML parsing library.

These details have not been verified by PyPI

Project links

Homepage

Project description

README

A simplest html parsing library.

Key features:

no third-party dependencies
no need to know CSS, Xpath or complicated rules to find element
interaction with native python lambda syntax or function-predicate
opportunity to work with damaged html
ability to use element relations (find ancestor, descendant, siblings)
standard find first element or find all by current filter

Installation

Via pip:

pip install py_parse

First example

Lets get src attribute (link) of the Google logo on google.com

import requests
from py_parse import parse

# get content of the google web page
content = requests.get('https://www.google.com/').text
# find first element with img-tag and 'alt' attribute equal to Google (logo)
google_logo = parse(content).find(lambda e: e.tag == 'img' and e.alt == 'Google')
# prints src attribute of the logo element
print(google_logo.src)

You will see following result

/images/branding/googlelogo/1x/googlelogo_white_background_color_272x92dp.png

If there is no element with current filter, you will get exception with filters text (if lamda was used)^ For code above lets say we use wrong filter

google_logo = parse(content).find(lambda e: e.tag == 'img' and e.alt == 'Wrong')

You will see following result

...traceback...
py_parse.exceptions.NoSuchElementError: No elements with current filter (e.tag == 'img' and e.alt == 'Wrong')

TODO - child, ancestor, sibling, descendant, mailformed html, check tags, autoclose

Contact me

Lexman2@yandex.ru

Project details

These details have not been verified by PyPI

Project links

Homepage

Release history Release notifications | RSS feed

0.2.0

Jan 23, 2021

0.1.9

Dec 12, 2020

0.1.8

Dec 12, 2020

0.1.7

Dec 4, 2020

0.1.6

Dec 3, 2020

0.1.3

Oct 24, 2020

This version

0.1.2

Oct 24, 2020

0.1.1

Oct 24, 2020

0.1.0

Oct 24, 2020

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

py_parse-0.1.2.tar.gz (7.4 kB view hashes)

Uploaded Oct 24, 2020 Source

Built Distribution

py_parse-0.1.2-py3-none-any.whl (8.7 kB view hashes)

Uploaded Oct 24, 2020 Python 3

Hashes for py_parse-0.1.2.tar.gz

Hashes for py_parse-0.1.2.tar.gz
Algorithm	Hash digest
SHA256	`9bc956add75b0f50efe0bb122773e0ab029ac1357189299af954d990fa25f398`
MD5	`9ced8f528a7e8dc8698531a3320a762c`
BLAKE2b-256	`6d73f3d2579dd7a085897c4f082ca2e3dad6684f9f6561a7ec6bb5278bb9338c`

Hashes for py_parse-0.1.2-py3-none-any.whl

Hashes for py_parse-0.1.2-py3-none-any.whl
Algorithm	Hash digest
SHA256	`057a8e181a9623c9700395c8b9eb33e216efe8cf1fda57b475e39ac5602ca581`
MD5	`1841d8ecea1085b03cc66f77a792fdac`
BLAKE2b-256	`2459fa43d956d071e5fc56355a17dafb4fed6eec106df87a2d9e258d0b28ead9`