Python ctypes bindings for reliq
Project description
reliq-python
A python module for reliq library.
Requirements
Installation
pip install reliq
Import
from reliq import reliq
Usage
from reliq import reliq, ReliqError
html = ""
with open('index.html','r') as f:
html = f.read()
rq = reliq(html) #parse html
expr = reliq.expr(r"""
div .user; {
a href; {
.name * l@[0] | "%i"
.link * l@[0] | "%(href)v"
},
.score.u span .score,
.info dl; {
.key dt | "%i",
.value dd | "%i"
},
.achievements.a li class=b>"achievement-" | "%i\n"
}
""") #expressions can be compiled
users = []
links = []
images = []
#filter()
# returns object holding list of results such object
# behaves like an array, but can be converted to array with
# self() - objects with lvl() 0
# children() - objects with lvl() 1
# descendants() - objects with lvl > 0
# full() - same as indexing filter(), all objects
for i in rq.filter(r'table; tr').self()[:-2]:
#"i"
# It has a set of functions for getting its properties:
# tag() tag name
# insides() string containing contents inside tag
# desc_count() count of descendants
# lvl() level in html structure
# attribsl() number of attributes
# attribs() returns dictionary of attributes
if i.child_count() < 3 and i[0].tag() == "div":
continue
#objects can be accessed as an array which is the same
#as array returned by descendants() method
link = i[5].attribs()['href']
if re.match('^https://$',href):
links.append(link)
continue
#search() returns str, in this case expression is already compiled
user = json.loads(i.search(expr))
users.append(user)
#reliq objects have __str__ method
#get_data() returns data from which the html structure has been compiled
#if the second argument of filter() is True the returned
#object will use independent data, allowing garbage collector
#to free the previous unused data
#fsearch()
# executes expression at parsing saving memory, and because
# of that it supports only chain expressions i.e use of
# grouping brackets and separating commas will throw an exception
for i in reliq.fsearch(r'ul; img src | "%(src)v\n"',html).split('\n')[:-1]:
images.append(i)
try: #handle errors
reliq.fsearch('p / /','<p></p>')
except ReliqError:
print("error")
Projects using reliq
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
reliq-0.0.23.tar.gz
(122.4 kB
view details)
Built Distribution
reliq-0.0.23-py3-none-any.whl
(118.1 kB
view details)
File details
Details for the file reliq-0.0.23.tar.gz
.
File metadata
- Download URL: reliq-0.0.23.tar.gz
- Upload date:
- Size: 122.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.1.dev0+g94f810c.d20240510 CPython/3.12.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3d131159a82a3b5d1ca3064a4f3c844cc2c158b1a1e0f8f5d25acb977bd1ee7c |
|
MD5 | 2c9bfa007d5521d9358992f25b9a71a0 |
|
BLAKE2b-256 | a1d9acce5abf8b636628db5b28e792aff91671eb2c858d7a06c95586e5158e8c |
File details
Details for the file reliq-0.0.23-py3-none-any.whl
.
File metadata
- Download URL: reliq-0.0.23-py3-none-any.whl
- Upload date:
- Size: 118.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/5.0.1.dev0+g94f810c.d20240510 CPython/3.12.5
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | af7d21f486cb15217316772cac4e59102f2bb6f4032f7994e9468799d663b79e |
|
MD5 | b41aa93ea4187d4f10aaf840d9961d2f |
|
BLAKE2b-256 | b757d36e2e57224505efef88a6c871fe11a429a1379afb86a25d8159657b7495 |