A common library for netloc rule use case.
Project description
os-netloc-rule
A library for matching netloc.
Netloc is a concept to describe the location of a URL, it can be treated as 3-tuple (host, port, scheme)
Netloc match is a very common and useful operation on processing URL. For example, netloc blacklist is a series rules of netloc with ALLOWED or DISALLOWED:
abc.example.com|80|http ALLOWED
.example.com|| DISALLOWED
You can skip processing http://www.example.com/001.html
becase it match .example.com|| DISALLOWED
.
Install
pip install os-netloc-rule
APIs
-
load netloc and rule
from os_netloc_rule import Matcher netloc_rules = [ ('www.example.com||', 1), ('abc.example.com||', 2), ('abc.example.com|8080|', 3), ] matcher = Matcher() for netloc_string, rule in rules: matcher.load_from_string(netloc_string, rule)
-
match netloc
matcher.match('www.example.com') matcher.match('abc.example.com', '8080', 'http')
-
if there are same netloc with different rule, the latter covers the former by default. But you can custom your own
cmp
function when loading rulesdef cmp(former, latter): return -1 if former > latter else 1 matcher.load(host, port, scheme, rule, cmp=cmp)
-
iter netloc
for netloc, rule in matcher.iter(): pass
Unit Tests
tox
License
MIT licensed.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
File details
Details for the file os-netloc-rule-0.3.tar.gz
.
File metadata
- Download URL: os-netloc-rule-0.3.tar.gz
- Upload date:
- Size: 5.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/1.15.0 pkginfo/1.5.0.1 requests/2.23.0 setuptools/44.1.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/2.7.15
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | a076f12f5d1a2f63e0c55d86c45358e5d3e23d6f62e58a03e35636b9bbfd88f9 |
|
MD5 | 5356e8e78e2f5e98c6d54e2a8ea39dba |
|
BLAKE2b-256 | 2297a160cd64aeaf0882e37c0f0190eacd794ed4ac817234b5541e9a92a6c663 |