Like re, but cuter.
Project description
qre - like re, but cuter
| Samples | |
|---|---|
| Pattern | "* pattern" |
| Input | "First pattern" |
| Result | 👍🏼 |
| Pattern | "World's [] pattern" |
| Input | "World's coolest pattern" |
| Result | ["coolest"] |
| Pattern | "product id: [product_id], units: [units: int], price: [unit_price: decimal]" |
| Input | "Product ID: A123, Units: 3, Price: 1.23" (case-insensitive) |
| Result | {"product_id": "A123", "units": 3, "unit_price": Decimal("1.23")} |
This package owes everything, including most of the codebase, to Thomas Feldtmann's simplematch. All poor design decisions are mine. See this collection for other alternatives.
Status: Comprehensively tested but never used for anything real. All evolution is expected to be incremental.
Quick start
pip install qre
My first little match:
import qre
assert qre.search("in [place]", "Match made in heaven") == {"place": "heaven"}
qre is mostly focused on collecting named groups from the input strings, so the return value is
an easy-to-access dict. Groups are denoted with brackets, which means that patterns are friendly
with f-strings.
For unnamed groups the returned object has been tweaked a little bit - they can be found as a list
in the unnamed attribute:
assert qre.match("[] [:int]", "Lesson 1").unnamed == ["Lesson", 1]
Type specifiers can be used with both named and unnamed groups. They act both as specs for the pattern to find and, when applicable, as converters to the right type.
Currently available types are:
intfloatdecimaldate(ISO)datetime(ISO)uuidlettersidentifier(lettersplus numbers and underscore)emailurlipv4ipv6creditcardopen(any one of(,[,{)close(),], or})
You can register your own types and conversions with register_type(name, regex, converter=str).
As qre's goal is not to replicate the functionality of re, this can also act as the "escape hatch"
when you need just a little bit more than what qre offers.
Here's how to use register_type to turn an emoji into a textual description:
qre.register_type("mood", r"[😀😞]", lambda emoji: {"😀": "good", "😞": "bad"}.get(emoji, "unknown"))
assert qre.search("[mood:mood]", "I'm feeling 😀 today!") == {"mood": "good" }
Note that register_type manipulates a global object, so you only need to register custom types
once, probably somewhere towards the start of your program.
PRs for generally useful types are highly welcome.
Matching functions
Matching functions expect a pattern and a string to match against. The optional
case_sensitive argument is true by default.
match- Matchpatternagainst the whole of thestringmatch_start- Matchpatternagainst the beginning of thestringmatch_end- Matchpatternagainst the end of thestringsearch- Return the first match of thepatternin thestringsearch_all- Return all matches of thepatternin thestringas a list
All of the functions always return an object that is either truthy or falsy depending on whether
there was a match or not. They never return None, and the unnamed attribute contains at least an
empty list, so the returned object is always safe to iterate.
Alternatively, you can use the Matcher object. It has the following useful attributes:
regexfor debugging the generated regex, or for copying it for use with plainreconvertersfor debugging the converters in use
matcher = qre.Matcher("value: [quantitative:float]|[qualitative]", case_sensitive=False)
assert matcher.match("Value: 1.0") == {"quantitative": 1.0} # Or any of the other functions above
assert matcher.regex == "value:\\ (?P<quantitative>[+-]?(?:[0-9]*[.])?[0-9]+)|(?P<qualitative>.*)"
assert matcher.converters == {'quantitative': float}
As a final usage scenario, you can call qre on the command line:
$ python qre.py
usage: qre.py [-h] [--regex] pattern string
Pattern syntax summary
- Wildcards:
*- any character 0+ times+- any 1 character?- any 1 character, maybe
- Operators:
|- either of two characters or groups
- Groups:
[name]- named group called "name", returned in the main dict.[]- unnamed group, returned in theunnamedlist[name:4],[:4]- group that is 4 characters wide[name:int],[:int]- group that matches the type and is converted to that Python type
- Escaping:
[*],[+],[?],[|]- literal symbol, not wildcard[[,]]- literal brackets, not groups
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file qre-22.10.2.tar.gz.
File metadata
- Download URL: qre-22.10.2.tar.gz
- Upload date:
- Size: 8.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.14 CPython/3.10.6 Darwin/21.6.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
607acb84e9f2611d2c4b3fb10cc476e113a2f9eac2e9e040ecffa17691dde5cb
|
|
| MD5 |
c61e8161ef5cf677b058ef4661b4e293
|
|
| BLAKE2b-256 |
8bc90d79c6176ddb0cc9fda59800d6e744a9564204ecff6735721e6c8a6a0efe
|
File details
Details for the file qre-22.10.2-py3-none-any.whl.
File metadata
- Download URL: qre-22.10.2-py3-none-any.whl
- Upload date:
- Size: 7.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.1.14 CPython/3.10.6 Darwin/21.6.0
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
8a32c1e77c83dac0bfc5c0a1c018e68fef3470db7ee27587c004bf90d46ad1ef
|
|
| MD5 |
64245154908e2a6275d96fcd382afb54
|
|
| BLAKE2b-256 |
d595647ce3262b191df880f0fb6ac8b79b8cb273b7b321472a1c38dadd8c6ef4
|