A Python library that provides a convenient interface for compiling and matching regular expression patterns using the re2 library.
Project description
re2shield
re2shield
is a Python library that provides a shield for working with regular expressions using the re2
module.
It allows you to compile and search for patterns in text using the powerful regular expression engine provided by re2
.
It allows you to hide the complexity of regular expressions and work with pattern identifiers instead.
This project utilizes the google/re2
library, which is licensed under the BSD 3-Clause License. Please refer to the LICENSE file of google/re2
for more information.
Installation
Before installing re2shield
, make sure that re2
is installed on your system. You can install re2
by following the instructions on the google/re2 GitHub repository.
Alternatively, you can use the re2-installer.sh
script located in the package/installed
directory. This script automates the installation process for re2
and its dependencies. Simply run the script using the following command:
git clone https://github.com/Npc-coder/re2shield.git
cd re2shield/installed
sh re2-installer.sh
You can install re2shield
using pip:
pip install re2shield
or
git clone https://github.com/Npc-coder/re2shield.git
cd re2shield
pip install .
Updates
Version 0.1.8
- Bug Fix
- Fixed a bug where pattern count were not being displayed correctly.
Please refer to the Usage section for examples of how to use these new features.
Usage
Importing the Library
Here is a simple example demonstrating how to use re2shield:
import re2shield
if __name__ == "__main__":
db = re2shield.Database()
print(db)
# Load patterns from file
try:
db = re2shield.load('patterns.db')
print(db) # Prints the number of patterns in the database
except FileNotFoundError:
# If pattern file doesn't exist, compile the patterns
patterns = [
(r'\b[A-Za-z0-9._%+-]+@[A-Za-z0-9.-]+\.[A-Z|a-z]{2,}\b', 1),
(r'\b\d{3}[-.\s]??\d{3}[-.\s]??\d{4}\b', 2),
(r'\d+', 3)
]
expressions, ids = zip(*patterns)
db.compile(expressions=expressions, ids=ids, overwrite=False)
print(db) # Prints the number of patterns in the database and version
db.dump('patterns.db')
# Find patterns in text
def match_handler(id, from_, to, flags, context):
print(f"Match found for pattern {id} from {from_} to {to}: {context}")
db.scan('test@ex12ample12.com', match_handler)
In this example, we create a re2shield.Database object, compile a list of patterns with their corresponding identifiers, and then search for those patterns in the provided text.
The match_handler function is called for each match found, allowing you to process the matches as desired.
Use Case
One of the key advantages of re2shield is its ability to hide the actual regular expression patterns from users during distribution. By compiling the patterns with re2 and using pattern identifiers, you can distribute your code without exposing the underlying regular expression logic. This provides an additional layer of abstraction and enhances the security of your regular expression patterns.
Features
- Hide the complexity of regular expressions
- Work with pattern identifiers instead of exposing the actual regular expression patterns
- Compile patterns for efficient matching
- Find all occurrences of patterns in text
- Customize the handling of matches using callback functions
License
This project is licensed under the BSD 3-Clause License. See the LICENSE file for details.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file re2shield-0.1.8.tar.gz
.
File metadata
- Download URL: re2shield-0.1.8.tar.gz
- Upload date:
- Size: 4.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 608e2ac8697795aa308a69fa4480788312986f4768522a4ffdb36aa1df04c19b |
|
MD5 | e942e3fa94521feab5a14ebda09c9990 |
|
BLAKE2b-256 | fc3da75089a5171e5414cfa25c018737a9262f2124f1b0afc18540df179661f6 |
File details
Details for the file re2shield-0.1.8-py3-none-any.whl
.
File metadata
- Download URL: re2shield-0.1.8-py3-none-any.whl
- Upload date:
- Size: 5.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.16
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8a7a5ffac9ae65944baf87bd6a33d94cc5c1c67a4db0c8c7a13c32518e659aa6 |
|
MD5 | b4c920376e5c01c4e4898c3aaf73ee33 |
|
BLAKE2b-256 | 142c5be344a805da066fb63f21232664c799ea188a2c5aeab624c22a256230a7 |