Skip to main content

Regular expression module for forensics and big data

Project description

This is a general purpose regular expression matching engine, with particularly good performance for searching large byte buffers, for example large files or raw disk images, using multiple encodings. It was writted to solve perfomance issues in big data extraction tasks including artefact discovery for digital forensics.

jsre is:

  • Fast: When matching complex patterns or a large number of keywords on large input buffers it is substantially faster than current regular expression engines. jsre is designed to scale well in the face of complexity: its relative performance improves with increasing pattern complexity.

  • Unicode Encoding Neutral: A regular expression is written as a string, the user separately specifies what encodings are to be searched when the expression is compiled. All Python codecs are supported and the capability provided is compilant with Unicode regular expression level 1 requirements.

  • Deployable: The compiled matching engine has a small memory footprint limited to below 10MByte, allowing processing to be easily distributed across multiple CPUs.

  • Portable: The software uses a single Python type extension and only standard C and Python libraries. Installs with pip on Windows or Linux.

jsre includes additional functions that are specific to its intended application, they include alternative expression indexing, the processing of overlapped buffers and the specification of stride and offset for search anchors (e.g. for searching at fixed positions in disk sectors).

Version 1.1 is based on a new library for managing character classes and associated set operations. This has significantly improved compilation speeds, allowing the use of this module in more geneal purpos re applications.

As far as possible jsre provides a similar interface to the standard Python re module. See documentation examples for an introduction to the module and its application-specific features. This documentation assumes that the reader is familiar with regular expressions and their use; newcomers may find it easier to first read the Python re documentation and tutorials.

Contact: howard.chivers@york.ac.uk

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

jsre-1.1.1.zip (4.9 MB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

jsre-1.1.1-cp35-cp35m-win_amd64.whl (4.9 MB view details)

Uploaded CPython 3.5mWindows x86-64

File details

Details for the file jsre-1.1.1.zip.

File metadata

  • Download URL: jsre-1.1.1.zip
  • Upload date:
  • Size: 4.9 MB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.11.1 setuptools/20.10.1 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/3.5.2

File hashes

Hashes for jsre-1.1.1.zip
Algorithm Hash digest
SHA256 c8c4018b7a87eaedd1f53d595a8ec05d1a39d761568524f62c9032da1bf83632
MD5 22a251f2e16f5e00c564c505cfeb2c95
BLAKE2b-256 ab8eb2205c69f71101cda6ba43b53ffea5f69df1735e3f0a4b4e3fd5318d4f51

See more details on using hashes here.

File details

Details for the file jsre-1.1.1-cp35-cp35m-win_amd64.whl.

File metadata

  • Download URL: jsre-1.1.1-cp35-cp35m-win_amd64.whl
  • Upload date:
  • Size: 4.9 MB
  • Tags: CPython 3.5m, Windows x86-64
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/1.11.0 pkginfo/1.4.2 requests/2.11.1 setuptools/20.10.1 requests-toolbelt/0.8.0 tqdm/4.23.4 CPython/3.5.2

File hashes

Hashes for jsre-1.1.1-cp35-cp35m-win_amd64.whl
Algorithm Hash digest
SHA256 548fcae45cd84b40978449aac47a1122ba094d2b7cfd30924b25d817d77cd0e4
MD5 432e4cb77faf4b05ed3ccb6702a54b7c
BLAKE2b-256 b84d54f4640c3c1b1386f2d8d0f7ca28b8e6961d23fbb532d4d51bebd030f714

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page