An extension for spaCy, making pattern matching as flexible as using regular expressions.
Project description
spaCyEx
spaCyEx is a powerful extension for spaCy, designed to make pattern matching as flexible and easy as using regular expressions. It builds on spaCy's Matcher, adding a more accessible syntax for defining complex patterns. spaCyEx allows intuitive, detailed text pattern specifications, well suited to extracting fine-grained linguistic features from text.
Installation
You can install spaCyEx via pip:
pip install spacyex
Features
- Create complex matching patterns using a simple syntax.
- Supports all dictionary properties that are used by spaCy's Matcher.
- Easy integration with existing spaCy pipelines.
Usage
Here is a simple example to get started with spaCyEx:
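import spacy
import spacyex as se
# Load a spaCy pipeline (install the model first: python -m spacy download en_core_web_sm).
nlp = spacy.load("en_core_web_sm")
text = "John Smith runs fast, but Jacob Smith walks slowly."
# Two person-entity tokens, then a token whose lemma is "run" or "walk", then an adverb.
pattern = "(ent_type=person|op={2}) (lemma=in[run,walk]) (pos=ADV)"
results = se.search(pattern, text, nlp)
# Each result is indexed as: matched text object, start position, end position.
for match in results:
    print(match[0].text, "Start:", match[1], "End:", match[2])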
This code matches token sequences in the text against the pattern, which combines named-entity types, lemmas, and part-of-speech tags.
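To make the pattern syntax more concrete, the sketch below shows one way a pattern string of this shape could be translated into the list-of-dictionaries form that spaCy's Matcher accepts. It is an illustration only, not spaCyEx's actual implementation: the function name `pattern_to_matcher_dicts`, the parsing rules, and the upper-casing of entity labels are all assumptions inferred from the example above.

```python
import re


def pattern_to_matcher_dicts(pattern: str) -> list[dict]:
    """Hypothetical helper: turn "(key=value|key=value) ..." into Matcher-style token dicts."""
    token_specs = []
    # Each parenthesised group describes one token.
    for group in re.findall(r"\(([^()]*)\)", pattern):
        spec = {}
        # Attributes inside a group are separated by "|".
        for item in group.split("|"):
            key, _, value = item.partition("=")
            key = key.strip().upper()
            value = value.strip()
            if value.startswith("in[") and value.endswith("]"):
                # "in[a,b]" becomes the Matcher set-membership form {"IN": [...]}.
                members = [m.strip() for m in value[3:-1].split(",")]
                spec[key] = {"IN": members}
            elif key == "ENT_TYPE":
                # Assumption: entity labels are upper-cased to match spaCy's label names.
                spec[key] = value.upper()
            else:
                spec[key] = value
        token_specs.append(spec)
    return token_specs


matcher_pattern = pattern_to_matcher_dicts(
    "(ent_type=person|op={2}) (lemma=in[run,walk]) (pos=ADV)"
)
print(matcher_pattern)
```

The resulting list has one dictionary per token, which is the shape passed to `Matcher.add` in spaCy. Writing the same rule by hand as nested dictionaries is what the compact string syntax is meant to avoid.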
Roadmap
- Support for all dictionary properties in patterns.
- Additional utilities and helper functions for more complex pattern scenarios.
File details
Details for the file spacyex-0.0.2.tar.gz.
File metadata
- Download URL: spacyex-0.0.2.tar.gz
- Upload date:
- Size: 2.9 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | 6331f90f6cdedaa1e0462fbe4d4179db06c3e545dc009deb3dfa860c87f95a35
MD5 | e7e3fa1b006785eb650b32792b168b0d
BLAKE2b-256 | 96412f569240372beb4567e797265e9fae66f89e12187516985edc2af5cbece4
File details
Details for the file spacyex-0.0.2-py3-none-any.whl.
File metadata
- Download URL: spacyex-0.0.2-py3-none-any.whl
- Upload date:
- Size: 2.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.11.5
File hashes
Algorithm | Hash digest
---|---
SHA256 | 105f7d5c553efe62d8db64fbb26745d2b76a139d149fbf40d79497d594d344b1
MD5 | dd8c8de3486bb473be0e08217f84b1ec
BLAKE2b-256 | f42521b1548540af49a3192924957e4d420b4c8d946b1230abdcef9e83103e2b