Build and match patterns for semantic role labelling
Project description
Role Pattern
Build and match linguistic patterns for role labelling. Provides an example-driven approach to generate and refine patterns.
Uses graph-based pattern matching, built on SpaCy.
Installation
With pip:
pip install role-pattern-nlp
Example usage
# First, parse a string to create a SpaCy Doc object
import en_core_web_sm
text = "Forging involves the shaping of metal using localized compressive forces."
nlp = en_core_web_sm.load()
doc = nlp(text)
from role_pattern_nlp import RolePatternBuilder
# Provide an example by mapping role labels to tokens
match_example = {
'arg1': [doc[0]], # [Forging]
'pred': [doc[1]], # [involves]
'arg2': [doc[3]], # [shaping]
}
''' Create a dictionary of all the features we want the RolePatternBuilder to have access to
when building and refining patterns '''
feature_dict = {'DEP': 'dep_', 'TAG': 'tag_'}
# Instantiate the pattern builder
role_pattern_builder = RolePatternBuilder(feature_dict)
# Build a pattern. It will use all the features in the feature_dict by default
role_pattern = role_pattern_builder.build(match_example)
# Match against any doc with the role_pattern
matches = role_pattern.match(doc)
print(matches)
'''
[{'arg1': [Forging], 'arg2': [shaping], 'pred': [involves]}]
'''
See examples/ for demonstration as to how to refine a pattern using negative examples.
API
RolePattern
RolePattern.spacy_dep_pattern
The dependency pattern in the form used to create the SpaCy DependencyMatcher object.
RolePattern.token_labels
The list of labels that corresponds to the tokens matched by the pattern.
Built with
- SpaCy - DependencyMatcher
- SpaCy pattern builder
- networkx - Used by SpaCy pattern builder
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
role-pattern-nlp-0.0.8.tar.gz
(8.9 kB
view hashes)
Built Distribution
Close
Hashes for role_pattern_nlp-0.0.8-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 346df79ed7f136cbeb303cbba7867b95d5c0a7ab980f444c0671cb14f02286b3 |
|
MD5 | fed4e5f2f2016ca45c13e11072ec6f36 |
|
BLAKE2b-256 | 57338ea31fec08db2559d3dda5cf7dac5c76fa3389ad30768909b42ab9d08d57 |