A configurable, language independent coreferencer and (non) named entity recognizer
Project description
eXternally configurable REference and Non Named Entity Recognizer
https://corpling.uis.georgetown.edu/xrenner/
Usage:
xrenner.py [options] INFILE (> OUTFILE)
Options:
- -m, --model
input model directory name, in models/, default ‘eng’
- -o, --output
output format, default: sgml; alternatives: html, paula, webanno, conll, onto, unittest
- -x, --override
specify a section in the model’s override.ini file with alternative settings, default=None; possible values such as ‘OntoNotes’, ‘GUM’
- -v, --verbose
output run time and summary
- -t, --test
run unit tests and quit
- --version
print xrenner version and quit
Input format:
1 Wikinews Wikinews NP NNP _ 2 nsubj _ _ 2 interviews interview VVZ VBZ _ 0 root _ _ 3 President president NN NN _ 2 dobj _ _ 4 of of IN IN _ 3 prep _ _ 5 the the DT DT _ 7 det _ _ 6 International international NP NNP _ 7 amod _ _ 7 Brotherhood brotherhood NP NNP _ 4 pobj _ _ 8 of of IN IN _ 7 prep _ _ 9 Magicians magician NPS NNPS _ 8 pobj _ _ 1 Wednesday Wednesday NP NNP _ 0 root _ _ 2 , , , , _ 0 punct _ _ 3 October October NP NNP _ 4 nn _ _ 4 9 9 CD CD _ 1 appos _ _ 5 , , , , _ 0 punct _ _ 6 2013 2013 CD CD _ 3 tmod _ _
Installation:
Download and use the main xrenner.py script on an input file, or install from PyPI and import as a module:
> pip install xrenner
Examples:
python xrenner.py example_in.conll10 > example_out.sgml
python xrenner.py -x GUM example_in.conll10 > example_out.sgml
python xrenner.py -o conll example_in.conll10 > example_out.conll
python xrenner.py -m eng -o conll example_in.conll10 > example_out.conll
Module usage:
from xrenner import Xrenner
xrenner = Xrenner()
my_conllx_result = some_parser.parse("John visited Spain. His visit went well.")
print xrenner.analyze(my_conllx_result,"sgml")
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.