A collection of python utilities and scripts for identifying named entities in text and XML
Python scripts and utilities for looking up names and linking them to authoritative identifiers. See the top-level namedropper repository for more information.
Currently uses DBpedia Spotlight for recognition of named entities in text, with support for matching identified DBpedia resources (currently only for Persons) with the equivalent resource in VIAF (Virtual International Authority File).
We recommend the use of pip to install the latest released version of this package and its dependencies:
pip install namedropper
This will also make the lookup-names and count-nametags scripts available.
More detailed documentation, including script usage information, is available at http://namedropper.readthedocs.org/
NameDropper Python module and scripts are distributed under the Apache 2.0 License.
To install dependencies for your local check out of the code, run pip install in the namedropper-py directory (the use of virtualenv is recommended):
pip install -e .
If you want to run unit tests or build sphinx documentation, you will also need to install development dependencies:
pip install namedropper[dev]
To run all unit tests:
nosetests # for normal development nosetests --with-coverage --cover-package=namedropper --cover-xml --with-xunit # for continuous integration
To run unit tests for a specific module, use syntax like this:
To generate sphinx documentation:
cd doc make html