GATE NLP implementation in Python.
Python library gatenlp
This package represents the basic elements of text processing and NLP in a way that is very similar to the Java GATE NLP framework. It supports manipulating GateNLP documents and interacting with Java GATE and the GATE Python plugin.
NOTE: The previous PyPI project "gatenlp" has moved to gatenlphiltlab.
Overview
This package is a Python implementation of text processing and NLP similar to
Java GATE NLP.
Currently it is possible to load GATE documents from their BdocJS (JSON) representation
or to create GATE documents from scratch. Either way, the result is an object of type gatenlp.Document, which offers an API for adding, retrieving and changing stand-off annotations and document features in much the same way as in Java GATE.
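To illustrate what "stand-off annotations and document features" means, here is a minimal, self-contained sketch. It does not use the gatenlp API: the names SimpleDocument, Annotation, add_annotation and covered_text are hypothetical and exist only for this example. The point it shows is that annotations refer to the text by character offsets and never modify the text itself.

```python
from dataclasses import dataclass, field


@dataclass
class Annotation:
    """Hypothetical stand-off annotation: offsets into the text, a type, and features."""
    start: int
    end: int
    ann_type: str
    features: dict = field(default_factory=dict)


@dataclass
class SimpleDocument:
    """Hypothetical stand-in for a document; this is NOT gatenlp.Document."""
    text: str
    features: dict = field(default_factory=dict)
    annotations: list = field(default_factory=list)

    def add_annotation(self, start, end, ann_type, **features):
        # Offsets must lie inside the text; the text itself is never changed.
        if not 0 <= start <= end <= len(self.text):
            raise ValueError("annotation offsets are outside the document text")
        ann = Annotation(start, end, ann_type, dict(features))
        self.annotations.append(ann)
        return ann

    def covered_text(self, ann):
        # Recover the annotated span by slicing the unchanged text.
        return self.text[ann.start:ann.end]


doc = SimpleDocument("GATE runs on Java.")
doc.features["lang"] = "en"                       # a document feature
token = doc.add_annotation(0, 4, "Token", kind="word")
print(doc.covered_text(token))                    # GATE
```

Because the annotation only stores offsets, several annotations can overlap or nest over the same span without interfering with each other.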
This document representation can also optionally track all changes
made to the document in a "change log" (a gatenlp.ChangeLog
instance).
Such changes can later be applied to Java GATE documents.
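The idea of recording changes and applying them elsewhere later can be sketched as follows. This is an illustration only and not the gatenlp.ChangeLog API: the class LoggedFeatures, the function apply_changes and the command names "feature:set" and "feature:remove" are assumptions made up for this example.

```python
class LoggedFeatures:
    """Hypothetical feature map that records every change it receives."""

    def __init__(self, changelog=None):
        self._data = {}
        self.changelog = changelog  # None means changes are not tracked

    def set(self, name, value):
        self._data[name] = value
        if self.changelog is not None:
            self.changelog.append(
                {"command": "feature:set", "name": name, "value": value}
            )

    def remove(self, name):
        del self._data[name]
        if self.changelog is not None:
            self.changelog.append({"command": "feature:remove", "name": name})

    def as_dict(self):
        return dict(self._data)


def apply_changes(changelog, target):
    """Replay the recorded changes, in order, on another feature dict."""
    for change in changelog:
        if change["command"] == "feature:set":
            target[change["name"]] = change["value"]
        elif change["command"] == "feature:remove":
            target.pop(change["name"], None)
    return target


log = []
features = LoggedFeatures(changelog=log)
features.set("lang", "en")
features.set("source", "web")
features.remove("source")

# Replaying the log on an empty target reproduces the final state.
replayed = apply_changes(log, {})
print(replayed)  # {'lang': 'en'}
```

Keeping the log as an ordered list of plain dictionaries makes it easy to serialize, which is what a separate process would need in order to apply the same changes to its own copy of the document.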
This library also implements interaction with a Java GATE process in two different ways:
- The Java GATE Python plugin can invoke a Python process to annotate GATE documents with Python code
- The Python code can remote-control a Java GATE instance
More information and documentation