An extensive library for processing FoLiA documents. FoLiA stands for Format for Linguistic Annotation and is a very rich XML-based format used by various Natural Language Processing tools.
Project description
This Python module provides an extensive library for parsing, creating and otherwise processing documents in the Format for Linguistic Annotation, aka FoLiA. It has been in active development since 2010 and is used by numerous Natural Language Processing (NLP) tools.
This library used to be part of PyNLPL (pynlpl.formats.folia), but was migrated to this standalone library in spring 2019.
Acknowledgement
FoLiA development is funded in the scope of the larger CLARIN-NL project and its successor CLARIAH.
The following modules are available:
folia.main - The main library
folia.setdefinition - A module for FoLiA Set Definitions
folia.fql - Support for the FoLiA Query Language (FQL)
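As a rough illustration of how folia.main might be used, the sketch below builds a one-sentence document in memory and serialises it to XML. The class and method names (Document, Text, Sentence, Word, append, xmlstring) reflect the library's API as understood here and should be checked against the API documentation; build_example_xml and its sample tokens are hypothetical. The import is guarded so that the sketch still loads when the package is not installed.

```python
# Guarded import: an assumption made so this sketch degrades gracefully
# when the folia package is absent.
try:
    from folia import main as folia
except ImportError:
    folia = None


def build_example_xml(words=("Hello", "world", ".")):
    """Build a one-sentence FoLiA document and return it as an XML string.

    Returns None when the folia package is not installed.
    The helper name and the sample tokens are illustrative only.
    """
    if folia is None:
        return None
    # Create an empty document with a (hypothetical) ID.
    doc = folia.Document(id="example")
    # Add the structural elements: text body, then a sentence inside it.
    text = doc.append(folia.Text)
    sentence = text.append(folia.Sentence)
    # Add one word token per input string.
    for token in words:
        sentence.append(folia.Word, token)
    # Serialise the whole document to FoLiA XML.
    return doc.xmlstring()


if __name__ == "__main__":
    xml = build_example_xml()
    print(xml if xml is not None else "folia is not installed")
```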
Installation
Download and install the latest stable version directly from the Python Package Index with pip install folia (or pip3 for Python 3.7+ on most systems). For global installations prepend sudo.
Alternatively, clone this repository and run python setup.py install (or python3 setup.py install for Python 3 on most systems). Prepend sudo for global installations.
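To confirm that an installation succeeded, a small check along the following lines may help. It uses only the Python standard library and looks up the three modules named in this README; the helper names module_available and check_installation are hypothetical and not part of the library.

```python
import importlib.util

# Module names as listed in this README.
FOLIA_MODULES = ("folia.main", "folia.setdefinition", "folia.fql")


def module_available(name: str) -> bool:
    """Return True if the module `name` can be located.

    For a dotted name, find_spec imports the parent package first and
    raises ModuleNotFoundError when that parent is missing.
    """
    try:
        return importlib.util.find_spec(name) is not None
    except ModuleNotFoundError:
        return False


def check_installation(names=FOLIA_MODULES) -> dict:
    """Map each module name to whether it could be located."""
    return {name: module_available(name) for name in names}


if __name__ == "__main__":
    for name, found in check_installation().items():
        print(f"{name}: {'found' if found else 'missing'}")
```

If every module is reported as missing, the package was most likely installed for a different Python interpreter than the one running the check.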
The Python FoLiA library is also included in our LaMachine distribution.
Documentation
API documentation and tutorials can be found here.