29 projects
python3-timbl
Python 3 language binding for the Tilburg Memory-Based Learner
stam
STAM is a library for dealing with standoff annotations on text, this is the python binding.
analiticcl
Analiticcl is an approximate string matching or fuzzy-matching system that can be used to find variants for spelling correction or text normalisation
FoLiA-tools
FoLiA-tools contains various Python-based command line tools for working with FoLiA XML (Format for Linguistic Annotation)
sesdiff
Generates a shortest edit script (Myers' diff algorithm) to indicate how to get from the strings in column A to the strings in column B. Also provides the edit distance (levenshtein). This is the Python binding.
FoLiA
An extensive library for processing FoLiA documents. FoLiA stands for Format for Linguistic Annotation and is a very rich XML-based format used by various Natural Language Processing tools.
python-ucto
This is a Python binding to the tokenizer Ucto. Tokenisation is one of the first step in almost any Natural Language Processing task, yet it is not always as trivial a task as it appears to be. This binding makes the power of the ucto tokeniser available to Python. Ucto itself is a regular-expression based, extensible, and advanced tokeniser written in C++ (https://languagemachines.github.io/ucto).
FoLiA-Linguistic-Annotation-Tool
FLAT is a web-based linguistic annotation environment based around the FoLiA format (https://proycon.github.io/folia), a rich XML-based format for linguistic annotation. Flat allows users to view annotated FoLiA documents and enrich these documents with new annotations, a wide variety of linguistic annotation types is supported through the FoLiA paradigm.
CodeMetaPy
Generate and manage CodeMeta software metadata
CLAM
Turns command-line tools into fully-fledged RESTful webservices with an auto-generated web-interface for human end-users.
Spacy2FoLiA
Library that adds FoLiA (format for linguistic annotation) support to spaCy
foliadocserve
The FoLiA Document Server is a backend HTTP service to interact with documents in the FoLiA format, a rich XML-based format for linguistic annotation (http://proycon.github.io/folia). It provides an interface to efficiently edit FoLiA documents through the FoLiA Query Language (FQL).
python-frog
Python binding to Frog, an NLP suite for Dutch doing part-of-speech tagging, lemmatisation, morphological analysis, named-entity recognition, shallow parsing, and dependency parsing.
codemeta-server
Web API serving codemeta software metadata using codemeta and schema.org, provides a SPARQL endpoint and also offers a human web-interface
Glem
GLEM is a lemmatizer for Ancient Greek.
colibricore
Colibri Core is an NLP tool as well as a C++ and Python library (all included in this package) for working with basic linguistic constructions such as n-grams and skipgrams (i.e patterns with one or more gaps, either of fixed or dynamic size) in a quick and memory-efficient way. At the core is the tool ``colibri-patternmodeller`` which allows you to build, view, manipulate and query pattern models.
codemeta2html
Convert software metadata in codemeta to html for visualisation
piereling
Piereling is a webservice and web-application to convert between a variety of document formats, mostly from and to FoLiA XML. It is intended for NLP pipelines.
CLAMServices
A collection of CLAM Webservices for various of our NLP tools
lamastats
Simple visitor analytics application for presenting usage statistics on several components included in LaMachine.
Gecco
Generic Environment for Context-Aware Correction of Orthography
hanzigrid
Generate a Chinese character grid for study
WikiEnte
Entity extraction using DBPedia through spotlight
PyNLPl
PyNLPl, pronounced as 'pineapple', is a Python library for Natural Language Processing. It contains various modules useful for common, and less common, NLP tasks. PyNLPl contains modules for basic tasks, clients for interfacting with server, and modules for parsing several file formats common in NLP, most notably FoLiA.
NAFFoLiAPy
Converters between two formats for linguistic annotation: FoLiA and NAF
BabelEnte
Entity extractioN, Translation and Evaluation using BabelFy
clin28tools
Scripts for the CLIN28 Shared Task on spelling correction
BabelPy
BabelFy API Client
python-timbl
Python language binding for the Tilburg Memory-Based Learner