5 projects
posnoise
POSNoise: An Effective Countermeasure Against Topic Biases in Authorship Analysis
textunitlib
TextUnitLib: A Python library for extracting diverse text units from textual data
constituent-treelib
A lightweight Python library for constructing, processing, and visualizing constituent trees.
pdf-essentials
An easy-to-use Python library for annotating, manipulating and processing PDF files
alphabetic
A Python module for retrieving script types of writing systems including alphabets, abjads, abugidas, syllabaries, logographs, featurals as well as Latin script codes