9 projects
mask-rcnn-documents
wikivents
A simple Python package to represent events from Wikipedia and Wikidata resources.
document-tracking-resources
Package to load the resources required by the document_tracking package.
document-tracking
Algorithms to track documents and build news stories from them. It implements the Miranda et al. (2018) algorithm, as well as other alternatives and baselines to track documents.
news-tracking
Programs used to manage experiments with the document_tracking infrastructure.
compute-tf-idf-vectors
Utility to compute sparse TF-IDF vector representation for dataset in the document_tracking_resources format based on a feature file.
compute-dense-vectors
Utility to compute dense vector representation for dataset in the document_tracking_resources format base on dense transformers models.
document-processing
Pre-process documents for Natural Language Processing using spaCy models
newseyevent
A simple Python package to build event based requests for the NewsEye API