Programs used to manage experiments with the document_tracking infrastructure.

Project description

Track news stories from news articles

This project is the front-end of the document_tracking package, which provides algorithms that group news documents (long articles, telegraphic dispatches, etc.) into news stories (groups of documents reporting similar events). It manipulates data in the document_tracking_resources format, and all your datasets should comply with it.

Installation

pip install news_tracking
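
To confirm the installation worked, a small check along the following lines may help. It is only a sketch: it assumes the distribution is registered under the name news_tracking (as in the pip command above) and that the commands listed under Utilities are installed as executables on your PATH. The helper names installed_version and missing_commands are made up for this illustration and are not part of the package.

```python
import shutil
from importlib import metadata

# Command names as listed in the "Utilities" section of this page.
COMMANDS = [
    "news_tracking_miranda",
    "news_tracking_miranda_training",
    "news_tracking_kmeans",
    "news_tracking_evaluation",
]


def installed_version(distribution="news_tracking"):
    """Return the installed version string, or None if it is not installed."""
    try:
        return metadata.version(distribution)
    except metadata.PackageNotFoundError:
        return None


def missing_commands(commands=COMMANDS):
    """Return the commands that cannot be found on the current PATH."""
    return [name for name in commands if shutil.which(name) is None]


if __name__ == "__main__":
    print("news_tracking version:", installed_version())
    print("commands not found on PATH:", missing_commands())
```

An empty list of missing commands suggests the entry points were installed in the active environment; a non-empty list usually means a different environment is active.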

Utilities

Once installed, the package provides new commands that act as a front-end to the utilities you need.

  • news_tracking_miranda: run the Miranda et al. algorithm on your datasets (see the document_tracking package for more information). It requires a trained model, as the algorithm is supervised.
  • news_tracking_miranda_training: train and export a model for the Miranda et al. algorithm.
  • news_tracking_kmeans: run K-Means on a dataset; it can act as a baseline algorithm.
  • news_tracking_evaluation: evaluate the clusterings produced by the algorithms using Standard and BCubed metrics.
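
To give an idea of what the BCubed metrics mentioned above measure, here is a minimal sketch of the textbook BCubed precision, recall, and F1 for a hard clustering. It is not the implementation used by news_tracking_evaluation, whose internals are not described here; the function name bcubed and the dictionary-based input format are assumptions made for this illustration.

```python
from collections import Counter


def bcubed(predicted, gold):
    """Textbook BCubed scores for a hard clustering.

    predicted, gold: dicts mapping each document id to a cluster label
    (predicted story vs. reference story). Both must cover the same ids.
    Returns a (precision, recall, f1) tuple.
    """
    if not gold or set(predicted) != set(gold):
        raise ValueError("both clusterings must cover the same non-empty set of documents")

    docs = list(gold)
    predicted_sizes = Counter(predicted[d] for d in docs)
    gold_sizes = Counter(gold[d] for d in docs)
    # Number of documents sharing both a predicted cluster and a gold cluster.
    overlap = Counter((predicted[d], gold[d]) for d in docs)

    # Per-document precision: share of the predicted cluster with the same gold label.
    precision = sum(
        overlap[(predicted[d], gold[d])] / predicted_sizes[predicted[d]] for d in docs
    ) / len(docs)
    # Per-document recall: share of the gold cluster found in the same predicted cluster.
    recall = sum(
        overlap[(predicted[d], gold[d])] / gold_sizes[gold[d]] for d in docs
    ) / len(docs)
    f1 = 2 * precision * recall / (precision + recall)
    return precision, recall, f1


if __name__ == "__main__":
    gold = {"a": "story1", "b": "story1", "c": "story2", "d": "story2"}
    predicted = {"a": "x", "b": "x", "c": "x", "d": "y"}
    print(bcubed(predicted, gold))
```

Because every document is scored against its own cluster, merging unrelated stories lowers precision, while splitting one story across clusters lowers recall.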

Download files

Source Distributions

No source distribution files available for this release.

Built Distribution

news_tracking-1.0.2.202209211709-py3-none-any.whl (28.6 kB view details)

Hashes for news_tracking-1.0.2.202209211709-py3-none-any.whl
Algorithm    Hash digest
SHA256       ac003c9e72cff183c5a12807c4fc535f68a0180c12364c2f585a3cb3147966f2
MD5          3447a20d00b68f669b91bbc4cb02ec36
BLAKE2b-256  fd3a8f0d5d22c60623707ac8e0def36e10f36adf45cbcadb0e2bb367e3dd8212
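
If you download the wheel manually, you can compare its SHA256 digest with the value listed above. The following is a minimal sketch using only the Python standard library; the helper names sha256_of and matches_expected are made up for this illustration, and the file path in the usage line assumes the wheel sits in the current directory.

```python
import hashlib

# SHA256 digest published above for news_tracking-1.0.2.202209211709-py3-none-any.whl.
EXPECTED_SHA256 = "ac003c9e72cff183c5a12807c4fc535f68a0180c12364c2f585a3cb3147966f2"


def sha256_of(path, chunk_size=1 << 20):
    """Return the hexadecimal SHA256 digest of the file at `path`."""
    digest = hashlib.sha256()
    with open(path, "rb") as handle:
        # Read in chunks so that large files do not need to fit in memory.
        for chunk in iter(lambda: handle.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_expected(path, expected=EXPECTED_SHA256):
    """Return True if the file's SHA256 digest equals the expected digest."""
    return sha256_of(path) == expected.lower()
```

For example, matches_expected("news_tracking-1.0.2.202209211709-py3-none-any.whl") should return True for an intact download of this release.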
