Programs used to manage experiments with the document_tracking infrastructure.
Project description
Track news stories from news articles
This project is the front-end of the document_tracking
package, which proposes algorithms to track news documents (long articles, telegraphic dispatches, etc.) into news stories (group of documents reporting similar events). It manipulates data following the document_tracking_resources
format, and all your datasets should complain with it.
Installation
pip install news_tracking
Utilities
Once installed, you will be provided new commands which act as a front-end to the utilities your need.
news_tracking_miranda
: run the Miranda et al. algorithm (see thedocument_tracking
package for more information). on you datasets. It needs to have a model, as the algorithm is supervised.news_tracking_miranda_training
: train and export a model for the Miranda et al. algorithm.news_tracking_kmeans
: run K-Means on a dataset, it can act as a baseline algorithm.news_tracking_evaluation
: evaluate the clustering of algorithms using Standard and BCubed metrics.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distributions
No source distribution files available for this release.See tutorial on generating distribution archives.
Built Distribution
File details
Details for the file news_tracking-1.0.2.202209211710-py3-none-any.whl
.
File metadata
- Download URL: news_tracking-1.0.2.202209211710-py3-none-any.whl
- Upload date:
- Size: 28.6 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.10.6
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | bbcccfeb776627f87ba47b23d43d5ea42470f885a7ca8ef843c9348ca77f270f |
|
MD5 | cc4de2c9e07404d79f62df23f942b6d5 |
|
BLAKE2b-256 | 01d07f27d4d92096fbceeaafe473a1b9ba169ef5710a49ee7af29056fb9a5227 |