Module provides implementation of multilingual crisis social media summarization model.
Project description
CLiQS Python module
CLiQS Python module provides implementation of multilingual crisis social media summarization model.
Please, if you use CLiQS for your research consider citing:
Fedor Vitiugin, Carlos Castillo: Cross-Lingual Query-Based Summarization of Crisis-Related Social Media: An Abstractive Approach Using Transformers. In ACM Hypertext 2022. ACM Press. https://doi.org/10.1145/3511095.3531279
Installation
- Install the module via pip:
pip install cliqs
- Download LASER and CLiQS models:
python -m laserembeddings download-models
python -m cliqs download-models
- Before running the script, please check installation of SpaCy models for language that you plan to use.
python -m spacy download fr_core_news_sm # for French
Test use
Download test data 'example.csv' file and put in the current directory.
Example of use:
import pandas as pd
from cliqs import CliqSum
sum = CliqSum()
tweets = pd.read_csv('example.csv')
summary = sum.summarize(tweets, 'Damage', 'fr')
print(summary)
cyclone seroja a touché terre en Australie, entre Kalbarri et northampton, l'oeil est encore bien dessiné mais devrait rapidement se déstructurer. cyclone seroja devrait prendre le dessus et atteindre le stade de cyclone 65kt ce WE avant de toucher terre sur côte ouest Australie dimanche soir.
- example.csv —- data file with three columns: id, text, en_text (translation of texts to English).
- Damage -- information category. Current version supports 6 categories: Casualties, Damage, Danger, Sensor, Service aand Weather.
- fr -- language of texts in file.
Resources
Code for training custom models — CLiQS-CM GitHub repository
Dataset for text classification — tweets dataset
Dataset for summary evaluation — summaries dataset
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file cliqs-1.0.0.tar.gz
.
File metadata
- Download URL: cliqs-1.0.0.tar.gz
- Upload date:
- Size: 12.2 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d4e15f424e5b7850f2f82623ee7bbb5856b4b6ac0d2877c8a3cea43e39951754 |
|
MD5 | 3c42c2c3b841c2d617c61000c1927281 |
|
BLAKE2b-256 | 46ae6cbcfa8de7664897490824674e7699b62fc3e0608a1c3d1919fb99ef20c2 |
File details
Details for the file cliqs-1.0.0-py3-none-any.whl
.
File metadata
- Download URL: cliqs-1.0.0-py3-none-any.whl
- Upload date:
- Size: 11.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.8.0
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8ae42c8a2050ad2bf8269603bdde0b93cbc254563b6e900b1b33fc871f61ecba |
|
MD5 | 31023033a7b0004ebbf77fca9b7ef8dd |
|
BLAKE2b-256 | 569a78422e53d50d644bea4d46eec583afeaad205029449680bbbfcb3ab4417a |