Skip to main content

Module provides implementation of multilingual crisis social media summarization model.

Project description

CLiQS Python module

CLiQS Python module provides implementation of multilingual crisis social media summarization model.

Please, if you use CLiQS for your research consider citing:

Fedor Vitiugin, Carlos Castillo: Cross-Lingual Query-Based Summarization of Crisis-Related Social Media: An Abstractive Approach Using Transformers. In ACM Hypertext 2022. ACM Press. https://doi.org/10.1145/3511095.3531279

Installation

  1. Install the module via pip:
pip install cliqs
  1. Download LASER and CLiQS models:
python -m laserembeddings download-models
python -m cliqs download-models
  1. Before running the script, please check installation of SpaCy models for language that you plan to use.
python -m spacy download fr_core_news_sm # for French

Test use

Download test data 'example.csv' file and put in the current directory.

Example of use:

import pandas as pd
from cliqs import CliqSum

sum = CliqSum()

tweets = pd.read_csv('example.csv')
summary = sum.summarize(tweets, 'Damage', 'fr')

print(summary)

cyclone seroja a touché terre en Australie, entre Kalbarri et northampton, l'oeil est encore bien dessiné mais devrait rapidement se déstructurer. cyclone seroja devrait prendre le dessus et atteindre le stade de cyclone 65kt ce WE avant de toucher terre sur côte ouest Australie dimanche soir.

  • example.csv —- data file with three columns: id, text, en_text (translation of texts to English).
  • Damage -- information category. Current version supports 6 categories: Casualties, Damage, Danger, Sensor, Service aand Weather.
  • fr -- language of texts in file.

Resources

Code for training custom models — CLiQS-CM GitHub repository

Dataset for text classification — tweets dataset

Dataset for summary evaluation — summaries dataset

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

cliqs-1.0.0.tar.gz (12.2 kB view details)

Uploaded Source

Built Distribution

cliqs-1.0.0-py3-none-any.whl (11.9 kB view details)

Uploaded Python 3

File details

Details for the file cliqs-1.0.0.tar.gz.

File metadata

  • Download URL: cliqs-1.0.0.tar.gz
  • Upload date:
  • Size: 12.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.0

File hashes

Hashes for cliqs-1.0.0.tar.gz
Algorithm Hash digest
SHA256 d4e15f424e5b7850f2f82623ee7bbb5856b4b6ac0d2877c8a3cea43e39951754
MD5 3c42c2c3b841c2d617c61000c1927281
BLAKE2b-256 46ae6cbcfa8de7664897490824674e7699b62fc3e0608a1c3d1919fb99ef20c2

See more details on using hashes here.

File details

Details for the file cliqs-1.0.0-py3-none-any.whl.

File metadata

  • Download URL: cliqs-1.0.0-py3-none-any.whl
  • Upload date:
  • Size: 11.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.0

File hashes

Hashes for cliqs-1.0.0-py3-none-any.whl
Algorithm Hash digest
SHA256 8ae42c8a2050ad2bf8269603bdde0b93cbc254563b6e900b1b33fc871f61ecba
MD5 31023033a7b0004ebbf77fca9b7ef8dd
BLAKE2b-256 569a78422e53d50d644bea4d46eec583afeaad205029449680bbbfcb3ab4417a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page