Skip to main content

Easy-to-use, high-quality target-dependent sentiment classification for English news articles

Project description

NewsSentiment: easy-to-use, high-quality target-dependent sentiment classification for news articles

NewsSentiment is an easy-to-use Python library that achieves state-of-the-art performance for target-dependent sentiment classification on news articles. NewsSentiment uses the currently best performing targeted sentiment classifier for news articles. In contrast to regular sentiment classification, targeted sentiment classification allows you to provide a target in a sentence. Only for this target, the sentiment is then predicted. This is more reliable in many cases, as demonstrated by the following simplistic example: "I like Bert, but I hate Robert."

We designed NewsSentiment to serve as an easy-to-use wrapper around the sophisticated GRU-TSC model, which was trained on the NewsMTSC dataset consisting of more than 10k labeled sentences sampled from political news articles. More information on the dataset and the model can be found here. The dataset, the model, and its source code can be viewed in our GitHub repository.

Installation

It's super easy, we promise!

You just need a Python 3.8 environment. See here if you don't have Python or a different version (run python --version in a terminal to see your version). Then run:

pip3 install NewsSentiment        # without cuda support (choose this if you don't know what cuda is)
pip3 install NewsSentiment[cuda]  # with cuda support

You're all set now :-)

Target-dependent Sentiment Classification

Note that using NewsSentiment the first time will take a few minutes because it needs to download the fine-tuned language model. Please do not abort this initial download. Since this is a one-time process, future use of NewsSentiment will be much faster.

from NewsSentiment import TargetSentimentClassifier
tsc = TargetSentimentClassifier()

data = [
    ("I like ", "Peter", " but I don't like Robert."),
    ("", "Mark Meadows", "'s coverup of Trump’s coup attempt is falling apart."),
]

sentiments = tsc.infer(targets=data)

for i, result in enumerate(sentiments):
    print("Sentiment: ", i, result[0])

This method will internally split the data into batches of size 16 for increased speed. You can adjust the batch size using the batch_size parameter, e.g., batch_size=32.

Alternatively, you can also use the infer_from_text method to infer sentiment for a single target:

sentiment = tsc.infer_from_text("I like " ,"Peter", " but I don't like Robert.")
print(sentiment[0])

How to identify a person in a sentence?

In case your data is not separated as shown in the examples above, i.e., in three segments, you will need to identify one (or more) targets first. How this is done best depends on your project and analysis task but you may, for example, use NER. This example shows a simple way of doing so.

Acknowledgements

Thanks to Tilman Hornung for adding the batching functionality and various other improvements.

How to cite

If you use the dataset or model, please cite our paper (PDF):

@InProceedings{Hamborg2021b,
  author    = {Hamborg, Felix and Donnay, Karsten},
  title     = {NewsMTSC: (Multi-)Target-dependent Sentiment Classification in News Articles},
  booktitle = {Proceedings of the 16th Conference of the European Chapter of the Association for Computational Linguistics (EACL 2021)},
  year      = {2021},
  month     = {Apr.},
  location  = {Virtual Event},
}

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

NewsSentiment-1.2.28.tar.gz (889.3 kB view details)

Uploaded Source

Built Distribution

NewsSentiment-1.2.28-py3-none-any.whl (919.0 kB view details)

Uploaded Python 3

File details

Details for the file NewsSentiment-1.2.28.tar.gz.

File metadata

  • Download URL: NewsSentiment-1.2.28.tar.gz
  • Upload date:
  • Size: 889.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.18

File hashes

Hashes for NewsSentiment-1.2.28.tar.gz
Algorithm Hash digest
SHA256 f26da590b3bad6f63225668c2eaf63e1dcae8c1923f81e4fa3082935ef5141e9
MD5 6bff3466d58976cf4e942043a06db2af
BLAKE2b-256 d164ae206dd6e24c0da623ec1ef6f51653361cb6d8683b980563c4fdfa92d68c

See more details on using hashes here.

File details

Details for the file NewsSentiment-1.2.28-py3-none-any.whl.

File metadata

File hashes

Hashes for NewsSentiment-1.2.28-py3-none-any.whl
Algorithm Hash digest
SHA256 5a0e695a521b69a7535de7e12b06fb17a0e70d19d8afb9e49ff7a5fb783f516d
MD5 54982fe29d2501371fc2b7f793649450
BLAKE2b-256 ed2da27a9b0cec5f06b0d20ea38f48c8c1948aa0d26fdbe2e4e058decea67e98

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page