A Metric for Entity-Aware Radiology Report Generation

RaTEScore

RaTEScore: A Metric for Entity-Aware Radiology Text Similarity

Overview

RaTEScore is a novel, entity-aware metric to assess the quality of medical reports generated by AI models. It emphasizes crucial medical entities such as diagnostic outcomes and anatomical details, and is robust against complex medical synonyms and sensitive to negation expressions. The evaluations demonstrate that RaTEScore aligns more closely with human preference than existing metrics.

[Figure: illustration of the computation of RaTEScore.]

For more details about our pipeline, please refer to our paper.

Installation

Environment Preparation

pip install RaTEScore

Usage

from RaTEScore import RaTEScore

pred_report = ['There are no intracranial hemorrhages.',
              'The musculature and soft tissues are intact.']

gt_report = ['There is no finding to suggest intracranial hemorrhage.',
            'The muscle compartments are intact.']

ratescore = RaTEScore()
scores = ratescore.compute_score(pred_report, gt_report)
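The scores can then be aggregated over a whole test set. The following is a minimal sketch, assuming `compute_score` returns one floating-point score per (prediction, reference) pair; that return shape is an assumption, not something stated above. The helper `summarize_scores` is hypothetical and not part of the package, and the numbers are made up for illustration.

```python
from statistics import mean


def summarize_scores(scores):
    """Return the mean, min, and max of a list of per-pair scores.

    Hypothetical helper, not part of RaTEScore. Assumes `scores` is a
    non-empty sequence of numbers, one per (prediction, reference) pair.
    """
    if not scores:
        raise ValueError("scores must be non-empty")
    values = [float(s) for s in scores]
    return {"mean": mean(values), "min": min(values), "max": max(values)}


# Made-up values standing in for the output of compute_score.
summary = summarize_scores([0.5, 1.0])
print(summary)
```

Reporting the minimum alongside the mean can help surface individual reports where the prediction diverges sharply from the reference.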

RaTE-NER

The RaTE-NER dataset is a large-scale radiological named entity recognition (NER) dataset. We built it to support the Medical Entity Recognition module of our proposed metric. To download the dataset or find out more about it, please refer to Hugging Face and our paper.

RaTE-Eval

To effectively measure the alignment between automatic evaluation metrics and radiologists' assessments in medical text generation tasks, we have established a comprehensive benchmark, RaTE-Eval, that encompasses three tasks:

  • Sentence-level Human Rating.
  • Paragraph-level Human Rating.
  • Rating on the Synthetic Reports.

To download the benchmark or find out more about it, please refer to Hugging Face and our paper.
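Alignment between a metric and human ratings is commonly quantified with a rank correlation. The sketch below computes Kendall's tau (the tau-a variant, which ignores ties) in plain Python. It is an illustration only: the statistic actually used for RaTE-Eval is described in the paper, not here, and the function name `kendall_tau` and the sample numbers are assumptions.

```python
from itertools import combinations


def kendall_tau(metric_scores, human_scores):
    """Kendall's tau-a between two equally long lists of numbers.

    Counts pairs of items that both lists order the same way
    (concordant) versus the opposite way (discordant). Tied pairs
    count as neither.
    """
    if len(metric_scores) != len(human_scores):
        raise ValueError("both lists must have the same length")
    n = len(metric_scores)
    total_pairs = n * (n - 1) // 2
    if total_pairs == 0:
        raise ValueError("need at least two items")

    concordant = discordant = 0
    paired = zip(metric_scores, human_scores)
    for (m1, h1), (m2, h2) in combinations(paired, 2):
        product = (m1 - m2) * (h1 - h2)
        if product > 0:
            concordant += 1
        elif product < 0:
            discordant += 1
    return (concordant - discordant) / total_pairs


# Made-up metric scores and human ratings for three reports.
print(kendall_tau([0.1, 0.4, 0.8], [1, 2, 3]))
```

A value near 1 means the metric ranks reports in the same order as the human raters, and a value near -1 means it ranks them in the reverse order.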

Default Scheme

By default, the Medical Entity Recognition module uses our version of DeBERTa fine-tuned on the RaTE-NER dataset. It is based on the IOB tagging scheme.
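To illustrate what IOB tagging means in practice, here is a minimal sketch that groups token-level tags into entity spans. In the IOB convention, `B-X` opens an entity of type X, `I-X` continues it, and `O` marks a token outside any entity. The function `iob_to_spans` and the label names `ABNORMALITY` and `ANATOMY` are hypothetical examples; the actual label set and decoding used by the package may differ.

```python
def iob_to_spans(tokens, tags):
    """Group IOB-tagged tokens into (entity_text, entity_type) pairs."""
    spans = []
    words, label = [], None
    for token, tag in zip(tokens, tags):
        prefix, _, kind = tag.partition("-")
        # An I- tag of the same type extends the entity being built.
        if prefix == "I" and label == kind and words:
            words.append(token)
            continue
        # Any other tag closes the entity that was being built.
        if words:
            spans.append((" ".join(words), label))
        if prefix in ("B", "I"):
            # Start a new entity (a stray I- tag is treated like B-).
            words, label = [token], kind
        else:
            words, label = [], None
    if words:
        spans.append((" ".join(words), label))
    return spans


# Hypothetical tags for one sentence; label names are illustrative.
tokens = ["No", "pleural", "effusion", "in", "the", "left", "lung"]
tags = ["O", "B-ABNORMALITY", "I-ABNORMALITY", "O", "O",
        "B-ANATOMY", "I-ANATOMY"]
print(iob_to_spans(tokens, tags))
```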

By default, the Synonym Disambiguation Encoding module uses BioLORD-2023-C; we discuss this choice in our paper. You can also swap in another pretrained BERT model.
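The idea behind synonym-aware encoding is that two phrases with the same meaning should map to nearby vectors. A common way to compare such vectors is cosine similarity, sketched below with tiny hand-written vectors standing in for real embeddings. This is an assumption-laden illustration: the helpers `cosine_similarity` and `best_match` are hypothetical, the vectors are made up, and the exact comparison RaTEScore performs is described in the paper rather than here.

```python
import math


def cosine_similarity(u, v):
    """Cosine of the angle between two equal-length vectors."""
    dot = sum(a * b for a, b in zip(u, v))
    norm_u = math.sqrt(sum(a * a for a in u))
    norm_v = math.sqrt(sum(b * b for b in v))
    if norm_u == 0.0 or norm_v == 0.0:
        return 0.0  # Convention chosen here for zero vectors.
    return dot / (norm_u * norm_v)


def best_match(query, candidates):
    """Return (index, similarity) of the candidate closest to `query`."""
    sims = [cosine_similarity(query, c) for c in candidates]
    best = max(range(len(sims)), key=sims.__getitem__)
    return best, sims[best]


# Toy 2-D vectors; real embeddings would come from the encoder model.
query = [1.0, 0.0]
candidates = [[0.0, 1.0], [2.0, 0.0]]
print(best_match(query, candidates))
```

Because cosine similarity ignores vector length, the second candidate matches the query perfectly even though its magnitude differs.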

Contact

If you have any questions, please feel free to contact zwk0629@sjtu.edu.cn.

Citation

