Skip to main content

SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic Mistakes

Project description

SESCORE2: Learning Text Generation Evaluation via Synthesizing Realistic Mistakes

SESCORE2, is a SSL method to train a metric for general text generation tasks without human ratings. We develop a technique to synthesize candidate sentences with varying levels of mistakes for training. To make these self-constructed samples realistic, we introduce retrieval augmented synthesis on anchor text; It outperforms SEScore in four text generation tasks with three languages (The overall kendall correlation improves 14.3%).

Paper: https://arxiv.org/abs/2212.09305

Author Email: wendaxu@cs.ucsb.edu

Maintainer Email: zihan_ma@ucsb.edu

Install all dependencies:

```
pip install SEScore2
```

Instructions to score sentences using SEScore2:

Currently, the PyPI version only support English Checkpoint. To run SEScore2 for text generation evaluation:

```
from sescore2 import SEScore2

scorer = SEScore2('en') # Download and load in metric with specified language, en (English), de (German), ja ('Japanese')

refs = ["Jova becomes Western Hemisphere's strongest hurricane so far in 2023 ... for now", "Jova becomes Western Hemisphere's strongest hurricane so far in 2023 ... for now"]

outs = ["Jova set to become Western Hemisphere's most powerful hurricane in 2023...so far", "Jova set to become Western Hemisphere's weakest hurricane in 2023"]

scores_ls = scorer.score(refs, outs, 1)
```

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sescore2-1.0.1.tar.gz (75.8 kB view details)

Uploaded Source

File details

Details for the file sescore2-1.0.1.tar.gz.

File metadata

  • Download URL: sescore2-1.0.1.tar.gz
  • Upload date:
  • Size: 75.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.8.10

File hashes

Hashes for sescore2-1.0.1.tar.gz
Algorithm Hash digest
SHA256 10bc341e87a248c0878bf1f8a89e72cfe72e2824bc4626d2a115e3ecd08786f2
MD5 3e130752d28f1b6c66dfc1db8edd9865
BLAKE2b-256 674fd66b29a967c1fe8e36d1b26a27300905f3fd63df0b98ceda9bd0c64866b6

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page