Skip to main content

Full Python ROUGE Score Implementation (not a wrapper)

Project description

# Rouge
*A full Python librarie for the ROUGE metric [(paper)](http://www.aclweb.org/anthology/W04-1013).*

### Disclaimer
This implementation is independant from the "official" ROUGE script (aka. `ROUGE-155`).
Results may be *slighlty* different, see [discussions in #2](https://github.com/pltrdy/rouge/issues/2).

## Quickstart
#### Clone & Install
```shell
git clone https://github.com/pltrdy/rouge
cd pyrouge
python setup.py install
# or
pip install -U .
```
or from pip:
```
pip install rouge
```
#### Use it from the shell (JSON Output)
```
$rouge -h
usage: rouge [-h] [-f] [-a] hypothesis reference

Rouge Metric Calculator

positional arguments:
hypothesis Text of file path
reference Text or file path

optional arguments:
-h, --help show this help message and exit
-f, --file File mode
-a, --avg Average mode

```

e.g.


```shell
# Single Sentence
rouge "transcript is a written version of each day 's cnn student" \
"this page includes the show transcript use the transcript to help students with"

# Scoring using two files (line by line)
rouge -f ./tests/hyp.txt ./ref.txt

# Avg scoring - 2 files
rouge -f ./tests/hyp.txt ./ref.txt --avg
```

#### As a library

###### Score 1 sentence

```python
from rouge import Rouge

hypothesis = "the #### transcript is a written version of each day 's cnn student news program use this transcript to he lp students with reading comprehension and vocabulary use the weekly newsquiz to test your knowledge of storie s you saw on cnn student news"

reference = "this page includes the show transcript use the transcript to help students with reading comprehension and vocabulary at the bottom of the page , comment for a chance to be mentioned on cnn student news . you must be a teac her or a student age # # or older to request a mention on the cnn student news roll call . the weekly newsquiz tests students ' knowledge of even ts in the news"

rouge = Rouge()
scores = rouge.get_scores(hypothesis, reference)
```

*Output:*

```json
{
"rouge-1": {
"f": 0.5238095189484127,
"p": 0.6285714285714286,
"r": 0.4489795918367347
},
"rouge-2": {
"f": 0.27027026566025497,
"p": 0.375,
"r": 0.2112676056338028
},
"rouge-l": {
"f": 0.28711800978275975,
"p": 0.4418604651162791,
"r": 0.25675675675675674
}
}
```

###### Score multiple sentences
```python
import json
from rouge import Rouge

# Load some sentences
with open('./tests/data.json') as f:
data = json.load(f)

hyps, refs = map(list, zip(*[[d['hyp'], d['ref']] for d in data]))
rouge = Rouge()
scores = rouge.get_scores(hyps, refs)
# or
scores = rouge.get_scores(hyps, refs, avg=True)
```

*Output (`avg=False`)*: a list of `n` dicts:

```
{"rouge-1": {"f": _, "p": _, "r": _}, "rouge-2" : { .. }, "rouge-3": { ... }}
```


*Output (`avg=True`)*: a single dict with average values:

```
{"rouge-1": {"f": _, "p": _, "r": _}, "rouge-2" : { ..     }, "rouge-3": { ... }}
```

###### Score two files (line by line)
Given two files `hyp_path`, `ref_path`, with the same number (`n`) of lines, calculate score for each of this lines, or, the average over the whole file.

```python
from rouge import FilesRouge

files_rouge = FilesRouge(hyp_path, ref_path)
scores = files_rouge.get_scores()
# or
scores = files_rouge.get_scores(avg=True)
```

**Note** that you can avoid consuming too much memory by using `batch_line=l`. This way, the script will read only `l` lines at a time. (otherwise it loads the whole files).


Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rouge-0.3.1.tar.gz (10.8 kB view details)

Uploaded Source

Built Distribution

rouge-0.3.1-py3-none-any.whl (7.4 kB view details)

Uploaded Python 3

File details

Details for the file rouge-0.3.1.tar.gz.

File metadata

  • Download URL: rouge-0.3.1.tar.gz
  • Upload date:
  • Size: 10.8 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No

File hashes

Hashes for rouge-0.3.1.tar.gz
Algorithm Hash digest
SHA256 f131c13661f96a048df4d372d83598a65efbbfaeafed491ee96313cc6aa7364b
MD5 0d0220fcfbecd7a2f0c3fa0639edef1a
BLAKE2b-256 2f9b0a857f3811f9095ec909cdc513db42ac5dbb983d2a86c0f65d47914cab1b

See more details on using hashes here.

File details

Details for the file rouge-0.3.1-py3-none-any.whl.

File metadata

File hashes

Hashes for rouge-0.3.1-py3-none-any.whl
Algorithm Hash digest
SHA256 96948febb4506c5a203c47d55e5f83b82baeb230ff84ae8fcf15331ee0094fc8
MD5 74eb41847184a836410d47ce88e5aa94
BLAKE2b-256 8f89af359c22e1d858e0299d4cc9219f36b504817c9797acad23081247867845

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page