
A package for NER evaluation


NER-evaluation

This is a Python implementation of the MUC evaluation metrics for NER. See the blog post Evaluation Metrics of Name Entity Recognition for an explanation of the MUC metrics.

Installation

pip install eval4ner

Usage

  1. Evaluate a single prediction
import eval4ner.muc as muc
import pprint
# ground truth
ground_truth = [('PER', 'John Jones'), ('PER', 'Peter Peters'), ('LOC', 'York')]
# NER model prediction
prediction = [('PER', 'John Jones and Peter Peters came to York')]
# input text
text = 'John Jones and Peter Peters came to York'
one_result = muc.evaluate_one(prediction, ground_truth, text)
pprint.pprint(one_result)

Output

{'exact': {'actual': 1,
           'correct': 0,
           'f1_score': 0,
           'incorrect': 1,
           'missed': 2,
           'partial': 0,
           'possible': 3,
           'precision': 0.0,
           'recall': 0.0,
           'spurius': 0},
 'partial': {'actual': 1,
             'correct': 0,
             'f1_score': 0.25,
             'incorrect': 0,
             'missed': 2,
             'partial': 1,
             'possible': 3,
             'precision': 0.5,
             'recall': 0.16666666666666666,
             'spurius': 0},
 'strict': {'actual': 1,
            'correct': 0,
            'f1_score': 0,
            'incorrect': 1,
            'missed': 2,
            'partial': 0,
            'possible': 3,
            'precision': 0.0,
            'recall': 0.0,
            'spurius': 0},
 'type': {'actual': 1,
          'correct': 1,
          'f1_score': 0.5,
          'incorrect': 0,
          'missed': 2,
          'partial': 0,
          'possible': 3,
          'precision': 1.0,
          'recall': 0.3333333333333333,
          'spurius': 0}}
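The four modes differ in what counts as a match: strict requires boundary and type to agree, exact requires the boundary only, partial credits overlapping boundaries, and type requires the correct type with some overlap. The precision and recall above are consistent with the MUC/SemEval convention, in which partial matches count as half. A minimal sketch of that arithmetic, assuming this is the convention eval4ner follows; muc_scores is a hypothetical helper, not part of the package:

def muc_scores(correct, incorrect, partial, missed, spurious):
    # possible: entities in the ground truth; actual: entities the model predicted
    possible = correct + incorrect + partial + missed
    actual = correct + incorrect + partial + spurious
    precision = (correct + 0.5 * partial) / actual if actual else 0.0
    recall = (correct + 0.5 * partial) / possible if possible else 0.0
    f1 = 2 * precision * recall / (precision + recall) if precision + recall else 0.0
    return precision, recall, f1

# counts from the 'partial' mode above: 0 correct, 0 incorrect, 1 partial, 2 missed, 0 spurious
print(muc_scores(0, 0, 1, 2, 0))  # (0.5, 0.1666..., 0.25)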
  2. Evaluate all predictions
import eval4ner.muc as muc
# ground truth
ground_truths = [
    [('PER', 'John Jones'), ('PER', 'Peter Peters'), ('LOC', 'York')],
    [('PER', 'John Jones'), ('PER', 'Peter Peters'), ('LOC', 'York')],
    [('PER', 'John Jones'), ('PER', 'Peter Peters'), ('LOC', 'York')]
]
# NER model prediction
predictions = [
    [('PER', 'John Jones and Peter Peters came to York')],
    [('LOC', 'John Jones'), ('PER', 'Peters'), ('LOC', 'York')],
    [('PER', 'John Jones'), ('PER', 'Peter Peters'), ('LOC', 'York')]
]
# input texts
texts = [
    'John Jones and Peter Peters came to York',
    'John Jones and Peter Peters came to York',
    'John Jones and Peter Peters came to York'
]
muc.evaluate_all(predictions, ground_truths, texts, verbose=True)

Output:

 NER evaluation scores:
  strict mode, Precision=0.4444, Recall=0.4444, F1:0.4444
   exact mode, Precision=0.5556, Recall=0.5556, F1:0.5556
 partial mode, Precision=0.7778, Recall=0.6667, F1:0.6944
    type mode, Precision=0.8889, Recall=0.6667, F1:0.7222
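These aggregate figures are consistent with a macro-average of the per-text scores (strict F1 of 0.0, 0.3333 and 1.0 for the three texts averages to 0.4444). A minimal sketch of that aggregation, assuming evaluate_all averages the evaluate_one results; it reuses predictions, ground_truths and texts from above:

import eval4ner.muc as muc

# macro-average the per-text strict-mode F1 scores (hypothetical reconstruction of the aggregation)
strict_f1 = [
    muc.evaluate_one(pred, truth, text)['strict']['f1_score']
    for pred, truth, text in zip(predictions, ground_truths, texts)
]
print(sum(strict_f1) / len(strict_f1))  # ~0.4444, matching the strict mode above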

Cite

@misc{eval4ner,
  title={eval4ner},
  author={Yekun Chai},
  year={2018},
  howpublished={\url{https://cyk1337.github.io/notes/2018/11/21/NLP/NER/Evaluation-metrics-of-Name-Entity-Recognition-systems/}},
}

References

  1. Evaluation of the SemEval-2013 Task 9.1: Recognition and Classification of pharmacological substances
  2. MUC-5 Evaluation Metrics
