library for question evaluation including KDA, Knowledge Dependent Answerability introduced in EMNLP 2022 work.
Project description
question-score
Repository for KDA(Knowledge-dependent Answerability), EMNLP 2022 work
How to use
pip install -e .
from question_score import KDA
kda = KDA()
print(
kda.kda_small(
"passage",
"question",
["option1", "option2", "option3", "option4"],
1
)
)
What does the score mean?
You can check the explanation of KDA on https://arxiv.org/abs/2211.11902 now. The official link from EMNLP 2022 will soon be released.
You can use $KDA_{small}$ or $KDA_{large}$ instead of heavy metric using all model. Below is the performance of the submetrics, which mentioned on the appendix of the paper.
Sub Metric | Model Count ( Total Size ) | KDA (Valid) | Likert (Test) |
---|---|---|---|
KDA_small | 4 (3.5GB) | 0.740 | 0.377 |
KDA_large | 10 (19.2GB) | 0.784 | 0.421 |
Citation
@inproceedings{moon2022eval,
title={Evaluating the Knowledge Dependency of Questions,
author={Moon, H., Yang, Y., Shin, J., Yu, H., Lee, S., Jeong, M., Park, J., Kim, M., & Choi, S.},
booktitle={Proceedings of the 2022 Conference on Empirical Methods in Natural Language Processing},
year={2022}
}
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
question_score-0.0.1.tar.gz
(12.4 kB
view details)
Built Distribution
File details
Details for the file question_score-0.0.1.tar.gz
.
File metadata
- Download URL: question_score-0.0.1.tar.gz
- Upload date:
- Size: 12.4 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | ee365f24554210d80b8cfc8fd0a570c558b9178da2d8b7ad3859636cbe72f324 |
|
MD5 | bf749ab2588f4034e7022dfceb7d8a67 |
|
BLAKE2b-256 | 60f604e56662a9f1d1b0fb121f4c91905d6a71f80f9f1d4ced3fc75a9b4cf69b |
File details
Details for the file question_score-0.0.1-py3-none-any.whl
.
File metadata
- Download URL: question_score-0.0.1-py3-none-any.whl
- Upload date:
- Size: 12.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.1 CPython/3.9.7
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 5217e874508909f6f631ce3911ee76255bdda1cea3c3e998f169df08a4a591d6 |
|
MD5 | 19f548ae35a0a51993d05198186a2054 |
|
BLAKE2b-256 | 4bba4b3ac099e4f3052dbbea6d184b574a3f26b8271b4f9e4bfa1bf84c9f108a |