Evaluation method for the DRAGON benchmark
Project description
DRAGON Evaluation
Evaluation method for the DRAGON (Diagnostic Report Analysis: General Optimization of NLP) challenge.
Installation
A pre-built Docker container with the DRAGON evaluation method is available:
docker pull joeranbosma/dragon_eval
The DRAGON evaluation method can be pip-installed:
pip install dragon_eval
Or, alternatively, it can be installed from source:
pip install git+https://github.com/DIAGNijmegen/dragon_eval
The evaluation method was tested with Python 3.10. See requirements.txt for a full list of exact package versions.
Usage
The Docker container can be used to evaluate the synthetic datasets as specified in evaluate.sh. To evaluate the synthetic tasks, place the predictions to evaluate in the test-predictions
folder and run ./evaluate.sh
.
The DRAGON evaluation method can also be used from the command line (if installed with pip):
python -m dragon_eval --ground-truth-path="ground-truth" --predictions-path=test-predictions --output-file=metrics.json --folds 0 1 2 3 4 --tasks 000 001 002 003 004 005 006 007
The command above should work when executed from the dragon_eval
folder, which needs to be cloned locally for the ground truth and prediction files to be present. Change the paths above when executing the command from a different place or storing the files in a different place. The tasks and folds to evaluate can be changed with the respective parameters.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file dragon_eval-0.1.tar.gz
.
File metadata
- Download URL: dragon_eval-0.1.tar.gz
- Upload date:
- Size: 11.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | d5035b4837876f3ed927c5fd5388303fa59261d598f0a1399303f0d7ef23665d |
|
MD5 | e41728b78188ba2c6c861a77850a73ac |
|
BLAKE2b-256 | 94b7f19ccff79d0509f1d0cc5b52f134c7d0a40e8e8247a6a61b5338e9d7f90e |
File details
Details for the file dragon_eval-0.1-py3-none-any.whl
.
File metadata
- Download URL: dragon_eval-0.1-py3-none-any.whl
- Upload date:
- Size: 11.2 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.10.12
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 8eaab6921a0c35854a18384230e134845ab78fdec2f2e36a437c4a0a0e989cb9 |
|
MD5 | d87f21c76b434a0c4a2095042ffe5f4c |
|
BLAKE2b-256 | 9926b1f42ebdfb17f177ac47f473d275927ef8c10ff89026603adbaa1ebb73f8 |