Skip to main content

Evaluation method for the DRAGON benchmark

Project description

DRAGON Evaluation

Evaluation method for the DRAGON (Diagnostic Report Analysis: General Optimization of NLP) challenge.

Installation

A pre-built Docker container with the DRAGON evaluation method is available:

docker pull joeranbosma/dragon_eval

The DRAGON evaluation method can be pip-installed:

pip install dragon_eval

Or, alternatively, it can be installed from source:

pip install git+https://github.com/DIAGNijmegen/dragon_eval

The evaluation method was tested with Python 3.10. See requirements.txt for a full list of exact package versions.

Usage

The Docker container can be used to evaluate the synthetic datasets as specified in evaluate.sh. To evaluate the synthetic tasks, place the predictions to evaluate in the test-predictions folder and run ./evaluate.sh.

The DRAGON evaluation method can also be used from the command line (if installed with pip):

python -m dragon_eval --ground-truth-path="ground-truth" --predictions-path=test-predictions --output-file=metrics.json --folds 0 1 2 3 4 --tasks 000 001 002 003 004 005 006 007

The command above should work when executed from the dragon_eval folder, which needs to be cloned locally for the ground truth and prediction files to be present. Change the paths above when executing the command from a different place or storing the files in a different place. The tasks and folds to evaluate can be changed with the respective parameters.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

dragon_eval-0.2.1.tar.gz (11.3 kB view details)

Uploaded Source

Built Distribution

dragon_eval-0.2.1-py3-none-any.whl (11.1 kB view details)

Uploaded Python 3

File details

Details for the file dragon_eval-0.2.1.tar.gz.

File metadata

  • Download URL: dragon_eval-0.2.1.tar.gz
  • Upload date:
  • Size: 11.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.12

File hashes

Hashes for dragon_eval-0.2.1.tar.gz
Algorithm Hash digest
SHA256 d49051dc7b6bc87479d3c0073f70210427e40b1a8157ab4a58c3ee38814b86b9
MD5 3c2a19831099b83785503e5c0bc7f388
BLAKE2b-256 e04b0e70d7b9e5a97066d83a7273d6f86d4c00aca3e4252aefa9cd8221f5f03e

See more details on using hashes here.

File details

Details for the file dragon_eval-0.2.1-py3-none-any.whl.

File metadata

  • Download URL: dragon_eval-0.2.1-py3-none-any.whl
  • Upload date:
  • Size: 11.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/4.0.2 CPython/3.10.12

File hashes

Hashes for dragon_eval-0.2.1-py3-none-any.whl
Algorithm Hash digest
SHA256 8181d9b8bc753cf0335786c9421c609d1988df3434a8381e7c4b6d015a24b0bd
MD5 78a98acf5681c0f04a03422317bf65c4
BLAKE2b-256 a2451c4fb43552abd7b52da684e40819f7f4e7ee55a47cdf151d1b8a6e4bea5b

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page