A gaggle of rerankers for CovidQA and CORD-19

PyGaggle

Installation

  1. Install the package with pip: pip install pygaggle. If you prefer Anaconda, run conda env create -f environment.yml && conda activate pygaggle instead.

  2. Install PyTorch 1.4+.

  3. Download the index: sh scripts/update-index.sh.

  4. Make sure Java 11 or later is installed; you can check with javac --version.

  5. Install Anserini.
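
The Java requirement in step 4 can also be checked programmatically. The following is a minimal sketch, not part of PyGaggle: the helper names parse_major_version and java_is_recent_enough are hypothetical, and the parsing assumes the usual "javac 11.0.2" or legacy "javac 1.8.0_252" output formats.

```python
import re
import shutil
import subprocess
from typing import Optional


def parse_major_version(version_output: str) -> Optional[int]:
    """Extract the Java major version from text such as 'javac 11.0.2'."""
    match = re.search(r"javac\s+(\d+)(?:\.(\d+))?", version_output)
    if match is None:
        return None
    major = int(match.group(1))
    # Legacy scheme: Java 8 and earlier report themselves as 1.x (e.g. 1.8.0_252).
    if major == 1 and match.group(2) is not None:
        return int(match.group(2))
    return major


def java_is_recent_enough(minimum: int = 11) -> bool:
    """Return True if a javac on PATH reports at least the given major version."""
    if shutil.which("javac") is None:
        return False
    result = subprocess.run(
        ["javac", "-version"], capture_output=True, text=True
    )
    # Older JDKs print the version to stderr, newer ones to stdout; check both.
    major = parse_major_version(result.stdout + result.stderr)
    return major is not None and major >= minimum
```

Calling java_is_recent_enough() before starting a long evaluation run gives an early, readable failure if the JDK is missing or too old.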

Running rerankers on CovidQA

By default, the evaluation script reads the index from data/lucene-index-covid-paragraph. To use an index stored elsewhere, set the environment variable CORD19_INDEX_PATH to its path.
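
The lookup described above amounts to reading an environment variable with a fallback. The sketch below illustrates that pattern only; it is not taken from the PyGaggle source, and resolve_index_path is a hypothetical helper name.

```python
import os
from typing import Mapping, Optional

# Default index location named in this README.
DEFAULT_INDEX_PATH = "data/lucene-index-covid-paragraph"


def resolve_index_path(environ: Optional[Mapping[str, str]] = None) -> str:
    """Return CORD19_INDEX_PATH if it is set and non-empty, else the default."""
    env = os.environ if environ is None else environ
    return env.get("CORD19_INDEX_PATH") or DEFAULT_INDEX_PATH
```

Passing a plain dictionary instead of os.environ makes the behaviour easy to check without touching the real environment, for example resolve_index_path({"CORD19_INDEX_PATH": "/tmp/index"}).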

Unsupervised Methods

BM25: python -um pygaggle.run.evaluate_kaggle_highlighter --method bm25

BERT: python -um pygaggle.run.evaluate_kaggle_highlighter --method transformer --model-name bert-base-cased

SciBERT: python -um pygaggle.run.evaluate_kaggle_highlighter --method transformer --model-name allenai/scibert_scivocab_cased

BioBERT: python -um pygaggle.run.evaluate_kaggle_highlighter --method transformer --model-name biobert

Supervised Methods

T5 (MARCO): python -um pygaggle.run.evaluate_kaggle_highlighter --method t5
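
To run every method listed above in sequence, the commands can be assembled in a small driver script. This is a rough sketch under the assumption that the flags are exactly those shown in this README; build_command and run_all are hypothetical helpers, and dry_run defaults to True so nothing is executed unless requested.

```python
import subprocess
import sys
from typing import Dict, List

MODULE = "pygaggle.run.evaluate_kaggle_highlighter"

# Arguments copied from the commands listed in this README.
RUNS: Dict[str, List[str]] = {
    "BM25": ["--method", "bm25"],
    "BERT": ["--method", "transformer", "--model-name", "bert-base-cased"],
    "SciBERT": ["--method", "transformer", "--model-name",
                "allenai/scibert_scivocab_cased"],
    "BioBERT": ["--method", "transformer", "--model-name", "biobert"],
    "T5 (MARCO)": ["--method", "t5"],
}


def build_command(extra_args: List[str], python: str = sys.executable) -> List[str]:
    """Build the argument list for one evaluation run."""
    return [python, "-um", MODULE, *extra_args]


def run_all(dry_run: bool = True) -> List[List[str]]:
    """Build every command; execute each one only when dry_run is False."""
    commands = []
    for name, args in RUNS.items():
        command = build_command(args)
        commands.append(command)
        print(f"{name}: {' '.join(command)}")
        if not dry_run:
            # check=True stops the loop at the first failing run.
            subprocess.run(command, check=True)
    return commands
```

Calling run_all() prints the five commands for inspection; run_all(dry_run=False) would execute them one after another with the current Python interpreter.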
