Skip to main content

Covid-19 related datasets for ASReview

Project description

ASReview for COVID19

Extension to add publications on COVID-19 to ASReview.

ASReview against COVID-19

The Active learning for Systematic Reviews software ASReview implements learning algorithms that interactively query the researcher during the title and abstract reading phase of a systematic search. This way of interactive training is known as active learning. ASReview offers support for classical learning algorithms and state-of-the-art learning algorithms like neural networks. The software can be used for traditional systematic reviews for which the user uploads a dataset of papers, or one can make use of the built-in datasets.

To help combat the COVID-19 crisis, the ASReview team released an extension that integrates the latest scientific datasets on COVID-19 in the ASReview software.

CORD-19 dataset

The CORD-19 dataset is a dataset with scientific publications on COVID-19 and coronavirus-related research (e.g. SARS, MERS, etc.) from PubMed Central, the WHO COVID-19 database of publications, the preprint servers bioRxiv and medRxiv and papers contributed by specific publishers (currently Elsevier). The dataset is compiled and maintained by a collaboration of the Allen Institute for AI, the Chan Zuckerberg Initiative, Georgetown University’s Center for Security and Emerging Technology, Microsoft Research, and the National Library of Medicine of the National Institutes of Health. Version 8 of the dataset (csv, dated April 17, 2020) contains metadata of 52.4K publications on COVID-19 and coronavirus-related research. The CORD-19 dataset is updated weekly.

ASReview plugin

To help combat the COVID-19 crisis, the ASReview team has decided to release a package that provides the latest scientific datasets on COVID-19. These are integrated automatically into ASReview once we install the correct packages, so reviewers can start reviewing the latest scientific literature on COVID-19 as soon as possible! Two versions of the CORD-19 dataset (publications relating to COVID-19) are made available in ASReview:

  • full CORD-19 dataset
  • CORD-19 dataset with publications from December 2019 onwards

The current datasets are based on CORD-19 version 8 (released 2020-04-17)

The datasets are updated in ASReview plugin shortly after the release by the Allen Institute for AI.

Installation and usage

The COVID-19 plug-in requires ASReview 0.8 or higher. Install ASReview by following the instructions in Installation of ASReview.

Install the extension with pip:

pip install asreview-covid19

The datasets are immediately available after starting ASReview.

asreview oracle

The datasets are selectable in Step 2 of the project initialization. For more information on the usage of ASReview, please have a look at the Quick Tour.

ASReview CORD19 datasets

License and contact

The ASReview software and the plugin have an Apache 2.0 LICENSE. For the datasets, please see the license of the CORD-19 dataset

This project is coordinated by by Rens van de Schoot (@Rensvandeschoot) and Daniel Oberski (@daob) and is part of the research work conducted by the Department of Methodology & Statistics, Faculty of Social and Behavioral Sciences, Utrecht University, The Netherlands. Maintainers are Jonathan de Bruin (@J535D165) and Raoul Schram (@qubixes).

Got ideas for improvement? For any questions or remarks, please send an email to

Project details

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Files for asreview-covid19, version 0.8
Filename, size File type Python version Upload date Hashes
Filename, size asreview_covid19-0.8-py3-none-any.whl (8.3 kB) File type Wheel Python version py3 Upload date Hashes View
Filename, size asreview-covid19-0.8.tar.gz (4.3 kB) File type Source Python version None Upload date Hashes View

Supported by

AWS AWS Cloud computing Datadog Datadog Monitoring DigiCert DigiCert EV certificate Facebook / Instagram Facebook / Instagram PSF Sponsor Fastly Fastly CDN Google Google Object Storage and Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Salesforce Salesforce PSF Sponsor Sentry Sentry Error logging StatusPage StatusPage Status page