Skip to main content

Covid-19 related datasets for ASReview

Project description

ASReview for COVID19

Extension to add publications on COVID-19 to ASReview.

ASReview against COVID-19

The Active learning for Systematic Reviews software ASReview implements learning algorithms that interactively query the researcher during the title and abstract reading phase of a systematic search. This way of interactive training is known as active learning. ASReview offers support for classical learning algorithms and state-of-the-art learning algorithms like neural networks. The software can be used for traditional systematic reviews for which the user uploads a dataset of papers, or one can make use of the built-in datasets.

To help combat the COVID-19 crisis, the ASReview team released an extension that integrates the latest scientific datasets on COVID-19 in the ASReview software.

CORD-19 dataset

The CORD-19 dataset is a dataset with scientific publications on COVID-19 and coronavirus-related research (e.g. SARS, MERS, etc.) from PubMed Central, the WHO COVID-19 database of publications, the preprint servers bioRxiv, medRxiv and arXiv, and papers contributed by specific publishers (currently Elsevier). The dataset is compiled and maintained by a collaboration of the Allen Institute for AI, the Chan Zuckerberg Initiative, Georgetown University’s Center for Security and Emerging Technology, Microsoft Research, and the National Library of Medicine of the National Institutes of Health. The full dataset contains metadata of 60K publications on COVID-19 and coronavirus-related research. The CORD-19 dataset is updated weekly.

The most recent version of the dataset can be downloaded here:
https://ai2-semanticscholar-cord-19.s3-us-west-2.amazonaws.com/latest/metadata.csv.
All versions are archived on Zenodo.

COVID19 preprints dataset

The COVID19 preprints dataset is created by Nicholas Fraser and Bianca Kramer, by collecting metadata of COVID19-related preprints from over 15 preprint servers with DOIs registered with Crossref or DataCite, and from arXiv. The dataset contains metadata of 7K preprints on COVID-19 and coronavirus-related research. The COVID19 preprints dataset is updated weekly.

The most recent version of the dataset can be downloaded here (csv):
https://github.com/nicholasmfraser/covid19_preprints/blob/master/data/covid19_preprints.csv.
All versions are archived on Figshare

ASReview plugin

To help combat the COVID-19 crisis, the ASReview team has decided to release a package that provides the latest scientific datasets on COVID-19. These are integrated automatically into ASReview once we install the correct packages, so reviewers can start reviewing the latest scientific literature on COVID-19 as soon as possible! Two versions of the CORD-19 dataset (publications relating to COVID-19) are made available in ASReview, as well as the COVID19 preprints dataset

  • full CORD-19 dataset
  • CORD-19 dataset with publications from December 2019 onwards
  • COVID19 preprints dataset

The current datasets are based on CORD-19 version 10 (released 2020-05-01) and COVID19 preprints version 6 (released 2020-05-03)

The datasets are updated in ASReview plugin shortly after their release.

Installation and usage

The COVID-19 plug-in requires ASReview 0.8 or higher. Install ASReview by following the instructions in Installation of ASReview.

Install the extension with pip:

pip install asreview-covid19

The datasets are immediately available after starting ASReview.

asreview oracle

The datasets are selectable in Step 2 of the project initialization. For more information on the usage of ASReview, please have a look at the Quick Tour.

ASReview CORD19 datasets

License, citation and contact

DOI License

The ASReview software and the plugin have an Apache 2.0 LICENSE. For the datasets, please see the license of the CORD-19 dataset https://pages.semanticscholar.org/coronavirus-research. The COVID19 preprints dataset has a CC0 license.

Visit https://doi.org/10.5281/zenodo.3764749 to get the citation style of your preference.

This project is coordinated by by Rens van de Schoot (@Rensvandeschoot) and Daniel Oberski (@daob) and is part of the research work conducted by the Department of Methodology & Statistics, Faculty of Social and Behavioral Sciences, Utrecht University, The Netherlands. Maintainers are Jonathan de Bruin (@J535D165) and Raoul Schram (@qubixes).

Got ideas for improvement? For any questions or remarks, please send an email to asreview@uu.nl.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

asreview-covid19-0.9.tar.gz (5.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

asreview_covid19-0.9-py3-none-any.whl (8.9 kB view details)

Uploaded Python 3

File details

Details for the file asreview-covid19-0.9.tar.gz.

File metadata

  • Download URL: asreview-covid19-0.9.tar.gz
  • Upload date:
  • Size: 5.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.7

File hashes

Hashes for asreview-covid19-0.9.tar.gz
Algorithm Hash digest
SHA256 2e9a9ed68306a8306c4854c99502df3eec7a3dcf074c9569c399f3d20a51a92b
MD5 1b9535533beeb1afe44337477131e919
BLAKE2b-256 9eb1676420a9a508e86581ffc61bd0fbcd09cc1293837eb1f45611c2e894349a

See more details on using hashes here.

File details

Details for the file asreview_covid19-0.9-py3-none-any.whl.

File metadata

  • Download URL: asreview_covid19-0.9-py3-none-any.whl
  • Upload date:
  • Size: 8.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.23.0 setuptools/46.1.3 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.7.7

File hashes

Hashes for asreview_covid19-0.9-py3-none-any.whl
Algorithm Hash digest
SHA256 d21e8d0e71528a985ca177c688051ad6a0b457fc676888c0cd50a35579b9caac
MD5 d92ce2617639b28e88b95f6d400fa2eb
BLAKE2b-256 2ee578e8e509f4e2ea63d50cc19944749ba07e0bb4f8bd25a1f3294c4fc9e338

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page