Skip to main content

Unified data hub for a better understanding of COVID-19 https://covid19datahub.io

Project description

Python Interface to COVID-19 Data Hub

DOI

The goal of COVID-19 Data Hub is to provide the research community with a unified dataset by collecting worldwide fine-grained case data, merged with exogenous variables helpful for a better understanding of COVID-19. Please agree to the Terms of Use and cite the following reference when using it:

Reference

Guidotti, E., Ardia, D., (2020).
COVID-19 Data Hub
Journal of Open Source Software, 5(51):2376
https://doi.org/10.21105/joss.02376

Setup and usage

Install from pip with

pip install covid19dh

Importing main covid19() function with

from covid19dh import covid19

x,src = covid19("ITA") # load data

Package is regularly updated. Update with

pip install --upgrade covid19dh

Return values

Call of covid19() returns in all cases 2 arguments, pandas dataframes,

  • the data and
  • references to the sources.

Parametrization

Country

Country specifies an administrative region, that the data are fetched from. This is connected with source data comes from. It can be given as ISO3, ISO2, numeric ISO or country name (case-insensitively).

Fetching data from a particular country is done with

x,src = covid19("ESP")

List of ISO codes can be found here.

Filter can also specify multiple countries at the same time

x,src = covid19(["ESP","PT","andorra",250])

Country can be omitted, then whole world data is used.

x,src = covid19()

Date filter

Date can be specified with datetime.datetime, datetime.date or as a str in format YYYY-mm-dd.

from datetime import datetime

x,src = covid19("SWE", start = datetime(2020,4,1), end = "2020-05-01")

Level

Levels work the same way as in all the other our data fetchers.

  1. Country level
  2. State, region or canton level
  3. City or municipality level
from datetime import date

x,src = covid19("USA", level = 2, start = date(2020,5,1))

Cache

Library keeps downloaded data and sources in simple way during runtime. By default, using the cached data is enabled.

Caching can be disabled (e.g. for long running programs) by

x,src = covid19("FRA", cache=False)

More advanced caching is coming.

Vintage

Data Hub enables to fetch the vintage data, data archive collected on each day. The data collecting is stable.

To fetch e.g. US data that were accessible on 10th April 2020 type

x,src = covid19("USA", end = "2020-04-22", vintage = True)

The vintage data are collected at the end of the day, but published with approximately 48 hour delay, once the day is completed in all the timezones.

Hence if vintage = True, but end is not set, warning is raised and None is returned.

x,src = covid19("USA", vintage=True) # too early to get today's vintage
UserWarning: vintage data not available yet

Citations

Sources to data is returned as a second value.

from covid19dh import covid19
x,src = covid19("CZE")

Apart from that a following message is printed on covid19() call.

We have invested a lot of time and effort in creating COVID-19 Data Hub, please cite the following when using it:

        Guidotti, E., Ardia, D., (2020), "COVID-19 Data Hub", Journal of Open Source Software 5(51):2376, doi: 10.21105/joss.02376.

A BibTeX entry for LaTeX users is

        @Article{,
                title = {COVID-19 Data Hub},
                year = {2020},
                doi = {10.21105/joss.02376},
                author = {Emanuele Guidotti and David Ardia},
                journal = {Journal of Open Source Software},
                volume = {5},
                number = {51},
                pages = {2376},
        }

To hide this message use 'verbose = FALSE'.

This feature can be turned off by setting verbose to False.

from covid19dh import covid19
x,src = covid19("CZE", verbose = False) 

Pandas dataframe src has following structure

    iso_alpha_3  administrative_area_level  ...                     institution                                        textVersion
137         CZE                        1.0  ...                             NaN                                                NaN
138         CZE                        1.0  ...                             NaN                                                NaN
139         CZE                        2.0  ...                             NaN                                                NaN
140         CZE                        2.0  ...                             NaN                                                NaN
141         CZE                        2.0  ...                             NaN                                                NaN
142         CZE                        2.0  ...                             NaN                                                NaN
143         CZE                        3.0  ...                             NaN                                                NaN
144         CZE                        3.0  ...                             NaN                                                NaN
145         CZE                        3.0  ...                             NaN                                                NaN
539         NaN                        NaN  ...                             NaN                                                NaN
540         NaN                        NaN  ...                             NaN                                                NaN
541         NaN                        NaN  ...                             NaN                                                NaN
542         NaN                        NaN  ...                             NaN                                                NaN
543         NaN                        NaN  ...                             NaN                                                NaN
544         NaN                        NaN  ...                             NaN                                                NaN
545         NaN                        NaN  ...  Blavatnik School of Government  Hale Thomas, Sam Webster, Anna Petherick, Toby...
546         NaN                        NaN  ...  Blavatnik School of Government  Hale Thomas, Sam Webster, Anna Petherick, Toby...
547         NaN                        NaN  ...  Blavatnik School of Government  Hale Thomas, Sam Webster, Anna Petherick, Toby...
548         NaN                        NaN  ...  Blavatnik School of Government  Hale Thomas, Sam Webster, Anna Petherick, Toby...
549         NaN                        NaN  ...  Blavatnik School of Government  Hale Thomas, Sam Webster, Anna Petherick, Toby...
550         NaN                        NaN  ...  Blavatnik School of Government  Hale Thomas, Sam Webster, Anna Petherick, Toby...
551         NaN                        NaN  ...  Blavatnik School of Government  Hale Thomas, Sam Webster, Anna Petherick, Toby...
552         NaN                        NaN  ...  Blavatnik School of Government  Hale Thomas, Sam Webster, Anna Petherick, Toby...
553         NaN                        NaN  ...  Blavatnik School of Government  Hale Thomas, Sam Webster, Anna Petherick, Toby...
554         NaN                        NaN  ...  Blavatnik School of Government  Hale Thomas, Sam Webster, Anna Petherick, Toby...
555         NaN                        NaN  ...                             NaN                                                NaN

Dataframe columns are

  • iso_alpha_3, administrative_area_level,
  • data_type
  • url
  • title, author, institution
  • year
  • bibtype, textVersion

In progress

  • conversion of the sources
    • printed list of references
    • BibTeX format

Contribution

Developed by Martin Benes

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

covid19dh-1.1.4.tar.gz (11.3 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

covid19dh-1.1.4-py3-none-any.whl (21.3 kB view details)

Uploaded Python 3

File details

Details for the file covid19dh-1.1.4.tar.gz.

File metadata

  • Download URL: covid19dh-1.1.4.tar.gz
  • Upload date:
  • Size: 11.3 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.8.3rc1

File hashes

Hashes for covid19dh-1.1.4.tar.gz
Algorithm Hash digest
SHA256 6104f0527bb110a6278ef927b06778c3c5084fff1c931695d2c283f75b89efd3
MD5 2185a9ae8ed36602841bf6233e9e873c
BLAKE2b-256 0bd7bb8c10ec192e887f5aeaf3651c5498e6dbceaabc773c6a0cc2b0232db948

See more details on using hashes here.

File details

Details for the file covid19dh-1.1.4-py3-none-any.whl.

File metadata

  • Download URL: covid19dh-1.1.4-py3-none-any.whl
  • Upload date:
  • Size: 21.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.1.1 pkginfo/1.5.0.1 requests/2.24.0 setuptools/41.2.0 requests-toolbelt/0.9.1 tqdm/4.46.0 CPython/3.8.3rc1

File hashes

Hashes for covid19dh-1.1.4-py3-none-any.whl
Algorithm Hash digest
SHA256 d53c5bba4c6298581bc659ab49f7a773be4d6ff3be149cabdec29313ca226bdc
MD5 10ae295abe0d39e6b21a3e784b62f4bc
BLAKE2b-256 624f48daad2b23b571d1445842815161b8e11a8fdd965abd226ac63433a680d8

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page