lag time and case fatality rate (CFR)
Project description
covidlag
Data science is useful to investigate the progression of the pandemic.
covidlag is an open-source program which is available in public and can be installed by the PyPI package command (pip). covidlag can calculate the lag time between infection peaks and death peaks. covidlag can also compute death rate per infection or case fatality ratio (CFR). CFR is the proportion of individuals diagnosed with a disease who die from that disease and is therefore a measure of severity among detected cases.
CFR is a dynamic value which is continuously changing. In the conventional methods, CFR is used for retrospective observational studies and are not suitable for continuous time series data.
According to CDC: https://www.cdc.gov/foodnet/reports/data/case-fatality.html
the CFR is calculated by: CFR=(number of cases in which patient died/number of cases).
The CDC method for CFR is appropriate for annual statistics, but not for time series data analysis.
In the current CFR calculation, we need to determine two indicators: the number of sampled days and its range of start and ending dates.
Two indicators significantly influence CFR.
COVID-19 variants and vaccinations can significantly change the lag time and the CFR so that the convetional distribution approaches are not suitable for ongoing time series data.
There is no algorithm to select two appropriate indicators.
The proposed algorithm is based on a robust correlation between infection and death.
The extreme values such as maxima and minima can be used for computing the accurate lag time and the CFR.
The detailed method is under review.
How to install covidlag
Covidlag is available in public and can be installed by the following PyPI packaging command:
$ pip install covidlag
How to run covidlag
covidlag needs at least three parameters (country name, sampled days, regression polinomial degree).
Run the following command composed of the country name, sampled days, the degree of polynomial curve-fitting, and options (L: left, R: right, C: center):
$ covidlag 'United States' 600 13 L
This example shows that r-squared of infections:0.803 and r-squared of deaths:0.733The death peaks are [ 66, 182, 332, 464, 574]. The case peaks are [ 47, 150, 309, 441, 562].
maxima information death peak: 2020-04-23 death peak: 2020-08-17 death peak: 2021-01-14 death peak: 2021-05-26 death peak: 2021-09-13 case peak: 2020-04-04 case peak: 2020-07-16 case peak: 2020-12-22 case peak: 2021-05-03 case peak: 2021-09-01 maxiddeath (array([ 66, 182, 332, 464, 574]),) maxidcase (array([ 47, 150, 309, 441, 562]),) ydeath[maxiddeath] [1963 1010 3105 627 1915] ycase[maxidcase] [ 32699 59662 207017 43332 177464] ================================== minima information death minima: 2020-03-02 death minima: 2020-06-26 death minima: 2020-10-07 death minima: 2021-04-26 death minima: 2021-07-17 death minima: 2021-09-28 case minima: 2020-02-28 case minima: 2020-05-11 case minima: 2020-09-09 case minima: 2021-04-12 case minima: 2021-06-28 case minima: 2021-09-29 miniddeath (array([ 14, 130, 233, 434, 516, 589]),) minidcase (array([ 11, 84, 205, 420, 497, 590]),) ydeath[miniddeath] [-284 594 682 540 113 1735] ycase[minidcase] [-16493 13753 31474 41737 12714 88203]
The lag time between infections and deaths using maxima is 66-47=19 days, 182-150=32 days, 332-309=23 days, 464-441=23 days, 574-562=12 days.
The number of every death peak is [1963 1010 3105 627 1915].
The number of every case peak is [ 32699 59662 207017 43332 177464].
The death rate per infection is 1963/32699=0.060,1010/59662=0.017, 3105/207017=0.015,627/43332=0.014,1915/177464=0.011
CFR in the US is slowly decreasing. But, it is still high compared with that of Japan.
$ covidlag Japan 400 13 L
This example is for "Japan" with the "13th" degree polynomial curve-fitting using "400" sampled days. The result shows the polynomial curve-fitting that r-squared of infections is 0.923 and r-squared of deaths is 0.706.
The death peaks are [152 267 374]. death peak: 2021-01-29 <- death peak: 2021-05-24 <- death peak: 2021-09-08 <- The case peaks are [27 133 252 258]. case peak: 2020-09-26 case peak: 2021-01-10 <- case peak: 2021-05-09 <- case peak: 2021-08-23 <- The lag time between infections and deaths is 152-133=19 days, 267-252=15 days, and 374-258=16 days respectively.
The number of every death peak is [88 92 54].
The number of every case peak is [526 4421 5902 20029]
Therefore, death rate of peaks or CFR is 88/4421=0.019, 92/5902=0.015, and 54/20029=0.0026 respectively.
The CFR is significantly decreasing.
$ covidlag Canada 400 13 L
death peak: 2020-09-07 death peak: 2021-01-06 death peak: 2021-05-08 death peak: 2021-09-25 case peak: 2020-09-07 case peak: 2020-12-26 case peak: 2021-04-21 case peak: 2021-09-19The lag time is 11 days, 17 days,...
The death rate per infection is 0.018,0.006,...
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.