Skip to main content

Advanced epidemiological modeling and forecasting using discrete SIRD/SIRDV models with time series analysis

Project description

Epydemics: Forecasting COVID-19 using time series and machine learning

Version 0.9.0 - Native Multi-Frequency (Daily, Business Day, Weekly, Monthly, Annual)

epydemics is a Python library for epidemiological modeling and forecasting. It provides tools for creating, fitting, and evaluating discrete SIRD/SIRDV models with time-dependent parameters. The library is designed to be flexible and extensible, allowing users to easily incorporate their own data and models.

📖 New Users? Start with the User Guide to understand when and how to use epydemics.

Features

  • Discrete SIRD Model: A discrete Susceptible-Infected-Recovered-Deceased (SIRD) model with time-dependent parameters.
  • SIRDV Vaccination Model (v0.7.0): Extended model including Vaccinated compartment (V) and vaccination rate (δ).
  • Time Series Forecasting: Use of VAR (Vector Autoregression) models to forecast epidemic rates with logit transformation.
  • Multi-Frequency Support (v0.9.0): Native processing of daily, business day, weekly, monthly, and annual data with automatic detection and handler-based validation (no artificial reindexing).
  • Temporal Aggregation (v0.9.0): Frequency-aware aggregation that skips resampling when source and target frequencies match.
  • Business Day Support (v0.9.0): Dedicated handler for trading-day calendars (252 days/year) with validated lags and recovery windows.
  • Data Container: A convenient class for loading, preprocessing, and storing epidemiological data.
  • Parallel Simulations: Multi-core support for faster Monte Carlo simulations (27 scenarios for SIRD, 81 for SIRDV).
  • Result Caching: File-based caching to avoid recomputing identical analyses.
  • Model Evaluation: Tools for evaluating model performance with MAE, MSE, RMSE, MAPE, SMAPE metrics.
  • Visualization: Professional plotting functions for results and forecasts.

✅ Native Multi-Frequency: v0.9.0 processes annual/monthly/weekly/business-day data without artificial daily reindexing. See User Guide for frequency guidance.

SIRDV Model (New in v0.7.0)

The SIRDV model extends the classical SIRD model by incorporating vaccination:

Compartments:

  • S: Susceptible
  • I: Infected (active cases)
  • R: Recovered
  • D: Deaths
  • V: Vaccinated (new)

Rates:

  • α: Infection rate
  • β: Recovery rate
  • γ: Mortality rate
  • δ: Vaccination rate (new)

Key Features:

  • Automatic detection from vaccination data
  • 81 simulation scenarios (3⁴ confidence levels)
  • Conservation law: N = S + I + R + D + V
  • Parallel execution recommended for performance

Getting Started

To get started with epydemics, we recommend following the tutorial in TUTORIAL.md.

Installation

You can install epydemics from PyPI:

pip install epydemics

To install the latest development version, you can clone this repository and install it in editable mode:

git clone https://github.com/julihocc/epydemics.git
cd epydemics
pip install -e .

Documentation

User Documentation:

  • User Guide: Complete guide on when to use epydemics, data preparation, and frequency handling
  • Tutorial: Step-by-step guide for COVID-19 forecasting
  • CHANGELOG.md: Version history and detailed changes

Developer Documentation:

Release Documentation:

Further work

Recent additions in v0.9.0:

  • Native Multi-Frequency: Daily, business day, weekly, monthly, annual handlers; no artificial reindexing
  • Frequency-Aware Aggregation: Skip resampling when source and target frequencies match
  • Frequency-Aware Seasonality: Adaptive seasonal detection per frequency
  • Business Day Support: Trading-day defaults (252 days/year, 10-lag VAR, 10-day recovery)

Previous releases (v0.6.1-v0.7.0):

  • SIRDV Model Support: Automatic detection and modeling of vaccination data
  • Parallel Simulations: Multi-core execution for improved performance
  • Result Caching: Optional caching to avoid recomputation
  • Enhanced Testing: Comprehensive test coverage with slow test markers

Future directions (v1.0.0+):

  • Incidence-First Mode: Direct modeling of incident cases (not just cumulative)
  • Importation Modeling: Handle external case introductions for eliminated diseases
  • Probabilistic & Scenario Forecasting: Monte Carlo and intervention comparison workflows
  • More advanced time series models: SARIMAX, Prophet, or deep learning approaches
  • Support for other epidemiological models: Extend to SEIR, SEIRD, or metapopulation models
  • Improved visualization: Interactive dashboards and real-time updating plots

References

Allen u.a. 2008 Allen, L.J.S. ; Brauer, F. ; Driessche, P. van den ; Bauch, C.T. ; Wu, J. ; Castillo-Chavez, C. ; Earn, D. ; Feng, Z. ; Lewis, M.A. ; Li, J. u.a.: Mathematical Epidemiology. Springer Berlin Heidelberg, 2008 (Lecture Notes in Mathematics).– URL https://books. google.com/books?id=gcP5l1a22rQC.– ISBN 9783540789109

Andrade u.a. 2021 Andrade, Marinho G. ; Achcar, Jorge A. ; Conce icc˜ ao, Katiane S. ; Ravishanker, Nalini: Time Series Regression Models for COVID-19 Deaths. In: J. Data Sci 19 (2021), Nr. 2, S. 269–292

Hawas 2020 Hawas, Mohamed: Generated time-series prediction data of COVID-19s daily infections in Brazil by using recurrent neural networks. In: Data in brief 32 (2020), S. 106175

Maleki u.a. 2020 Maleki, Mohsen ; Mahmoudi, Mohammad R. ; Wraith, Darren ; Pho, Kim-Hung: Time series modelling to forecast the confirmed and recovered cases of COVID-19. In: Travel medicine and infectious disease 37 (2020), S. 101742

Martcheva 2015 Martcheva, M.: An Introduction to Mathematical Epi demiology. Springer US, 2015 (Texts in Applied Mathematics).– URL https: //books.google.com/books?id=tt7HCgAAQBAJ.– ISBN 9781489976123

Singh u.a. 2020 Singh, Vijander ; Poonia, Ramesh C. ; Kumar, Sandeep ; Dass, Pranav ; Agarwal, Pankaj ; Bhatnagar, Vaibhav ; Raja, Linesh: Prediction of COVID-19 coronavirus pandemic based on time series data using Support Vector Machine. In: Journal of Discrete Mathematical Sciences and Cryptography 23 (2020), Nr. 8, S. 1583–1597

Wacker und Schluter 2020 Wacker, Benjamin; Schluter, Jan: Time continuous and time-discrete SIR models revisited: theory and applications. In: Advances in Difference Equations 2020 (2020), Nr. 1, S. 1–44.– ISSN 1687-1847.– URL https://doi.org/10.1186/s13662-020-02907-9

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epydemics-0.9.0.tar.gz (77.0 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

epydemics-0.9.0-py3-none-any.whl (78.5 kB view details)

Uploaded Python 3

File details

Details for the file epydemics-0.9.0.tar.gz.

File metadata

  • Download URL: epydemics-0.9.0.tar.gz
  • Upload date:
  • Size: 77.0 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.1

File hashes

Hashes for epydemics-0.9.0.tar.gz
Algorithm Hash digest
SHA256 7a1ea63d9325d8849d65487c5a03241b25cd3984e7df24abef8b3b9e0d071e1d
MD5 fd888af4fc3682f32c98dbc1208c7fb2
BLAKE2b-256 3186e22ab1b2f59f60ed9ee9475c020cebb4a033bf0f566f383662af6ef6bcd5

See more details on using hashes here.

File details

Details for the file epydemics-0.9.0-py3-none-any.whl.

File metadata

  • Download URL: epydemics-0.9.0-py3-none-any.whl
  • Upload date:
  • Size: 78.5 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.1

File hashes

Hashes for epydemics-0.9.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fc3f90d86c6596ba88995dbfcd2a5e62ad0f5be100c7f8b7eb3a734e4e74abbc
MD5 3782fa838e891d99871627d3f927dbe1
BLAKE2b-256 fd2be1cf0241114363ae16484b944ddf8d4cab0eab9a40c32e5331683d2f8d95

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page