Skip to main content

Advanced epidemiological modeling and forecasting using discrete SIRD/SIRDV models with time series analysis

Project description

Epydemics: Forecasting COVID-19 using time series and machine learning

Version 0.9.0 - Native Multi-Frequency (Daily, Business Day, Weekly, Monthly, Annual)

epydemics is a Python library for epidemiological modeling and forecasting. It provides tools for creating, fitting, and evaluating discrete SIRD/SIRDV models with time-dependent parameters. The library is designed to be flexible and extensible, allowing users to easily incorporate their own data and models.

📖 New Users? Start with the User Guide to understand when and how to use epydemics.

Features

  • Discrete SIRD Model: A discrete Susceptible-Infected-Recovered-Deceased (SIRD) model with time-dependent parameters.
  • SIRDV Vaccination Model (v0.7.0): Extended model including Vaccinated compartment (V) and vaccination rate (δ).
  • Time Series Forecasting: Use of VAR (Vector Autoregression) models to forecast epidemic rates with logit transformation.
  • Multi-Frequency Support (v0.9.0): Native processing of daily, business day, weekly, monthly, and annual data with automatic detection and handler-based validation (no artificial reindexing).
  • Temporal Aggregation (v0.9.0): Frequency-aware aggregation that skips resampling when source and target frequencies match.
  • Business Day Support (v0.9.0): Dedicated handler for trading-day calendars (252 days/year) with validated lags and recovery windows.
  • Data Container: A convenient class for loading, preprocessing, and storing epidemiological data.
  • Parallel Simulations: Multi-core support for faster Monte Carlo simulations (27 scenarios for SIRD, 81 for SIRDV).
  • Result Caching: File-based caching to avoid recomputing identical analyses.
  • Model Evaluation: Tools for evaluating model performance with MAE, MSE, RMSE, MAPE, SMAPE metrics.
  • Visualization: Professional plotting functions for results and forecasts.

✅ Native Multi-Frequency: v0.9.0 processes annual/monthly/weekly/business-day data without artificial daily reindexing. See User Guide for frequency guidance.

SIRDV Model (New in v0.7.0)

The SIRDV model extends the classical SIRD model by incorporating vaccination:

Compartments:

  • S: Susceptible
  • I: Infected (active cases)
  • R: Recovered
  • D: Deaths
  • V: Vaccinated (new)

Rates:

  • α: Infection rate
  • β: Recovery rate
  • γ: Mortality rate
  • δ: Vaccination rate (new)

Key Features:

  • Automatic detection from vaccination data
  • 81 simulation scenarios (3⁴ confidence levels)
  • Conservation law: N = S + I + R + D + V
  • Parallel execution recommended for performance

Getting Started

To get started with epydemics, we recommend following the tutorial in TUTORIAL.md.

Installation

You can install epydemics from PyPI:

pip install epydemics

To install the latest development version, you can clone this repository and install it in editable mode:

git clone https://github.com/julihocc/epydemics.git
cd epydemics
pip install -e .

Documentation

User Documentation:

  • User Guide: Complete guide on when to use epydemics, data preparation, and frequency handling
  • Tutorial: Step-by-step guide for COVID-19 forecasting
  • CHANGELOG.md: Version history and detailed changes

Developer Documentation:

Release Documentation:

Further work

Recent additions in v0.9.0:

  • Native Multi-Frequency: Daily, business day, weekly, monthly, annual handlers; no artificial reindexing
  • Frequency-Aware Aggregation: Skip resampling when source and target frequencies match
  • Frequency-Aware Seasonality: Adaptive seasonal detection per frequency
  • Business Day Support: Trading-day defaults (252 days/year, 10-lag VAR, 10-day recovery)

Previous releases (v0.6.1-v0.7.0):

  • SIRDV Model Support: Automatic detection and modeling of vaccination data
  • Parallel Simulations: Multi-core execution for improved performance
  • Result Caching: Optional caching to avoid recomputation
  • Enhanced Testing: Comprehensive test coverage with slow test markers

Future directions (v1.0.0+):

  • Incidence-First Mode: Direct modeling of incident cases (not just cumulative)
  • Importation Modeling: Handle external case introductions for eliminated diseases
  • Probabilistic & Scenario Forecasting: Monte Carlo and intervention comparison workflows
  • More advanced time series models: SARIMAX, Prophet, or deep learning approaches
  • Support for other epidemiological models: Extend to SEIR, SEIRD, or metapopulation models
  • Improved visualization: Interactive dashboards and real-time updating plots

References

Allen u.a. 2008 Allen, L.J.S. ; Brauer, F. ; Driessche, P. van den ; Bauch, C.T. ; Wu, J. ; Castillo-Chavez, C. ; Earn, D. ; Feng, Z. ; Lewis, M.A. ; Li, J. u.a.: Mathematical Epidemiology. Springer Berlin Heidelberg, 2008 (Lecture Notes in Mathematics).– URL https://books. google.com/books?id=gcP5l1a22rQC.– ISBN 9783540789109

Andrade u.a. 2021 Andrade, Marinho G. ; Achcar, Jorge A. ; Conce icc˜ ao, Katiane S. ; Ravishanker, Nalini: Time Series Regression Models for COVID-19 Deaths. In: J. Data Sci 19 (2021), Nr. 2, S. 269–292

Hawas 2020 Hawas, Mohamed: Generated time-series prediction data of COVID-19s daily infections in Brazil by using recurrent neural networks. In: Data in brief 32 (2020), S. 106175

Maleki u.a. 2020 Maleki, Mohsen ; Mahmoudi, Mohammad R. ; Wraith, Darren ; Pho, Kim-Hung: Time series modelling to forecast the confirmed and recovered cases of COVID-19. In: Travel medicine and infectious disease 37 (2020), S. 101742

Martcheva 2015 Martcheva, M.: An Introduction to Mathematical Epi demiology. Springer US, 2015 (Texts in Applied Mathematics).– URL https: //books.google.com/books?id=tt7HCgAAQBAJ.– ISBN 9781489976123

Singh u.a. 2020 Singh, Vijander ; Poonia, Ramesh C. ; Kumar, Sandeep ; Dass, Pranav ; Agarwal, Pankaj ; Bhatnagar, Vaibhav ; Raja, Linesh: Prediction of COVID-19 coronavirus pandemic based on time series data using Support Vector Machine. In: Journal of Discrete Mathematical Sciences and Cryptography 23 (2020), Nr. 8, S. 1583–1597

Wacker und Schluter 2020 Wacker, Benjamin; Schluter, Jan: Time continuous and time-discrete SIR models revisited: theory and applications. In: Advances in Difference Equations 2020 (2020), Nr. 1, S. 1–44.– ISSN 1687-1847.– URL https://doi.org/10.1186/s13662-020-02907-9

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

epydemics-0.9.1.tar.gz (78.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

epydemics-0.9.1-py3-none-any.whl (79.9 kB view details)

Uploaded Python 3

File details

Details for the file epydemics-0.9.1.tar.gz.

File metadata

  • Download URL: epydemics-0.9.1.tar.gz
  • Upload date:
  • Size: 78.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.1

File hashes

Hashes for epydemics-0.9.1.tar.gz
Algorithm Hash digest
SHA256 740bab5b6cf86f4c62077139691d7c3cb781b8093a28d9345183c89a1a85e475
MD5 48f9eef18e04b79490e9926486c357fa
BLAKE2b-256 06bfc4e92e5d5875224a85374286e4f26ebd9599e0fabe3a28bd8e07b6a080d0

See more details on using hashes here.

File details

Details for the file epydemics-0.9.1-py3-none-any.whl.

File metadata

  • Download URL: epydemics-0.9.1-py3-none-any.whl
  • Upload date:
  • Size: 79.9 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.12.1

File hashes

Hashes for epydemics-0.9.1-py3-none-any.whl
Algorithm Hash digest
SHA256 07d3d21090db27acc40ba7f0afb1bd7123ceb9754c1ea439720d1bff4da9af6c
MD5 80abd9800cbaf551f9ac95296d3819e7
BLAKE2b-256 71a35e733aaae0823dd55455e7070f236f09079d45e2ed890faece33b35d2644

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page