Skip to main content

Gymnasium environments for reinforcement learning in antimicrobial stewardship and antibiotic treatment optimization

Project description

AntibioSim

Python 3.10+ License: MIT Tests: 170 PyPI

Gymnasium environments for reinforcement learning in antimicrobial stewardship and antibiotic treatment optimization.

AntibioSim provides four Gymnasium-compatible environments that model key clinical decision points in antibiotic therapy: drug selection, dose optimization, therapy switching, and ward-level resistance control. Each environment is grounded in established pharmacokinetic/pharmacodynamic (PK/PD) models and bacterial dynamics from the antimicrobial stewardship literature.

Environments

Environment Task Action Space Difficulty
AntibioticSelection-v0 Choose optimal antibiotic from formulary Discrete(5) Easy
DoseOptimization-v0 Optimize dosing for PK/PD target attainment Box(1) Medium
TherapySwitch-v0 IV-to-oral switch and escalation decisions Discrete(4) Medium
ResistanceControl-v0 Ward-level antibiotic policy for resistance control MultiDiscrete(3^5) Hard

Installation

pip install antibiosim

For training:

pip install antibiosim[train]

Quick Start

import gymnasium as gym
import antibiosim

env = gym.make("antibiosim/AntibioticSelection-v0")
obs, info = env.reset()

for _ in range(14):
    action = env.action_space.sample()
    obs, reward, terminated, truncated, info = env.step(action)
    if terminated or truncated:
        break

Domain Models

  • Pharmacokinetics: One-compartment and two-compartment PK models (Drusano, 2004)
  • Pharmacodynamics: Emax and sigmoidal Emax models (Regoes et al., 2004)
  • Bacterial dynamics: Logistic growth with antibiotic kill (Austin et al., 1999)
  • Resistance: Two-population susceptible/resistant dynamics (Levin & Bonten, 2004)

Baseline Agents

  • Random: Uniform random action selection
  • Heuristic: Clinical guideline-based rules (Barlam et al., 2016)
  • PPO: Proximal Policy Optimization via Stable-Baselines3

Training

# Train PPO on all environments
python -m antibiosim.training.train_all

# Train a single environment
antibiosim-train --env antibiosim/DoseOptimization-v0

Citation

@software{dhia2026antibiosim,
  author = {Dhia, Hass},
  title = {AntibioSim: Gymnasium Environments for Reinforcement Learning in Antimicrobial Stewardship},
  year = {2026},
  url = {https://github.com/HassDhia/antibiosim},
  version = {0.1.0}
}

License

MIT License. Copyright (c) 2026 Hass Dhia, Smart Technology Investments Research Institute.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

antibiosim-0.1.0.tar.gz (35.7 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

antibiosim-0.1.0-py3-none-any.whl (33.4 kB view details)

Uploaded Python 3

File details

Details for the file antibiosim-0.1.0.tar.gz.

File metadata

  • Download URL: antibiosim-0.1.0.tar.gz
  • Upload date:
  • Size: 35.7 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.6

File hashes

Hashes for antibiosim-0.1.0.tar.gz
Algorithm Hash digest
SHA256 84b9a18ba04422d787fc67f676e9ba86a4318bf636827f7066b55a0c602f6642
MD5 9d49e4e7d8520280a1999f220bb00ca3
BLAKE2b-256 e7a6b68066bf59ca086a790a778f76fa08ab78e04d829359c8343296dbd2f4ae

See more details on using hashes here.

File details

Details for the file antibiosim-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: antibiosim-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 33.4 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/6.2.0 CPython/3.11.6

File hashes

Hashes for antibiosim-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 fb1cd5c456bfd728860bc4286bf61fcb46592fbb8e9ff69c36206ae70aee31de
MD5 82b9d1b05b0a260de9d7c9517f7c9c43
BLAKE2b-256 4c54b3958ce9e4c8727e43b39b43baf668415bef34b9bd420544476e5ef6a058

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page