Gymnasium environments for reinforcement learning in antimicrobial stewardship and antibiotic treatment optimization
Project description
AntibioSim
Gymnasium environments for reinforcement learning in antimicrobial stewardship and antibiotic treatment optimization.
AntibioSim provides four Gymnasium-compatible environments that model key clinical decision points in antibiotic therapy: drug selection, dose optimization, therapy switching, and ward-level resistance control. Each environment is grounded in established pharmacokinetic/pharmacodynamic (PK/PD) models and bacterial dynamics from the antimicrobial stewardship literature.
Environments
| Environment | Task | Action Space | Difficulty |
|---|---|---|---|
AntibioticSelection-v0 |
Choose optimal antibiotic from formulary | Discrete(5) | Easy |
DoseOptimization-v0 |
Optimize dosing for PK/PD target attainment | Box(1) | Medium |
TherapySwitch-v0 |
IV-to-oral switch and escalation decisions | Discrete(4) | Medium |
ResistanceControl-v0 |
Ward-level antibiotic policy for resistance control | MultiDiscrete(3^5) | Hard |
Installation
pip install antibiosim
For training:
pip install antibiosim[train]
Quick Start
import gymnasium as gym
import antibiosim
env = gym.make("antibiosim/AntibioticSelection-v0")
obs, info = env.reset()
for _ in range(14):
action = env.action_space.sample()
obs, reward, terminated, truncated, info = env.step(action)
if terminated or truncated:
break
Domain Models
- Pharmacokinetics: One-compartment and two-compartment PK models (Drusano, 2004)
- Pharmacodynamics: Emax and sigmoidal Emax models (Regoes et al., 2004)
- Bacterial dynamics: Logistic growth with antibiotic kill (Austin et al., 1999)
- Resistance: Two-population susceptible/resistant dynamics (Levin & Bonten, 2004)
Baseline Agents
- Random: Uniform random action selection
- Heuristic: Clinical guideline-based rules (Barlam et al., 2016)
- PPO: Proximal Policy Optimization via Stable-Baselines3
Training
# Train PPO on all environments
python -m antibiosim.training.train_all
# Train a single environment
antibiosim-train --env antibiosim/DoseOptimization-v0
Citation
@software{dhia2026antibiosim,
author = {Dhia, Hass},
title = {AntibioSim: Gymnasium Environments for Reinforcement Learning in Antimicrobial Stewardship},
year = {2026},
url = {https://github.com/HassDhia/antibiosim},
version = {0.1.0}
}
License
MIT License. Copyright (c) 2026 Hass Dhia, Smart Technology Investments Research Institute.
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file antibiosim-0.1.0.tar.gz.
File metadata
- Download URL: antibiosim-0.1.0.tar.gz
- Upload date:
- Size: 35.7 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
84b9a18ba04422d787fc67f676e9ba86a4318bf636827f7066b55a0c602f6642
|
|
| MD5 |
9d49e4e7d8520280a1999f220bb00ca3
|
|
| BLAKE2b-256 |
e7a6b68066bf59ca086a790a778f76fa08ab78e04d829359c8343296dbd2f4ae
|
File details
Details for the file antibiosim-0.1.0-py3-none-any.whl.
File metadata
- Download URL: antibiosim-0.1.0-py3-none-any.whl
- Upload date:
- Size: 33.4 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.2.0 CPython/3.11.6
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
fb1cd5c456bfd728860bc4286bf61fcb46592fbb8e9ff69c36206ae70aee31de
|
|
| MD5 |
82b9d1b05b0a260de9d7c9517f7c9c43
|
|
| BLAKE2b-256 |
4c54b3958ce9e4c8727e43b39b43baf668415bef34b9bd420544476e5ef6a058
|