
A simulation toolkit for non-stationary Markov decision processes


NS-Gym: A Comprehensive and Open-Source Simulation Framework for Non-Stationary Markov Decision Processes

NS-Gym (Non-Stationary Gym) is a flexible framework providing a standardized abstraction for both modeling Non-Stationary Markov Decision Processes (NS-MDPs) and the key problem types that a decision-making entity may encounter in such environments.

Built on top of the popular Gymnasium library, NS-Gym provides a set of wrappers for existing environments, making it easy to incorporate non-stationary dynamics and manage the nature of agent-environment interaction specific to NS-MDPs.

A key feature of NS-Gym is emulating the core problem types of decision-making in non-stationary settings; these problem types concern not only the ability to adapt to changes in the environment but also the ability to detect and characterize these changes.

We currently support the Gymnasium classic control suite of environments, MuJoCo environments, and stochastic environments like FrozenLake. NS-Gym is designed to be easily extensible, allowing users to create their own non-stationary environments by defining parameter change schedules and update functions. We welcome contributions from the community to expand the library of non-stationary environments and algorithms!
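To make the "parameter change schedules and update functions" idea concrete, here is a minimal standalone sketch in plain Python. The class names mirror NS-Gym's concepts, but this is an illustration, not the actual ns_gym API:

```python
# Illustrative sketch (not the real ns_gym API): a scheduler decides
# *when* a parameter changes, and an update function decides *how*.

class PeriodicScheduler:
    """Fires every `period` timesteps."""
    def __init__(self, period):
        self.period = period

    def should_update(self, t):
        return t % self.period == 0


class IncrementUpdate:
    """Adds a fixed amount k whenever its scheduler fires."""
    def __init__(self, scheduler, k):
        self.scheduler = scheduler
        self.k = k

    def __call__(self, value, t):
        return value + self.k if self.scheduler.should_update(t) else value


# Gravity increases by 0.1 every three timesteps.
update = IncrementUpdate(PeriodicScheduler(period=3), k=0.1)
gravity = 9.8
for t in range(1, 7):
    gravity = update(gravity, t)  # fires at t = 3 and t = 6
```

Separating "when" from "how" is what lets the same update rule be reused across environments and parameters.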


Project Webpage and Documentation

Visit our project webpage for tutorials, core concepts, and documentation: nsgym.io


Installation

To install NS-Gym, install it with pip directly from GitHub (uv is recommended but not required):

uv pip install git+https://github.com/scope-lab-vu/ns_gym

We'll eventually release NS-Gym on PyPI for easier installation.


White Paper

NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes

Read the preprint on arXiv (an updated version, to be published at the NeurIPS 2025 Datasets and Benchmarks track, is coming soon). See the tag submission/ns_gymv0 for the benchmarking code used in the paper.

Citation

@article{keplinger2025ns,
  title={NS-Gym: Open-Source Simulation Environments and Benchmarks for Non-Stationary Markov Decision Processes},
  author={Keplinger, Nathaniel S and Luo, Baiting and Bektas, Iliyas and Zhang, Yunuo and Wray, Kyle Hollins and Laszka, Aron and Dubey, Abhishek and Mukhopadhyay, Ayan},
  journal={arXiv preprint arXiv:2501.09646},
  year={2025}
}

Decision Making Algorithm Support

NS-Gym is designed to be compatible with existing reinforcement learning libraries such as Stable Baselines3. Additionally, NS-Gym provides baseline algorithms designed explicitly for non-stationary environments, as well as a leaderboard to compare algorithm performance on various non-stationary tasks.
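One practical point about such compatibility: libraries that expect a flat array observation and a float reward need NS-Gym's dict observation and reward object unpacked. The adapter and stub environment below are a hypothetical sketch (FlattenNSObs is not a real NS-Gym or Stable Baselines3 class), assuming the observation/reward layout shown in the Quickstart:

```python
# Hypothetical adapter sketch: expose only the raw state and a float
# reward from an NS-Gym-style environment.

class FlattenNSObs:
    def __init__(self, env):
        self.env = env

    def reset(self, **kwargs):
        obs, info = self.env.reset(**kwargs)
        return obs["state"], info

    def step(self, action):
        obs, reward, done, truncated, info = self.env.step(action)
        # NS-Gym rewards carry change metadata; downstream code wants a float.
        r = reward.reward if hasattr(reward, "reward") else reward
        return obs["state"], r, done, truncated, info


class _StubEnv:
    """Stand-in for a wrapped NS-Gym environment, for demonstration only."""
    def reset(self, **kwargs):
        return {"state": [0.0, 0.0], "env_change": None}, {}

    def step(self, action):
        return {"state": [0.1, 0.0], "env_change": None}, 1.0, False, False, {}


env = FlattenNSObs(_StubEnv())
state, info = env.reset()
state, r, done, truncated, info = env.step(0)
```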


NS-Gym in Action

Here are three examples of non-stationary environments created using NS-Gym. Each demonstrates a transition from an initial MDP ($\mathcal{MDP}_0$) to a modified MDP ($\mathcal{MDP}_1$) by changing environment parameters ($\theta_0 \rightsquigarrow \theta_1$).

| Stationary MDP | Change Type ($\theta_0 \rightsquigarrow \theta_1$) | Non-Stationary MDP |
| --- | --- | --- |
| CartPole: Stationary MDP | At timestep $t$, gravity massively increases according to a user-defined step function. | CartPole: Non-Stationary MDP |
| FrozenLake: Stationary MDP | Probability of moving in the intended direction goes to 0 just before reaching the goal. | FrozenLake: Non-Stationary MDP |
| Ant: Stationary MDP | Magnitude of gravity gradually decreases at each timestep following a geometric progression. | Ant: Non-Stationary MDP |

Note: This type of parameter shift is just one example of how an NS-MDP can be implemented. The policies controlling the agents are detailed in the full documentation.
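The three change types in the table can be sketched as pure functions of the timestep. All parameter names and constants below are illustrative choices, not values taken from NS-Gym:

```python
# Illustrative parameter trajectories for the three example changes.

def step_increase(t, base=9.8, boost=20.0, t_change=50):
    """CartPole: gravity jumps once at a user-defined timestep."""
    return base + boost if t >= t_change else base

def slip_near_goal(dist_to_goal, p_intended=1.0 / 3.0):
    """FrozenLake: intended-move probability drops to 0 near the goal."""
    return 0.0 if dist_to_goal <= 1 else p_intended

def geometric_decay(t, base=9.8, ratio=0.99):
    """Ant: gravity magnitude decays geometrically each timestep."""
    return base * ratio ** t
```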


Quickstart

Suppose we want to model a non-stationary version of the classic CartPole environment, where the pole's mass increases by 0.1 units at each time step and the system's gravity evolves via a random walk every three time steps. NS-Gym lets us emulate "runtime" monitors that can detect changes in the environment (but not the magnitude of those changes) and a "model updater" that tells us the magnitude of the changes. In this case we simply want the decision-making entity to be notified that the environment has changed, without knowing to what extent; this corresponds to the agent having a "basic notification level". The following code snippet shows the general experimental setup in the CartPole Gymnasium environment using NS-Gym.

###### Step 1: Import necessary gym and ns_gym modules
import gymnasium as gym
import ns_gym
from ns_gym.wrappers import NSClassicControlWrapper
from ns_gym.schedulers import ContinuousScheduler, PeriodicScheduler
from ns_gym.update_functions import RandomWalk, IncrementUpdate
from ns_gym.benchmark_algorithms import MCTS


###### Step 2: Create a standard gym environment
env = gym.make("CartPole-v1")

###### Step 3: Define the schedulers and update functions that describe
# how the non-stationary parameters evolve over time
scheduler_1 = ContinuousScheduler()
scheduler_2 = PeriodicScheduler(period=3)

update_function1 = IncrementUpdate(scheduler_1, k=0.1)
update_function2 = RandomWalk(scheduler_2)

###### Step 4: Map parameters to update functions
tunable_params = {"masspole": update_function1, "gravity": update_function2}

###### Step 5: Set the notification level and pass the environment and parameters into the wrapper
ns_env = NSClassicControlWrapper(env, tunable_params, change_notification=True)

###### Step 6: Set up the environment-agent interaction loop
done = False
truncated = False

episode_reward = 0

obs,info = ns_env.reset()

planning_env = ns_env.get_planning_env()
mcts_agent = MCTS(planning_env, state=obs["state"], d=50, m=100, c=1.4, gamma=0.99)

timestep = 0
while not (done or truncated):
    action = mcts_agent.act(obs, planning_env)
    obs, reward, done, truncated, info = ns_env.step(action)


    if timestep % 10 == 0:
        print("Timestep: ", timestep)
        print("obs: ", obs)
        print("reward: ", reward)   
        print("########")
        print("\n")
    planning_env = ns_env.get_planning_env()
    episode_reward += reward.reward
    timestep += 1

print("Episode Reward: ", episode_reward)

If we run this code snippet, the environment observation and reward at timestep 0 may look like this:

Timestep:  0

obs:  {'state': array([-0.03006991,  0.19717823,  0.02711801, -0.3215324 ], dtype=float32), 
        'env_change': {'masspole': 1, 'gravity': 1}, 
        'delta_change': {'masspole': 0.0, 'gravity': 0.0}, 
        'relative_time': 1}

reward:  Reward(reward=1.0, 
                env_change={'masspole': 1, 'gravity': 1}, 
                delta_change={'masspole': 0.0, 'gravity': 0.0}, 
                relative_time=1)

The obs dictionary consists of the following terms:

  • state: the standard Gymnasium observation of the environment.
  • env_change: a dictionary indicating whether each parameter has changed (1 indicates a change, 0 indicates no change). This is only available if change_notification=True is set in the wrapper.
  • delta_change: a dictionary indicating the magnitude of change for each parameter. This is only available if delta_change_notification=True is set in the wrapper. Defaults to zero if not set.
  • relative_time: the current time step of the environment.

The reward object is a dataclass rather than a dictionary, and it contains the same terms. The observation remains a dictionary to maintain compatibility with Gymnasium, while the reward is a dataclass so that non-stationary rewards can be supported more easily in the future with a more robust data structure.
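A standalone sketch of what such a dataclass can look like (the real Reward class ships with ns_gym; this illustrative copy only mirrors the fields printed above):

```python
from dataclasses import dataclass

# Illustrative mirror of the Reward fields shown in the sample output;
# not the actual ns_gym implementation.
@dataclass
class Reward:
    reward: float        # the underlying Gymnasium reward
    env_change: dict     # per-parameter change flags (0 or 1)
    delta_change: dict   # per-parameter change magnitudes
    relative_time: int   # current environment time step

r = Reward(reward=1.0,
           env_change={"masspole": 1, "gravity": 1},
           delta_change={"masspole": 0.0, "gravity": 0.0},
           relative_time=1)
```

Because it is a dataclass, fields are accessed as attributes (r.reward) rather than by key.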

Tutorial:

A more comprehensive tutorial can be found here

Development and testing

We welcome any contributions to this NS-Gym project! If you find a bug or want to add a new feature, please feel free to open an issue or submit a pull request.

Clone the repository, install the package with its dependencies in editable mode, and run the tests to ensure everything is working correctly. We use uv for package management.

git clone https://github.com/scope-lab-vu/ns_gym.git
cd ns_gym
uv pip install -e ".[all]" --force-reinstall

To run all tests in the project, run:

pytest tests/
