A highly scalable and customizable safe reinforcement learning environment.

These details have not been verified by PyPI

Project links

Project description

Safety-Gymnasium

Why Safety-Gymnasium? | Documentation | Install guide | Customization | Future Plan

This library is currently under heavy development - if you have suggestions on the API or use cases you'd like to be covered, please open a GitHub issue or reach out. We'd love to hear about how you're using the library.

Safety-Gymnasium is a highly scalable and customizable Safe Reinforcement Learning library. It aims to deliver a good view of benchmarking Safe Reinforcement Learning (Safe RL) algorithms and a standardized set of environments. We provide a set of standard APIs which are compatible with information on constraints. Users can explore new insights via an elegant code framework and well-designed environments.

Why Safety-Gymnasium?

Here we provide a table for comparison of Safety-Gymnasium and existing SafeRL Environments libraries.

SafeRL Envs	Engine	Vectorized Environments	New Gym API⁽³⁾	Vision Input
Safety Gym	`mujoco-py`⁽¹⁾	❌	❌	minimally supported
safe-control-gym	PyBullet	❌	❌	❌
Velocity-Constraints⁽²⁾	N/A	❌	❌	❌
mujoco-circle	PyTorch	❌	❌	❌
Safety-Gymnasium	MuJoCo 2.3.0+	✅	✅	✅

^{(1): Maintenance (expect bug fixes and minor updates); the last commit is 19 Nov 2021. Safety Gym depends on mujoco-py 2.0.2.7, which was updated on Oct 12, 2019.}
^{(2): There is no official library for speed-related environments, and its associated cost constraints are constructed from info. But the task is widely used in the study of SafeRL, and we encapsulate it in Safety-Gymnasium.}
^{(3): In the gym 0.26.0 release update, a new API of interaction was redefined.}

Environments

We designed a variety of safety-enhanced learning tasks and integrated the contributions from the RL community: safety-velocity, safety-run, safety-circle, safety-goal, safety-button, etc. We introduce a unified safety-enhanced learning benchmark environment library called Safety-Gymnasium.

Further, to facilitate the progress of community research, we redesigned Safety Gym and removed the dependency on mujoco-py. We built it on top of MuJoCo and fixed some bugs, more specific bug reports can refer to Safety Gym's BUG Report.

Here is a list of all the environments we support for now; some are being tested in our baselines, and we will gradually release them in later updates.

Category	Task	Agent	Example
Safe Navigation	Goal[012]	Point, Car, Doggo, Racecar, Ant	SafetyPointGoal1-v0
	Button[012]
	Push[012]
	Circle[012]
Velocity	Velocity	HalfCheetah, Hopper, Swimmer, Walker2d, Ant, Humanoid	SafetyAntVelocity-v1

Note: We newly support Doggo agent in v0.4.0, the corresponding benchmarks, pictures, and documentation will be updated soon. And PRs are welcome.

Here are some screenshots of the Safe Navigation tasks.

Agents

Point	Car	Racecar
Ant	Coming soon…	Coming soon…

Tasks

Goal0	Goal1	Goal2
Button0	Button1	Button2
Push0	Push1	Push2
Circle0	Circle1	Circle2
Coming soon…	Coming soon…	Coming soon…

Vision-base Safe RL

Vision-based safety reinforcement learning lacks realistic scenarios. Although the original Safety Gym could minimally support visual input, the scenarios were too similar. To facilitate the validation of visual-based safety reinforcement learning algorithms, we have developed a set of realistic vision-based SafeRL tasks, which are currently being validated on the baseline. In the later updates, we will release that part of the environment of Safety-Gymnasium.

For the appetizer, the images are as follows:

Environment Usage

Notes: We support explicitly expressing the cost based on Gymnasium APIs. The step method returns 6 items (next_obervation, reward, cost, terminated, truncated, info) with an extra cost field.

import safety_gymnasium

env_id = 'SafetyPointGoal1-v0'
env = safety_gymnasium.make(env_id)

obs, info = env.reset()
while True:
    act = env.action_space.sample()
    obs, reward, cost, terminated, truncated, info = env.step(act)
    if terminated or truncated:
        break
    env.render()

We also provide two convenience wrappers for converting the Safety-Gymnasium environment to the standard Gymnasium API and vice versa.

# Safety-Gymnasium API: step returns (next_obervation, reward, cost, terminated, truncated, info)
# Gymnasium API:        step returns (next_obervation, reward, terminated, truncated, info) and cost is in the `info` dict associated with a str key `'cost'`

safety_gymnasium_env = safety_gymnasium.make(env_id)
gymnasium_env = safety_gymnasium.wrappers.SafetyGymnasium2Gymnasium(safety_gymnasium_env)

safety_gymnasium_env = safety_gymnasium.wrappers.Gymnasium2SafetyGymnasium(gymnasium_env)

Users can apply Gymnasium wrappers easily with:

import gymnasium
import safety_gymnasium

def make_safe_env(env_id):
    safe_env = safety_gymnasium.make(env_id)
    env = safety_gymnasium.wrappers.SafetyGymnasium2Gymnasium(safe_env)
    env = gymnasium.wrappers.SomeWrapper1(env)
    env = gymnasium.wrappers.SomeWrapper2(env, argname1=arg1, argname2=arg2)
    ...
    env = gymnasium.wrappers.SomeWrapperN(env)
    safe_env = safety_gymnasium.wrappers.Gymnasium2SafetyGymnasium(env)
    return safe_env

import functools

import gymnasium
import safety_gymnasium

def make_safe_env(env_id):
    return safety_gymnasium.wrappers.with_gymnasium_wrappers(
        safety_gymnasium.make(env_id),
        gymnasium.wrappers.SomeWrapper1,
        functools.partial(gymnasium.wrappers.SomeWrapper2, argname1=arg1, argname2=arg2),
        ...,
        gymnasium.wrappers.SomeWrapperN,
    )

In addition, for all Safety-Gymnasium environments, we also provide corresponding Gymnasium environments with a suffix Gymnasium in the environment id. For example:

import gymnasium
import safety_gymnasium

safety_gymnasium.make('SafetyPointGoal1-v0')    # step returns (next_obervation, reward, cost, terminated, truncated, info)
gymnasium.make('SafetyPointGoal1Gymnasium-v0')  # step returns (next_obervation, reward, terminated, truncated, info)

Installation

Install from PyPI

pip install safety-gymnasium

Install from source

conda create -n <envname> python=3.8
conda activate <envname>

git clone git@github.com:OmniSafeAI/safety-gymnasium.git
cd safety-gymnasium
pip install -e .

Important Notes

If you failed to render on your server, you can try:

echo "export MUJOCO_GL=osmesa" >> ~/.bashrc
source ~/.bashrc
apt-get install libosmesa6-dev
apt-get install python3-opengl

Customize your environments

We construct a highly expandable framework of code so that you can easily comprehend it and design your environments to facilitate your research with no more than 100 lines of code on average.

For details, please refer to our documentation. Here is a minimal example:

# import the objects you want to use
# or you can define specific objects by yourself, just make sure obeying our specification
from safety_gymnasium.assets.geoms import Apples
from safety_gymnasium.bases import BaseTask

# inherit the basetask
class MytaskLevel0(BaseTask):
    def __init__(self, config):
        super().__init__(config=config)
        # define some properties
        self.num_steps = 500
        self.agent.placements = [(-0.8, -0.8, 0.8, 0.8)]
        self.agent.keepout = 0
        self.lidar_conf.max_dist = 6
        # add objects into environments
        self.add_geoms(Apples(num=2, size=0.3))

    def calculate_reward(self):
        # implement your reward function
        # Note: cost calculation is based on objects, so it's automatic
        reward = 1
        return reward

    def specific_reset(self):
        # depending on your task

    def specific_step(self):
        # depending on your task

    def update_world(self):
        # depending on your task

    @property
    def goal_achieved(self):
        # depending on your task

Future Plans

Vision-based environments
Bring in other robots
Tested on different Python versions
Tested on Windows and macOS

License

Safety-Gymnasium is released under Apache License 2.0.

Project details

These details have not been verified by PyPI

Project links

Release history Release notifications | RSS feed

1.0.0

Jul 21, 2023

0.4.1

Jul 12, 2023

This version

0.4.0

May 10, 2023

0.3.0

May 10, 2023

0.2.0

Apr 5, 2023

0.1.3

Mar 26, 2023

0.1.2

Mar 14, 2023

0.1.1

Mar 1, 2023

0.1.0

Feb 11, 2023

0.1.0b0 pre-release

Feb 7, 2023

0.0.1 yanked

Nov 19, 2022

Reason this release was yanked:

This version no use.

Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

safety-gymnasium-0.4.0.tar.gz (26.6 MB view details)

Uploaded May 10, 2023 Source

Built Distribution

safety_gymnasium-0.4.0-py3-none-any.whl (26.7 MB view details)

Uploaded May 10, 2023 Python 3

File details

Details for the file safety-gymnasium-0.4.0.tar.gz.

File metadata

Download URL: safety-gymnasium-0.4.0.tar.gz
Upload date: May 10, 2023
Size: 26.6 MB
Tags: Source
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for safety-gymnasium-0.4.0.tar.gz
Algorithm	Hash digest
SHA256	`fec978ce83e59389ea92f21846e73c15a0772cddfd401e40fa3a4392b9cb9a02`
MD5	`14c0f8224fb4931b21ce134b2e6550dc`
BLAKE2b-256	`88d9792324ecd9216946f031c26f5a40ff8db48f01010a08d494473789106104`

See more details on using hashes here.

File details

Details for the file safety_gymnasium-0.4.0-py3-none-any.whl.

File metadata

Download URL: safety_gymnasium-0.4.0-py3-none-any.whl
Upload date: May 10, 2023
Size: 26.7 MB
Tags: Python 3
Uploaded using Trusted Publishing? No
Uploaded via: twine/4.0.2 CPython/3.8.16

File hashes

Hashes for safety_gymnasium-0.4.0-py3-none-any.whl
Algorithm	Hash digest
SHA256	`7a429483f0a6cd07d3a9bfad66364aa0d528195ef3353f4853399797da8e8f1c`
MD5	`3db7dfec4fc033a1de414fbb383bca4f`
BLAKE2b-256	`9549be31008c8736cc9a70a6a65dc406991d32ce8f79f02bcb3c1a32f0aeb682`

See more details on using hashes here.

safety-gymnasium 0.4.0

Navigation

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Project description

Safety-Gymnasium

Why Safety-Gymnasium?

Environments

Agents

Tasks

Vision-base Safe RL

Environment Usage

Installation

Install from PyPI

Install from source

Important Notes

Customize your environments

Future Plans

License

Project details

Verified details

Maintainers

Unverified details

Project links

Meta

Classifiers

Release history Release notifications | RSS feed

Download files

Source Distribution

Built Distribution

File details

File metadata

File hashes

File details

File metadata

File hashes