Contrib package of Stable Baselines3, experimental code.
Project description
Stable-Baselines3 - Contrib (SB3-Contrib)
Contrib package for Stable-Baselines3 - Experimental reinforcement learning (RL) code. "sb3-contrib" for short.
What is SB3-Contrib?
A place for RL algorithms and tools that are considered experimental, e.g. implementations of the latest publications. Goal is to keep the simplicity, documentation and style of stable-baselines3 but for less matured implementations.
Why create this repository?
Over the span of stable-baselines and stable-baselines3, the community has been eager to contribute in form of better logging utilities, environment wrappers, extended support (e.g. different action spaces) and learning algorithms.
However sometimes these utilities were too niche to be considered for stable-baselines or proved to be too difficult to integrate well into the existing code without creating a mess. sb3-contrib aims to fix this by not requiring the neatest code integration with existing code and not setting limits on what is too niche: almost everything remotely useful goes! We hope this allows us to provide reliable implementations following stable-baselines usual standards (consistent style, documentation, etc) beyond the relatively small scope of utilities in the main repository.
Features
See documentation for the full list of included features.
RL Algorithms:
- Truncated Quantile Critics (TQC)
- Quantile Regression DQN (QR-DQN)
- PPO with invalid action masking (MaskablePPO)
- Trust Region Policy Optimization (TRPO)
- Augmented Random Search (ARS)
Gym Wrappers:
Documentation
Documentation is available online: https://sb3-contrib.readthedocs.io/
Installation
Note: You need the master
version of Stable Baselines3.
To install Stable Baselines3 master
version:
pip install git+https://github.com/DLR-RM/stable-baselines3
To install Stable Baselines3 contrib master
version:
pip install git+https://github.com/Stable-Baselines-Team/stable-baselines3-contrib
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for sb3_contrib-2.0.0a9-py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | e6e8e1c8c78f4b8c335bf745128e624baf8c3d79f60afc75ab319cf46ace5755 |
|
MD5 | 21cdb05fc0c046dcd32db4930cad0a4f |
|
BLAKE2b-256 | ed82fbebc9a2f73578b6ed76fabf632ec9dbce427e9072fecff8ed7f9b62d5bf |