Jax version of Stable Baselines: implementations of reinforcement learning algorithms.
Project description
Stable Baselines Jax (SB3 + JAX = SBX)
See https://github.com/araffin/sbx
Proof of concept version of Stable-Baselines3 in Jax.
Implemented algorithms:
- Soft Actor-Critic (SAC) and SAC-N
- Truncated Quantile Critics (TQC)
- Dropout Q-Functions for Doubly Efficient Reinforcement Learning (DroQ)
- Proximal Policy Optimization (PPO)
- Deep Q Network (DQN)
- Twin Delayed DDPG (TD3)
- Deep Deterministic Policy Gradient (DDPG)
- Batch Normalization in Deep Reinforcement Learning (CrossQ)
- Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning (SimBa)
Example
from sbx import DDPG, DQN, PPO, SAC, TD3, TQC, CrossQ
model = TQC("MlpPolicy", "Pendulum-v1", verbose=1)
model.learn(total_timesteps=10_000, progress_bar=True)
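After training, the model can be rolled out in its environment. The sketch below assumes SBX exposes the same `get_env()`, `predict()` and vectorized `step()` interface as Stable-Baselines3, which it is modelled on. The helper name `train_and_rollout` and its parameters are hypothetical, introduced only for illustration.

```python
def train_and_rollout(algo_name="TQC", env_id="Pendulum-v1",
                      total_timesteps=10_000, n_steps=200):
    """Train an SBX algorithm, then sum the rewards of a short rollout.

    Hypothetical helper: assumes the SB3-style API (get_env, predict,
    vectorized step returning a 4-tuple) is available on SBX models.
    """
    # Imported lazily so this snippet can be loaded without sbx installed.
    import sbx

    # Look up the algorithm class by name, e.g. sbx.TQC or sbx.SAC.
    # Note: the algorithm must support the action space of env_id
    # (DQN, for instance, needs a discrete action space).
    algo_cls = getattr(sbx, algo_name)
    model = algo_cls("MlpPolicy", env_id, verbose=0)
    model.learn(total_timesteps=total_timesteps)

    # The model wraps the environment in a vectorized env with one copy.
    vec_env = model.get_env()
    obs = vec_env.reset()
    total_reward = 0.0
    for _ in range(n_steps):
        # deterministic=True disables exploration noise at evaluation time.
        action, _state = model.predict(obs, deterministic=True)
        obs, rewards, dones, infos = vec_env.step(action)
        total_reward += float(rewards[0])
    vec_env.close()
    return total_reward
```

Calling `train_and_rollout("SAC")` would run the same loop with a different algorithm from the list above. The vectorized environment resets itself when an episode ends, so the loop does not need to check `dones`.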
Download files
Source Distribution
- sbx_rl-0.21.0.tar.gz (50.1 kB)
Built Distribution
- sbx_rl-0.21.0-py3-none-any.whl (61.0 kB)
File details
Details for the file sbx_rl-0.21.0.tar.gz.
File metadata
- Download URL: sbx_rl-0.21.0.tar.gz
- Upload date:
- Size: 50.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.14
File hashes
Algorithm | Hash digest
---|---
SHA256 | d377260afc23392dcc4dc923241e407288b595b9e913859ce6c6c964bcb3f5b1
MD5 | b8029ab6cdcd16a769a96f2ba1e0cb5c
BLAKE2b-256 | b936dede52569060071f9b4ffd9e358c19b51e862baf13b6cf9b264310bd0e38
File details
Details for the file sbx_rl-0.21.0-py3-none-any.whl.
File metadata
- Download URL: sbx_rl-0.21.0-py3-none-any.whl
- Upload date:
- Size: 61.0 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.14
File hashes
Algorithm | Hash digest
---|---
SHA256 | e2a39e737c0b935e12d89b41ab4e72c61f0f95de1c3718a6725c44324844ae31
MD5 | 4ac349b779626e20e592d61e41b4e3f5
BLAKE2b-256 | 3381395cd7fea909d83c8ca908b57dc197194df797db059b32a06e0571f704e2
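The SHA256 digests listed under "File hashes" can be used to check that a downloaded file is intact. A minimal sketch using only the Python standard library follows; the function names `sha256_of` and `verify_download` are hypothetical, and the file is assumed to have been saved under its original name.

```python
import hashlib
from pathlib import Path

# SHA256 digests copied from the "File hashes" tables for each file.
EXPECTED_SHA256 = {
    "sbx_rl-0.21.0.tar.gz":
        "d377260afc23392dcc4dc923241e407288b595b9e913859ce6c6c964bcb3f5b1",
    "sbx_rl-0.21.0-py3-none-any.whl":
        "e2a39e737c0b935e12d89b41ab4e72c61f0f95de1c3718a6725c44324844ae31",
}


def sha256_of(path, chunk_size=65536):
    """Return the hex SHA256 digest of a file, read in fixed-size chunks."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        # iter() with a sentinel stops when read() returns empty bytes.
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def verify_download(path):
    """Return True if the file's SHA256 matches the listed digest.

    Raises KeyError if the file name is not one of the two listed files.
    """
    path = Path(path)
    expected = EXPECTED_SHA256[path.name]
    return sha256_of(path) == expected
```

For example, `verify_download("sbx_rl-0.21.0.tar.gz")` returns `True` only when the local file matches the published digest. SHA256 is used here rather than MD5 because MD5 is not collision resistant.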