Jax version of Stable Baselines, implementations of reinforcement learning algorithms.
Project description
Stable Baselines Jax (SB3 + JAX = SBX)
See https://github.com/araffin/sbx
Proof of concept version of Stable-Baselines3 in Jax.
Implemented algorithms:
- Soft Actor-Critic (SAC) and SAC-N
- Truncated Quantile Critics (TQC)
- Dropout Q-Functions for Doubly Efficient Reinforcement Learning (DroQ)
- Proximal Policy Optimization (PPO)
- Deep Q Network (DQN)
- Twin Delayed DDPG (TD3)
- Deep Deterministic Policy Gradient (DDPG)
- Batch Normalization in Deep Reinforcement Learning (CrossQ)
- Simplicity Bias for Scaling Up Parameters in Deep Reinforcement Learning (SimBa)
Example
from sbx import DDPG, DQN, PPO, SAC, TD3, TQC, CrossQ
model = TQC("MlpPolicy", "Pendulum-v1", verbose=1)
model.learn(total_timesteps=10_000, progress_bar=True)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
sbx_rl-0.26.0.tar.gz
(52.8 kB
view details)
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
sbx_rl-0.26.0-py3-none-any.whl
(62.9 kB
view details)
File details
Details for the file sbx_rl-0.26.0.tar.gz.
File metadata
- Download URL: sbx_rl-0.26.0.tar.gz
- Upload date:
- Size: 52.8 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
d5f77d17778dc6a94a2387684905011d5bf273d8c09e7613b896aef80bcc35c9
|
|
| MD5 |
f0440933e8772f771ca0be45679241ab
|
|
| BLAKE2b-256 |
046b09706724c09aba415fc97d8d132049df361381f34db9ea77897a659ba3c1
|
File details
Details for the file sbx_rl-0.26.0-py3-none-any.whl.
File metadata
- Download URL: sbx_rl-0.26.0-py3-none-any.whl
- Upload date:
- Size: 62.9 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/6.1.0 CPython/3.10.14
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
020f5edfdccd1ee5a1aeedfac56d6d0fd52d405a8a3b8edf86ed999ca3d6bfbd
|
|
| MD5 |
37702f250635260a02deac0fcfa7a589
|
|
| BLAKE2b-256 |
71c1b2f302058a438f6e0a6d328c3fe2d3da51ddcd5068cea82d08c9388c4683
|