Jax version of Stable Baselines, implementations of reinforcement learning algorithms.
Project description
Stable Baselines Jax (SB3 + JAX = SBX)
See https://github.com/araffin/sbx
Proof of concept version of Stable-Baselines3 in Jax.
Implemented algorithms:
- Soft Actor-Critic (SAC) and SAC-N
- Truncated Quantile Critics (TQC)
- Dropout Q-Functions for Doubly Efficient Reinforcement Learning (DroQ)
- Proximal Policy Optimization (PPO)
- Deep Q Network (DQN)
- Twin Delayed DDPG (TD3)
- Deep Deterministic Policy Gradient (DDPG)
Example
from sbx import TQC, DroQ, SAC, DQN, PPO, TD3, DDPG
model = TQC("MlpPolicy", "Pendulum-v1", verbose=1)
model.learn(total_timesteps=10_000, progress_bar=True)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
sbx-rl-0.8.0.tar.gz
(29.3 kB
view details)
Built Distribution
sbx_rl-0.8.0-py3-none-any.whl
(44.1 kB
view details)
File details
Details for the file sbx-rl-0.8.0.tar.gz
.
File metadata
- Download URL: sbx-rl-0.8.0.tar.gz
- Upload date:
- Size: 29.3 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
836dbb53001d6ca0c075b4b033802800cde57aed88b0364f5908e441369a083b
|
|
MD5 |
5c362350d5a59e3aa365afbded2dc49c
|
|
BLAKE2b-256 |
a9721798582e870f8257bcf04d49dd1d164ceb2e179752248a137119ce4853a2
|
File details
Details for the file sbx_rl-0.8.0-py3-none-any.whl
.
File metadata
- Download URL: sbx_rl-0.8.0-py3-none-any.whl
- Upload date:
- Size: 44.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/4.0.2 CPython/3.9.13
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 |
2b435ec2d22e0d65914c7724c9acb1dffe37c7f496b9012c39d8340582fd0c3f
|
|
MD5 |
a64e0ff1207deeef9ee63b25d4ea778c
|
|
BLAKE2b-256 |
f42a1cf3c19d31e7d9795258be738be25262f28cff2d620503978d04ea7bd1d7
|