Skip to main content

Jax version of Stable Baselines, implementations of reinforcement learning algorithms.

Project description

Stable Baselines Jax (SB3 + JAX = SBX)

See https://github.com/araffin/sbx

Proof of concept version of Stable-Baselines3 in Jax.

Implemented algorithms:

Example

from sbx import DDPG, DQN, PPO, SAC, TD3, TQC, CrossQ

model = TQC("MlpPolicy", "Pendulum-v1", verbose=1)
model.learn(total_timesteps=10_000, progress_bar=True)

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

sbx_rl-0.18.0.tar.gz (44.2 kB view details)

Uploaded Source

Built Distribution

sbx_rl-0.18.0-py3-none-any.whl (57.1 kB view details)

Uploaded Python 3

File details

Details for the file sbx_rl-0.18.0.tar.gz.

File metadata

  • Download URL: sbx_rl-0.18.0.tar.gz
  • Upload date:
  • Size: 44.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for sbx_rl-0.18.0.tar.gz
Algorithm Hash digest
SHA256 670f2bf095ec21ba6f8171602294baf0123787fe7be6811ebab276fb5010b8b3
MD5 efb29d66eb406205ac36afebe1db4cd8
BLAKE2b-256 c329a2e6d594afa11093e6bc0067f44ce650e1f62fa25691b0f9df9d7365f3e8

See more details on using hashes here.

File details

Details for the file sbx_rl-0.18.0-py3-none-any.whl.

File metadata

  • Download URL: sbx_rl-0.18.0-py3-none-any.whl
  • Upload date:
  • Size: 57.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/5.1.1 CPython/3.10.14

File hashes

Hashes for sbx_rl-0.18.0-py3-none-any.whl
Algorithm Hash digest
SHA256 75ade634a33555ad4c4a81523bb0f99c89d1b3bc89fb74990ef87b22379abd9c
MD5 d0b49172c5a9828e19b9dffc763a9e40
BLAKE2b-256 f165ee9f52fb688fe9d3419a155319d25bcefc8df4a664bed3392f09deccb6a3

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page