Vectorizable RL algorithms in pure JAX

Maintainers

kerajli

Project description

Rejax
Hardware-Accelerated Reinforcement Learning Algorithms in pure Jax!

Rejax is a library of RL algorithms implemented in pure Jax. It lets you accelerate your RL pipelines by applying jax.jit, jax.vmap, jax.pmap, or any other transformation to whole training algorithms. Use it to quickly search for hyperparameters, evaluate agents for multiple seeds in parallel, or run meta-evolution experiments on your GPUs and TPUs. If you're new to rejax and want to learn more about it, take the tour below.

📸 Take a tour

[Image: rejax demo]

🏗 Installing rejax

Install via pip: pip install rejax
Install from source: pip install git+https://github.com/keraJLi/rejax

⚡ Vectorize training for incredible speedups!

Use jax.jit on the whole train function to run training exclusively on your GPU!
Use jax.vmap and jax.pmap on the initial seed or hyperparameters to train a whole batch of agents in parallel!

import jax

from rejax import SAC

# Get train function and initialize config for training
algo = SAC.create(env="CartPole-v1", learning_rate=0.001)

# Jit the training function
train_fn = jax.jit(algo.train)

# Vmap training function over 300 initial seeds
vmapped_train_fn = jax.vmap(train_fn)

# Train 300 agents!
keys = jax.random.split(jax.random.PRNGKey(0), 300)
train_state, evaluation = vmapped_train_fn(keys)
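
The same pattern extends from seeds to hyperparameters. The following is a minimal sketch that does not use rejax itself: toy_train is a hypothetical stand-in for a train function (plain gradient descent on a quadratic), used only to show how jax.vmap batches over a seed and a learning rate at the same time.

```python
import jax
import jax.numpy as jnp


def toy_train(key, learning_rate, num_steps=100):
    """Hypothetical stand-in for a train function (not part of rejax)."""
    target = jnp.array([1.0, -2.0])
    params = jax.random.normal(key, (2,))  # seed-dependent initialization

    def loss_fn(p):
        return jnp.sum((p - target) ** 2)

    def step(p, _):
        loss, grad = jax.value_and_grad(loss_fn)(p)
        return p - learning_rate * grad, loss

    # lax.scan keeps the whole loop inside one compiled program
    final_params, losses = jax.lax.scan(step, params, None, length=num_steps)
    return final_params, losses


# One seed and one learning rate per run
keys = jax.random.split(jax.random.PRNGKey(0), 8)
learning_rates = jnp.linspace(0.01, 0.3, 8)

# vmap maps over the leading axis of both arguments; jit compiles the batch
batched_train = jax.jit(jax.vmap(toy_train))
final_params, losses = batched_train(keys, learning_rates)
```

Every output gains a leading axis of size 8, one entry per (seed, learning rate) pair. Nesting a second jax.vmap would give a full grid of seeds and learning rates instead.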

Benchmark on an A100 80G and an Intel Xeon 4215R CPU. Note that the hyperparameters were set to the default values of cleanRL, including buffer sizes. Shrinking the buffers can yield additional speedups due to better caching, and enables training of even more agents in parallel.

[Figures: Speedup over cleanRL on hopper; Speedup over cleanRL on breakout]

🤖 Implemented algorithms

Algorithm	Link	Discrete	Continuous	Notes
PPO	here	✔	✔
SAC	here	✔	✔	discrete version as in Christodoulou, 2019
DQN	here	✔		incl. DDQN, Dueling DQN
PQN	here	✔
IQN	here	✔
TD3	here		✔

🛠 Easily extend and modify algorithms

The implementations focus on clarity! Easily modify the implemented algorithms by overriding isolated parts, such as the loss function, trajectory generation or parameter updates. For example, easily turn DQN into DDQN by writing:

import jax
import jax.numpy as jnp

from rejax import DQN

class DoubleDQN(DQN):
    def update(self, state, minibatch):
        # Calculate DDQN-specific targets
        targets = ddqn_targets(state, minibatch)

        # The loss function predicts Q-values and returns MSBE
        def loss_fn(params):
            ...
            return jnp.mean((targets - q_values) ** 2)

        # Calculate gradients
        grads = jax.grad(loss_fn)(state.q_ts.params)

        # Update train state
        q_ts = state.q_ts.apply_gradients(grads=grads)
        state = state.replace(q_ts=q_ts)
        return state
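
The example above leaves the DDQN-specific target computation open. As a minimal sketch of the standard double Q-learning target, the hypothetical helper double_q_targets below works on plain arrays of Q-values; its name and signature are assumptions for illustration and differ from the ddqn_targets(state, minibatch) call used above.

```python
import jax.numpy as jnp


def double_q_targets(q_online_next, q_target_next, rewards, dones, gamma=0.99):
    """Hypothetical helper (not part of rejax): double Q-learning targets.

    q_online_next, q_target_next: [batch, num_actions] Q-values for the next state
    rewards, dones: [batch]
    """
    # Select the greedy next action with the online network ...
    best_actions = jnp.argmax(q_online_next, axis=-1)
    # ... but evaluate that action with the target network
    next_values = jnp.take_along_axis(
        q_target_next, best_actions[:, None], axis=-1
    ).squeeze(-1)
    # No bootstrapping on terminal transitions
    return rewards + gamma * (1.0 - dones) * next_values


example_targets = double_q_targets(
    q_online_next=jnp.array([[1.0, 2.0], [3.0, 0.0]]),
    q_target_next=jnp.array([[10.0, 20.0], [30.0, 40.0]]),
    rewards=jnp.array([1.0, 1.0]),
    dones=jnp.array([0.0, 1.0]),
    gamma=0.9,
)
```

Decoupling action selection from action evaluation is the only difference from the usual DQN target, which takes the maximum over the target network's own Q-values.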

🔙 Flexible callbacks

Using callbacks, you can run logging to the console, disk, wandb, and much more, even when the whole train function is jitted! For example, run a jax.experimental.io_callback at regular intervals during training, or print the current policy's mean return:

def print_callback(algo, state, rng):
    policy = make_act(algo, state)           # Get current policy
    episode_returns = evaluate(policy, ...)  # Evaluate it
    jax.debug.print(                         # Print results
        "Step: {}. Mean return: {}",
        state.global_step,
        episode_returns.mean(),
    )
    return ()  # Must return PyTree (None is not a PyTree)

algo = algo.replace(eval_callback=print_callback)

Callbacks have the signature callback(algo, train_state, rng) -> PyTree. The callback is called every eval_freq training steps with the config and the current train state. Its outputs are aggregated over training and returned by the train function. The default callback runs a number of episodes in the training environment and returns their length and episodic return, such that the train function returns a training curve.

Importantly, this function is jit-compiled along with the rest of the algorithm. However, you can use one of Jax's callbacks such as jax.experimental.io_callback to implement model checkpointing, logging to wandb, and more, all while maintaining the advantages of a completely jittable training function.
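
To make the mechanics concrete, here is a minimal sketch in plain JAX rather than rejax's actual training loop. The names toy_train, eval_callback, log_to_host, EVAL_FREQ and NUM_EVALS are hypothetical. It shows a callback that runs every EVAL_FREQ steps inside a jitted function, forwards a metric to the host with jax.experimental.io_callback, and returns a PyTree that is stacked over training.

```python
import jax
import jax.numpy as jnp
from jax.experimental import io_callback

EVAL_FREQ = 10  # training steps between callback invocations (assumed value)
NUM_EVALS = 5   # number of callback invocations (assumed value)

host_log = []  # filled on the host while the jitted loop runs


def log_to_host(step, metric):
    # Runs in ordinary Python; could write to disk or a logging service instead
    host_log.append((int(step), float(metric)))


def eval_callback(params, step):
    metric = jnp.mean(params ** 2)
    # ordered=True keeps host calls in program order; None means no return value
    io_callback(log_to_host, None, step, metric, ordered=True)
    return {"step": step, "metric": metric}


@jax.jit
def toy_train(params):
    def train_step(p, _):
        return 0.9 * p, None  # stand-in for one parameter update

    def eval_iteration(carry, _):
        p, step = carry
        p, _ = jax.lax.scan(train_step, p, None, length=EVAL_FREQ)
        step = step + EVAL_FREQ
        # The second scan output is stacked across iterations
        return (p, step), eval_callback(p, step)

    init = (params, jnp.array(0, dtype=jnp.int32))
    (params, _), evaluation = jax.lax.scan(
        eval_iteration, init, None, length=NUM_EVALS
    )
    return params, evaluation


final_params, evaluation = toy_train(jnp.ones(3))
jax.effects_barrier()  # wait until all host callbacks have finished
```

Here evaluation is a dict whose leaves have a leading axis of length NUM_EVALS, which is the kind of aggregated output described above.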

💞 Alternatives in end-to-end GPU training

Libraries:

Brax: along with several environments, Brax implements PPO and SAC within its environment interface

Single file implementations:

PureJaxRL implements PPO, recurrent PPO and DQN
Stoix features DQN, DDPG, TD3, SAC, PPO, as well as popular extensions and more

✍ Cite us!

@misc{rejax, 
  title={rejax}, 
  url={https://github.com/keraJLi/rejax}, 
  journal={keraJLi/rejax}, 
  author={Liesen, Jarek and Lu, Chris and Lange, Robert}, 
  year={2024}
}

Release history

0.1.3 (this version): Jun 10, 2026
0.1.2: May 23, 2025
0.1.1: Nov 14, 2024
0.1.0: Sep 2, 2024
0.0.1: Jun 3, 2024

Download files

Source Distribution

rejax-0.1.3.tar.gz (22.7 MB)

Uploaded Jun 10, 2026 Source

Built Distribution

rejax-0.1.3-py3-none-any.whl (41.7 kB)

Uploaded Jun 10, 2026 Python 3

File details

Details for the file rejax-0.1.3.tar.gz.

File metadata

Download URL: rejax-0.1.3.tar.gz
Upload date: Jun 10, 2026
Size: 22.7 MB
Tags: Source
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rejax-0.1.3.tar.gz
Algorithm	Hash digest
SHA256	`dc3bfb7d033169242d09012bc4c68764997975090fbfc483a0360b8e0f266b65`
MD5	`69a4a1d0e4442dacd2007d3f8f4a4f8a`
BLAKE2b-256	`03e2b0273e4e600c3a586cea915b633adc040b224b09ba3d3ca718001a4e2a53`
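
A downloaded file can be checked against the digests listed above. The following is a minimal sketch using only Python's standard hashlib module; the helper names sha256_of and matches_published_hash are hypothetical, and the file path is whatever location you downloaded the archive to.

```python
import hashlib


def sha256_of(path, chunk_size=1 << 20):
    """Return the hex SHA256 digest of a file, read in chunks to bound memory."""
    digest = hashlib.sha256()
    with open(path, "rb") as f:
        for chunk in iter(lambda: f.read(chunk_size), b""):
            digest.update(chunk)
    return digest.hexdigest()


def matches_published_hash(path, expected_hex):
    """Compare a file's digest with a published hex digest (case-insensitive)."""
    return sha256_of(path) == expected_hex.strip().lower()
```

For example, matches_published_hash("rejax-0.1.3.tar.gz", "<SHA256 from the table above>") should return True for an intact download.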

Provenance

The following attestation bundles were made for rejax-0.1.3.tar.gz:

Publisher: python-publish-2.yml on keraJLi/rejax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rejax-0.1.3.tar.gz
- Subject digest: dc3bfb7d033169242d09012bc4c68764997975090fbfc483a0360b8e0f266b65
- Sigstore transparency entry: 1779020737
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: keraJLi/rejax@4587115ef2462f60219b8c65efdd6f7f3e027968
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/keraJLi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish-2.yml@4587115ef2462f60219b8c65efdd6f7f3e027968
- Trigger Event: release

File details

Details for the file rejax-0.1.3-py3-none-any.whl.

File metadata

Download URL: rejax-0.1.3-py3-none-any.whl
Upload date: Jun 10, 2026
Size: 41.7 kB
Tags: Python 3
Uploaded using Trusted Publishing? Yes
Uploaded via: twine/6.1.0 CPython/3.13.12

File hashes

Hashes for rejax-0.1.3-py3-none-any.whl
Algorithm	Hash digest
SHA256	`86c7468b5b3c9b520b813d0e89137f36e1472d14c327ae86c5328b6e0ca629c0`
MD5	`929833fcc84232db5ed0df5c2408f05a`
BLAKE2b-256	`995426ff8e17574f3ec915833420282d630c6f94bba7f35b0897d5c8dd016d2c`

Provenance

The following attestation bundles were made for rejax-0.1.3-py3-none-any.whl:

Publisher: python-publish-2.yml on keraJLi/rejax

Attestations: Values shown here reflect the state when the release was signed and may no longer be current.

Statement:
- Statement type: https://in-toto.io/Statement/v1
- Predicate type: https://docs.pypi.org/attestations/publish/v1
- Subject name: rejax-0.1.3-py3-none-any.whl
- Subject digest: 86c7468b5b3c9b520b813d0e89137f36e1472d14c327ae86c5328b6e0ca629c0
- Sigstore transparency entry: 1779021030
- Sigstore integration time: Jun 10, 2026
Source repository:
- Permalink: keraJLi/rejax@4587115ef2462f60219b8c65efdd6f7f3e027968
- Branch / Tag: refs/tags/v0.1.3
- Owner: https://github.com/keraJLi
- Access: public
Publication detail:
- Token Issuer: https://token.actions.githubusercontent.com
- Runner Environment: github-hosted
- Publication workflow: python-publish-2.yml@4587115ef2462f60219b8c65efdd6f7f3e027968
- Trigger Event: release
