Accelerated gridworld navigation with JAX for deep reinforcement learning
Project description
NAVIX
Quickstart | Installation | Examples | Cite
What is NAVIX?
NAVIX is minigrid in JAX, >10000x faster with Autograd and XLA support. You can see a superficial performance comparison here.
Installation
We currently support the OSs supported by JAX. You can find a description here.
You might want to follow the same guide to install jax for your faviourite accelerator (e.g. CPU, GPU, or TPU ).
Then, install navix
and its dependencies with:
pip install navix
Examples
XLA compilation
One straightforward use case is to accelerate the computation of the environment with XLA compilation. For example, here we vectorise the environment to run multiple environments in parallel, and compile the full training run.
You can find a partial performance comparison with minigrid in the docs.
import jax
import navix as nx
def run(seed)
env = nx.environments.Room(16, 16, 8)
key = jax.random.PRNGKey(seed)
timestep = env.reset(key)
actions = jax.random.randint(key, (N_TIMESTEPS,), 0, 6)
def body_fun(timestep, action):
timestep = env.step(timestep, jnp.asarray(action))
return timestep, ()
return jax.lax.scan(body_fun, timestep, jnp.asarray(actions, dtype=jnp.int32))[0]
final_timestep = jax.jit(jax.vmap(run))(jax.numpy.arange(1000))
Backpropagation through the environment
Another use case it to backpropagate through the environment transition function, for example to learn a world model.
TODO(epignatelli): add example.
Cite
If you use helx
please consider citing it as:
@misc{pignatelli2023navix,
author = {Pignatelli, Eduardo},
title = {Navix: Accelerated gridworld navigation with JAX},
year = {2023},
publisher = {GitHub},
journal = {GitHub repository},
howpublished = {\url{https://github.com/epignatelli/navix}}
}
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Hashes for Navix-0.2.0-py2.py3-none-any.whl
Algorithm | Hash digest | |
---|---|---|
SHA256 | 3e4c39a735fa3425624747c6aa1f38b015bae19b3ae5e7d0c918e190ba3e754c |
|
MD5 | d1ab8e2d574d60f1d43aee599f875ae0 |
|
BLAKE2b-256 | 09493ba7992dc3db720cda3247bcee918ba4728a2c2ab75b36777a7878fc40af |