JAX implementation of Reinforcement Learning agents
Project description
Jaxagents
Jaxagents is a Python implementation of Reinforcement Learning agents built upon JAX. The PyPI page of the project can be found here.
Installation
You can install the latest version of jaxagents from PyPI via:
pip install jaxagents
Content
So far, the project includes the following agents:
- Q-learning:
- Deep Q Networks (DQN)
- Double Deep Q Networks (DDQN)
- Categorical Deep Q Networks (often known as C51)
- Quantile Regression Deep Q Networks (QRDQN)
- Policy gradient:
- REINFORCE
- PPO with clipping and GAE
Background
Research and development in Reinforcement Learning can be computationally cumbersome. Utilizing JAX's high computational performance, Jaxagents provides a framework for applying and developing Reinforcement Learning agents that offers benefits in:
- computational speed
- easy control of random number generation
- hyperparameter optimization (via parallelized calculations)
Project details
Release history Release notifications | RSS feed
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
File details
Details for the file jaxagents-0.1.12.tar.gz
.
File metadata
- Download URL: jaxagents-0.1.12.tar.gz
- Upload date:
- Size: 37.5 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.4 CPython/3.10.11 Windows/10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 4b5e6743bca2837e60a9fe0d2b91c12011af55323faa33b1b24a3bfae12251dc |
|
MD5 | 35c3c9b0d7c883f3b6718d06e3645f6c |
|
BLAKE2b-256 | e808999f32b0a4e682a28515be5b8936f2e3f3033073bf3250da4ba18f51a0a6 |
File details
Details for the file jaxagents-0.1.12-py3-none-any.whl
.
File metadata
- Download URL: jaxagents-0.1.12-py3-none-any.whl
- Upload date:
- Size: 43.1 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: poetry/1.8.4 CPython/3.10.11 Windows/10
File hashes
Algorithm | Hash digest | |
---|---|---|
SHA256 | 6fe0e98c75a9b0b57aac6d1129b0fedae8917aca4313783684e5e30b2dbb0721 |
|
MD5 | b64668fed515d172c00d618f54ac2ba0 |
|
BLAKE2b-256 | 097caf3302b7cb9b0299d52f0a37cd0edd2f967386e6af14568fb233094dad84 |