JAX implementation of Reinforcement Learning agents
Jaxagents
Jaxagents is a Python implementation of Reinforcement Learning agents built upon JAX.
Content
So far, the project includes the following agents:
- Q-learning:
  - Deep Q Networks (DQN)
  - Double Deep Q Networks (DDQN)
  - Categorical Deep Q Networks (often known as C51)
  - Quantile Regression Deep Q Networks (QRDQN)
- Policy gradient:
  - REINFORCE
  - PPO with clipping and GAE
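To make the last entry more concrete, the following is a minimal JAX sketch of Generalized Advantage Estimation (GAE) and the clipped PPO surrogate loss as they are commonly formulated. It is not taken from the Jaxagents code base: the function names `gae_advantages` and `ppo_clip_loss`, their signatures, and the default values of `gamma`, `lam` and `clip_eps` are assumptions made for illustration only.

```python
import jax
import jax.numpy as jnp


def gae_advantages(rewards, values, last_value, dones, gamma=0.99, lam=0.95):
    """Hypothetical helper: GAE over one trajectory of length T.

    rewards, values, dones have shape (T,); last_value is the value
    estimate of the state that follows the final step.
    """
    next_values = jnp.append(values[1:], last_value)
    not_done = 1.0 - dones
    # One-step TD errors; bootstrapping is switched off at episode ends.
    deltas = rewards + gamma * next_values * not_done - values

    def backward_step(next_advantage, inputs):
        delta, mask = inputs
        advantage = delta + gamma * lam * mask * next_advantage
        return advantage, advantage

    # Accumulate from the last step to the first; outputs keep time order.
    _, advantages = jax.lax.scan(
        backward_step, jnp.zeros(()), (deltas, not_done), reverse=True
    )
    return advantages


def ppo_clip_loss(new_log_probs, old_log_probs, advantages, clip_eps=0.2):
    """Hypothetical helper: clipped surrogate objective, negated for minimization."""
    ratio = jnp.exp(new_log_probs - old_log_probs)
    clipped = jnp.clip(ratio, 1.0 - clip_eps, 1.0 + clip_eps)
    return -jnp.mean(jnp.minimum(ratio * advantages, clipped * advantages))


# Toy trajectory of three steps that terminates at the last step.
rewards = jnp.array([1.0, 1.0, 1.0])
values = jnp.zeros(3)
dones = jnp.array([0.0, 0.0, 1.0])
advantages = gae_advantages(rewards, values, 0.0, dones)
loss = ppo_clip_loss(jnp.zeros(3), jnp.zeros(3), advantages)
```

Both functions are pure and operate on arrays only, so they can be wrapped in `jax.jit` or differentiated with `jax.grad` without modification.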
Background
Research and development in Reinforcement Learning can be computationally expensive. Building on JAX's high computational performance, Jaxagents provides a framework for applying and developing Reinforcement Learning agents, with benefits in:
- computational speed
- easy control of random number generation
- hyperparameter optimization (via parallelized calculations)
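The last two points can be illustrated with plain JAX, independently of the Jaxagents API. The sketch below uses explicit PRNG keys for reproducible randomness and `jax.vmap` to run several learning rates in parallel. The function `noisy_td_update` is a hypothetical toy stand-in for a training run, not part of Jaxagents, and the learning rates are arbitrary example values.

```python
import jax
import jax.numpy as jnp


def noisy_td_update(key, learning_rate, n_steps=50):
    """Toy stand-in for a training run: track a value of 1.0 from noisy targets."""

    def step(value, step_key):
        # Each step consumes its own key, so the noise is fully reproducible.
        target = 1.0 + 0.1 * jax.random.normal(step_key)
        value = value + learning_rate * (target - value)
        return value, None

    step_keys = jax.random.split(key, n_steps)
    final_value, _ = jax.lax.scan(step, jnp.zeros(()), step_keys)
    return final_value


# One root key determines every random number drawn below.
key = jax.random.PRNGKey(0)
learning_rates = jnp.array([0.01, 0.1, 0.5])
run_keys = jax.random.split(key, learning_rates.shape[0])

# vmap maps the run over (key, learning_rate) pairs; jit compiles the sweep.
sweep = jax.jit(jax.vmap(noisy_td_update))
final_values = sweep(run_keys, learning_rates)
```

Because randomness is passed in explicitly through keys, calling `sweep` again with the same `run_keys` yields the same results, and changing the root key gives an independent repetition of the whole sweep.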