Reinforcement Learning agents implemented in pytorch
Project description
prop
prop is a library of Reinforcment Learning agents implemented in pytorch.
Algorithms
| Model | Policy | |
|---|---|---|
| DQN | Model-Free | Off-Policy |
| A2C | Model-Free | On-Policy |
DQN
Deep Q-Learning is a variant of Q-learning with a deep neural network used for estimating Q-values (hence DQN; Deep Q-Network).
Both DQN and DDQN (Double DQN) are implemented.
A2C
Advantage Actor Critic is a variant of Actor-Critic that:
- Uses a neural network to approximate a policy and a value function.
- Computes the advantage of an action to scale the computed gradients. This acts as a vote of confidence (or skepticism) on actions produced by the actor.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
Built Distribution
Filter files by name, interpreter, ABI, and platform.
If you're not sure about the file name format, learn more about wheel file names.
Copy a direct link to the current filters
File details
Details for the file rlprop-0.0.4.tar.gz.
File metadata
- Download URL: rlprop-0.0.4.tar.gz
- Upload date:
- Size: 10.1 kB
- Tags: Source
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.1.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
62420e10ade4c2d28b943b672536639b0c00121a8aba272a5adb3fe4e86a798b
|
|
| MD5 |
115b18a2874fcddca33348d9ca9afced
|
|
| BLAKE2b-256 |
afdb536c84e895570b746f92159801f4283ebb36954848c0dc569feefdfa013a
|
File details
Details for the file rlprop-0.0.4-py3-none-any.whl.
File metadata
- Download URL: rlprop-0.0.4-py3-none-any.whl
- Upload date:
- Size: 12.3 kB
- Tags: Python 3
- Uploaded using Trusted Publishing? No
- Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.1.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.2
File hashes
| Algorithm | Hash digest | |
|---|---|---|
| SHA256 |
b488c8fae848404bb4f168c2ed83ab32a0ca73e7c87c4f0a530b5b556fde0c79
|
|
| MD5 |
68e3651570b64c5ad2f302afd4b1672b
|
|
| BLAKE2b-256 |
d8eba97bbb6a3522c068399b63b13adcc4b516bfceb0260c074e813e1765d5ce
|