Reinforcement Learning agents implemented in pytorch
Project description
prop
prop is a library of Reinforcment Learning agents implemented in pytorch.
Algorithms
Model | Policy | |
---|---|---|
DQN | Model-Free | Off-Policy |
A2C | Model-Free | On-Policy |
DQN
Deep Q-Learning is a variant of Q-learning with a deep neural network used for estimating Q-values (hence DQN; Deep Q-Network).
Both DQN and DDQN (Double DQN) are implemented.
A2C
Advantage Actor Critic is a variant of Actor-Critic that:
- Uses a neural network to approximate a policy and a value function.
- Computes the advantage of an action to scale the computed gradients. This acts as a vote of confidence (or skepticism) on actions produced by the actor.
Project details
Download files
Download the file for your platform. If you're not sure which to choose, learn more about installing packages.
Source Distribution
rlprop-0.0.4.tar.gz
(10.1 kB
view hashes)
Built Distribution
rlprop-0.0.4-py3-none-any.whl
(12.3 kB
view hashes)