prop

prop is a library of Reinforcement Learning agents implemented in PyTorch.

Agent	Model	Policy
DQN	Model-Free	Off-Policy
A2C	Model-Free	On-Policy

Deep Q-Learning is a variant of Q-learning that uses a deep neural network to estimate Q-values (hence DQN, Deep Q-Network).

Both DQN and DDQN (Double DQN) are implemented.
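
The difference between the two variants comes down to how the bootstrap target is built. The sketch below is a minimal illustration in plain Python, not prop's actual code: the function names `dqn_target` and `double_dqn_target`, the separate online and target Q-value lists, and the example numbers are all assumptions made for the example.

```python
def dqn_target(reward, next_q_target, gamma=0.99, done=False):
    """Standard DQN target: r + gamma * max_a Q_target(s', a)."""
    if done:
        # Terminal transition: there is no next state to bootstrap from.
        return reward
    return reward + gamma * max(next_q_target)


def double_dqn_target(reward, next_q_online, next_q_target, gamma=0.99, done=False):
    """Double DQN target: the online network picks the action,
    the target network evaluates it."""
    if done:
        return reward
    best_action = max(range(len(next_q_online)), key=lambda a: next_q_online[a])
    return reward + gamma * next_q_target[best_action]


# Hypothetical Q-value estimates for the next state (three actions).
next_q_online = [1.0, 2.0, 0.5]
next_q_target = [3.0, 1.5, 0.2]

dqn_y = dqn_target(1.0, next_q_target, gamma=0.5)
ddqn_y = double_dqn_target(1.0, next_q_online, next_q_target, gamma=0.5)
print(dqn_y, ddqn_y)
```

With these numbers the Double DQN target is lower than the DQN target, because the action is chosen by one set of estimates and scored by another. That decoupling is what Double DQN uses to reduce overestimation of Q-values.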

Advantage Actor Critic is a variant of Actor-Critic that:

Uses a neural network to approximate a policy and a value function.
Computes the advantage of an action to scale the computed gradients. This acts as a vote of confidence (or skepticism) on actions produced by the actor.
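
The two points above can be sketched for a single transition as follows. This is a plain-Python illustration under stated assumptions, not prop's implementation: the helper `a2c_losses` is hypothetical, the advantage is estimated with a one-step TD error (the library may use a different estimator), and the critic loss is taken to be the squared advantage.

```python
import math


def softmax(logits):
    """Turn raw actor outputs into action probabilities."""
    m = max(logits)  # subtract the max for numerical stability
    exps = [math.exp(x - m) for x in logits]
    total = sum(exps)
    return [e / total for e in exps]


def a2c_losses(logits, action, value, reward, next_value, gamma=0.99, done=False):
    """Return (advantage, actor_loss, critic_loss) for one transition.

    Assumption: advantage = r + gamma * V(s') - V(s), a one-step TD error.
    """
    probs = softmax(logits)
    target = reward if done else reward + gamma * next_value
    advantage = target - value
    # The advantage scales the log-probability of the chosen action.
    # In an autograd framework it would be treated as a constant here.
    actor_loss = -math.log(probs[action]) * advantage
    critic_loss = advantage ** 2
    return advantage, actor_loss, critic_loss


# The action turned out better than the critic expected: positive advantage.
good = a2c_losses([0.0, 0.0], action=0, value=1.0, reward=2.0, next_value=0.0, gamma=0.5)
# The same action where the critic expected more: negative advantage.
bad = a2c_losses([0.0, 0.0], action=0, value=3.0, reward=2.0, next_value=0.0, gamma=0.5)
print(good, bad)
```

A positive advantage gives a positive actor loss, so minimizing it raises the probability of that action. A negative advantage flips the sign and pushes the probability down.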


Version	Release date
0.0.4 (this version)	Jul 4, 2020
0.0.3	Jul 4, 2020
0.0.2	Jul 4, 2020
0.0.1	Jul 4, 2020

Uploaded Jul 4, 2020 Source

Uploaded Jul 4, 2020 Python 3

Hashes for rlprop-0.0.4-py3-none-any.whl
Algorithm	Hash digest
SHA256	`b488c8fae848404bb4f168c2ed83ab32a0ca73e7c87c4f0a530b5b556fde0c79`
MD5	`68e3651570b64c5ad2f302afd4b1672b`
BLAKE2b-256	`d8eba97bbb6a3522c068399b63b13adcc4b516bfceb0260c074e813e1765d5ce`