Skip to main content

Reinforcement Learning agents implemented in pytorch

Project description

<abstractpaper>

prop

prop is a library of Reinforcment Learning agents implemented in pytorch.

Algorithms

Model Policy
DQN Model-Free Off-Policy
A2C Model-Free On-Policy

DQN

Deep Q-Learning is a variant of Q-learning with a deep neural network used for estimating Q-values (hence DQN; Deep Q-Network).

Both DQN and DDQN (Double DQN) are implemented.

A2C

Advantage Actor Critic is a variant of Actor-Critic that:

  • Uses a neural network to approximate a policy and a value function.
  • Computes the advantage of an action to scale the computed gradients. This acts as a vote of confidence (or skepticism) on actions produced by the actor.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

rlprop-0.0.4.tar.gz (10.1 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

rlprop-0.0.4-py3-none-any.whl (12.3 kB view details)

Uploaded Python 3

File details

Details for the file rlprop-0.0.4.tar.gz.

File metadata

  • Download URL: rlprop-0.0.4.tar.gz
  • Upload date:
  • Size: 10.1 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.1.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.2

File hashes

Hashes for rlprop-0.0.4.tar.gz
Algorithm Hash digest
SHA256 62420e10ade4c2d28b943b672536639b0c00121a8aba272a5adb3fe4e86a798b
MD5 115b18a2874fcddca33348d9ca9afced
BLAKE2b-256 afdb536c84e895570b746f92159801f4283ebb36954848c0dc569feefdfa013a

See more details on using hashes here.

File details

Details for the file rlprop-0.0.4-py3-none-any.whl.

File metadata

  • Download URL: rlprop-0.0.4-py3-none-any.whl
  • Upload date:
  • Size: 12.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.2.0 pkginfo/1.5.0.1 requests/2.24.0 setuptools/49.1.0 requests-toolbelt/0.9.1 tqdm/4.47.0 CPython/3.8.2

File hashes

Hashes for rlprop-0.0.4-py3-none-any.whl
Algorithm Hash digest
SHA256 b488c8fae848404bb4f168c2ed83ab32a0ca73e7c87c4f0a530b5b556fde0c79
MD5 68e3651570b64c5ad2f302afd4b1672b
BLAKE2b-256 d8eba97bbb6a3522c068399b63b13adcc4b516bfceb0260c074e813e1765d5ce

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page