Skip to main content

Unsupervised pre-training with PPG

Project description

Unsupervised On-Policy Reinforcement Learning

This work combines Active Pre-Training with an On-Policy algorithm, Phasic Policy Gradient.

Active Pre-Training

Is used to pre-train a model free algorithm before defining a downstream task. It calculates the reward based on an estimatie of the particle based entropy of states. This reduces the training time if you want to define various tasks - i.e. robots for a warehouse.

Phasic Policy Gradient

Improved Version of Proximal Policy Optimization, which uses auxiliary epochs to train shared representations between the policy and a value network.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unsupervised-on-policy-0.1.2.tar.gz (16.5 kB view details)

Uploaded Source

Built Distribution

unsupervised_on_policy-0.1.2-py3-none-any.whl (40.1 kB view details)

Uploaded Python 3

File details

Details for the file unsupervised-on-policy-0.1.2.tar.gz.

File metadata

  • Download URL: unsupervised-on-policy-0.1.2.tar.gz
  • Upload date:
  • Size: 16.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.6 tqdm/4.62.2 importlib-metadata/4.8.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.8

File hashes

Hashes for unsupervised-on-policy-0.1.2.tar.gz
Algorithm Hash digest
SHA256 eebaa2e0c0c6d9647d60955ff64a8804b666ff0a40d9330b214f6d3c47fa2f2b
MD5 cc2690cc1b03fcf7d80facfaec0d9811
BLAKE2b-256 c65bd41ce1ba55cdf79c2680cd2883b18a843e5d096b64f7afae84f6db066cbf

See more details on using hashes here.

File details

Details for the file unsupervised_on_policy-0.1.2-py3-none-any.whl.

File metadata

  • Download URL: unsupervised_on_policy-0.1.2-py3-none-any.whl
  • Upload date:
  • Size: 40.1 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.6 tqdm/4.62.2 importlib-metadata/4.8.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.8

File hashes

Hashes for unsupervised_on_policy-0.1.2-py3-none-any.whl
Algorithm Hash digest
SHA256 a43aa28eaaf0c53de8e28fc1918b28002758732409dabd0f299e2ab1dc1ec3db
MD5 02d4ee76fc8e0be8c3cda0647f5ae0dd
BLAKE2b-256 594bdab164f48d54d2e539cc11aa5f22839386653311fbe2de064960ed5459e7

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page