Skip to main content

Unsupervised pre-training with PPG

Project description

Unsupervised On-Policy Reinforcement Learning

This work combines Active Pre-Training with an On-Policy algorithm, Phasic Policy Gradient.

Active Pre-Training

Is used to pre-train a model free algorithm before defining a downstream task. It calculates the reward based on an estimatie of the particle based entropy of states. This reduces the training time if you want to define various tasks - i.e. robots for a warehouse.

Phasic Policy Gradient

Improved Version of Proximal Policy Optimization, which uses auxiliary epochs to train shared representations between the policy and a value network.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unsupervised-on-policy-0.1.0.tar.gz (16.2 kB view details)

Uploaded Source

Built Distribution

unsupervised_on_policy-0.1.0-py3-none-any.whl (39.3 kB view details)

Uploaded Python 3

File details

Details for the file unsupervised-on-policy-0.1.0.tar.gz.

File metadata

  • Download URL: unsupervised-on-policy-0.1.0.tar.gz
  • Upload date:
  • Size: 16.2 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.6 tqdm/4.62.2 importlib-metadata/4.8.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.8

File hashes

Hashes for unsupervised-on-policy-0.1.0.tar.gz
Algorithm Hash digest
SHA256 560a404e1c7f8cb7a96f1c1ad2b2a78b72fc9e637888f4706df2af1e07e98e70
MD5 289312b1602174a66b534dea1fab2b46
BLAKE2b-256 49e5d2691b0c3370c46d259e1c84a4f996e85218ce72e52be92db3460fe728b1

See more details on using hashes here.

File details

Details for the file unsupervised_on_policy-0.1.0-py3-none-any.whl.

File metadata

  • Download URL: unsupervised_on_policy-0.1.0-py3-none-any.whl
  • Upload date:
  • Size: 39.3 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.6 tqdm/4.62.2 importlib-metadata/4.8.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.8

File hashes

Hashes for unsupervised_on_policy-0.1.0-py3-none-any.whl
Algorithm Hash digest
SHA256 6fd186daabf3357dd6d5cc9025d93fe4a13ff8e4c6e8130331106df3410b43a9
MD5 6520eefc67505ecea616a6577bca6622
BLAKE2b-256 7c806b92e84b9fa5ee0c3d0870c6baeb62cc501e70c938d5c2798e9c1c3d3217

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page