Skip to main content

Unsupervised pre-training with PPG

Project description

Unsupervised On-Policy Reinforcement Learning

This work combines Active Pre-Training with an On-Policy algorithm, Phasic Policy Gradient.

Active Pre-Training

Is used to pre-train a model free algorithm before defining a downstream task. It calculates the reward based on an estimatie of the particle based entropy of states. This reduces the training time if you want to define various tasks - i.e. robots for a warehouse.

Phasic Policy Gradient

Improved Version of Proximal Policy Optimization, which uses auxiliary epochs to train shared representations between the policy and a value network.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

unsupervised-on-policy-0.1.1.tar.gz (16.5 kB view details)

Uploaded Source

Built Distribution

unsupervised_on_policy-0.1.1-py3-none-any.whl (40.0 kB view details)

Uploaded Python 3

File details

Details for the file unsupervised-on-policy-0.1.1.tar.gz.

File metadata

  • Download URL: unsupervised-on-policy-0.1.1.tar.gz
  • Upload date:
  • Size: 16.5 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.6 tqdm/4.62.2 importlib-metadata/4.8.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.8

File hashes

Hashes for unsupervised-on-policy-0.1.1.tar.gz
Algorithm Hash digest
SHA256 3a99f4be017fb5c7974c9834119b717c30cd52ca1dd35d00d8f19cca1e14a744
MD5 582eeb2302b6268a401969b5ea93f650
BLAKE2b-256 234d94295db7eab5e80f24f6294a827901bd239e20d097f2310f3fd0ee09f29b

See more details on using hashes here.

File details

Details for the file unsupervised_on_policy-0.1.1-py3-none-any.whl.

File metadata

  • Download URL: unsupervised_on_policy-0.1.1-py3-none-any.whl
  • Upload date:
  • Size: 40.0 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.8.0 pkginfo/1.8.2 readme-renderer/33.0 requests/2.26.0 requests-toolbelt/0.9.1 urllib3/1.26.6 tqdm/4.62.2 importlib-metadata/4.8.2 keyring/23.5.0 rfc3986/2.0.0 colorama/0.4.4 CPython/3.8.8

File hashes

Hashes for unsupervised_on_policy-0.1.1-py3-none-any.whl
Algorithm Hash digest
SHA256 fccccee5eb36886a0cea710d345b7870022b4071bc2d1ed27e796fbf5f2be06e
MD5 b5be7066c8f50a887a73183ac8cf04bc
BLAKE2b-256 1d2ddbbdd44f85145969cee8c858933cdab28dad12a007c74dcf221be185818a

See more details on using hashes here.

Supported by

AWS AWS Cloud computing and Security Sponsor Datadog Datadog Monitoring Fastly Fastly CDN Google Google Download Analytics Microsoft Microsoft PSF Sponsor Pingdom Pingdom Monitoring Sentry Sentry Error logging StatusPage StatusPage Status page