Skip to main content

Yet Another Reinforcement Learning Library

Project description

Yet Another Reinforcement Learning Library (YARLL)

Codacy Badge

Update 25/03/2019: For now, the master branch won't get big changes. Instead, algorithms are adapted for TensorFlow 2 (and new ones may be added) on the TF2 branch.
Update 29/10/2018: New library name.
Update 25/10/2018: Added SAC implementation.

Status

Different algorithms have currently been implemented (in no particular order):

Asynchronous Advantage Actor Critic (A3C)

The code for this algorithm can be found here. Example run after training using 16 threads for a total of 5 million timesteps on the PongDeterministic-v4 environment:

Pong example run

How to run

First, install the library using pip (you can first remove OpenCV from the setup.py file if it is already installed):

pip install yarll

Algorithms/experiments

You can run algorithms by passing the path to an experiment specification (which is a file in json format) to main.py:

python -m yarll.main <path_to_experiment_specification>

Examples of experiment specifications can be found in the experiment_specs folder.

Statistics

Statistics can be plot using:

python -m yarll.misc.plot_statistics <path_to_stats>

<path_to_stats> can be one of 2 things:

  • A json file generated using gym.wrappers.Monitor, in case it plots the episode lengths and total reward per episode.
  • A directory containing TensorFlow scalar summaries for different tasks, in which case all of the found scalars are plot.

Help about other arguments (e.g. for using smoothing) can be found by executing python -m yarll.misc.plot_statistics -h.

Alternatively, it is also possible to use Tensorboard to show statistics in the browser by passing the directory with the scalar summaries as --logdir argument.

Project details


Download files

Download the file for your platform. If you're not sure which to choose, learn more about installing packages.

Source Distribution

yarll-0.0.12.tar.gz (55.6 kB view details)

Uploaded Source

Built Distribution

If you're not sure about the file name format, learn more about wheel file names.

yarll-0.0.12-py3-none-any.whl (83.7 kB view details)

Uploaded Python 3

File details

Details for the file yarll-0.0.12.tar.gz.

File metadata

  • Download URL: yarll-0.0.12.tar.gz
  • Upload date:
  • Size: 55.6 kB
  • Tags: Source
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.38.0 CPython/3.7.4

File hashes

Hashes for yarll-0.0.12.tar.gz
Algorithm Hash digest
SHA256 1abf7774e7a6ffc29363d559fa325363c70c579e0eea94613d2b00b76e587a3e
MD5 626899e56dcc7e9a44af5cffe92ad1a8
BLAKE2b-256 d78a3f01b1f168a803910af1a617696e82412512cc193e69a48691fa00dc4d29

See more details on using hashes here.

File details

Details for the file yarll-0.0.12-py3-none-any.whl.

File metadata

  • Download URL: yarll-0.0.12-py3-none-any.whl
  • Upload date:
  • Size: 83.7 kB
  • Tags: Python 3
  • Uploaded using Trusted Publishing? No
  • Uploaded via: twine/3.0.0 pkginfo/1.5.0.1 requests/2.22.0 setuptools/41.4.0 requests-toolbelt/0.9.1 tqdm/4.38.0 CPython/3.7.4

File hashes

Hashes for yarll-0.0.12-py3-none-any.whl
Algorithm Hash digest
SHA256 56d2b8b298798c5e30c453ad4fd480adb8c7a7de69418a359422b1559a05524d
MD5 3bc5bf76788eda00125b6d9b8a367357
BLAKE2b-256 049f1bcf8dd7d6a484ad19599b396b03df70841ca63de2082ea5d13e282959a5

See more details on using hashes here.

Supported by

AWS Cloud computing and Security Sponsor Datadog Monitoring Depot Continuous Integration Fastly CDN Google Download Analytics Pingdom Monitoring Sentry Error logging StatusPage Status page